Rohan03 commited on
Commit
fdbceb4
Β·
verified Β·
1 Parent(s): 60a22e4

Sprint 12: CHANGELOG.md + v3 migration guide

Browse files
Files changed (1) hide show
  1. CHANGELOG.md +86 -0
CHANGELOG.md ADDED
@@ -0,0 +1,86 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # CHANGELOG
2
+
3
+ ## v3.0.0a1 (2026-05-01)
4
+
5
+ ### Added β€” Track A: Foundation
6
+ - **Canonical Event Runtime** (`purpose_agent/runtime/`)
7
+ - `PAEvent` frozen dataclass with 30 event kinds
8
+ - `EventBus` async pub/sub with backpressure, replay, lane isolation
9
+ - `Visibility` levels (public/internal/debug) β€” no raw CoT leaks
10
+ - `streaming_v3.py` AG-UI compatible stream adapters
11
+ - **Durable Execution** (`runtime/checkpoint.py`, `runtime/state.py`)
12
+ - `RunState` with node tracking, idempotency keys, serialization
13
+ - `Checkpointer` protocol + InMemory/JSONL/SQLite implementations
14
+ - Resume from crash, HITL pause/resume
15
+ - **Memory Homeostasis** (`memory_homeostasis.py`)
16
+ - `MemoryBudget` hard limits (active cards, injected tokens, per-kind caps)
17
+ - `MemoryArchive` append-only cold storage
18
+ - `ConsolidationEngine` (cluster β†’ merge β†’ compress β†’ hibernate)
19
+ - `QFunctionRetriever` multi-signal ranking under token budget
20
+
21
+ ### Added β€” Track B: Interoperability
22
+ - **Protocol Stack** (`purpose_agent/protocols/`)
23
+ - `MCPToolBridge` β€” MCP server registration, tool wrapping, security policy
24
+ - `A2AClient` + `AgentCard` β€” agent-to-agent delegation with circuit breaker
25
+ - `AGUIAdapter` β€” PAEvent β†’ AG-UI frontend streaming
26
+ - `parse_agents_md()` β€” repo-local instruction parser with precedence
27
+ - **Quorum Coordinator** (`quorum.py`)
28
+ - Agreement β†’ merge, disagreement β†’ escalate, critical risk β†’ HITL
29
+ - `CriticEnsemble` (correctness/safety/efficiency/alignment)
30
+
31
+ ### Added β€” Track C: Intelligence
32
+ - **LLM Call Router** (`routing.py`)
33
+ - `TaskComplexityClassifier` (simple/moderate/complex/critical)
34
+ - `ModelSelector` local-first, smallest-capable
35
+ - Cost budget hard cap
36
+ - **MAS Generator** (`mas_generator.py`)
37
+ - `generate(use_case)` β†’ complete system (agents + flow + tools + evals + routing)
38
+ - 5 templates: code, security, research, data, operations
39
+ - **Skill Genome** (`purpose_agent/skills/`)
40
+ - `SkillCard` versioned, testable, evolvable
41
+ - `SkillGenome` lineage tracking + rollback
42
+ - `SkillCI` immune scan β†’ fitness test β†’ promote/archive + mutation
43
+
44
+ ### Added β€” Track D: Optimization
45
+ - **Capability Fingerprint** (`optimization/fingerprint.py`)
46
+ - Domains, tool motifs, action patterns, failure modes, latency/cost
47
+ - **Dataset Builder** (`optimization/dataset.py`)
48
+ - Trace β†’ filtered examples with immune scan + dedup + train/val/test split
49
+ - **Prompt Pack** (`optimization/prompt_pack.py`)
50
+ - Token-budget-compliant optimization artifact (epigenetics first)
51
+ - **Shadow Evaluator** (`optimization/shadow_eval.py`)
52
+ - Candidate vs baseline comparison, promote/rollback decisions
53
+ - **Agentic Optimizer** (`optimization/optimizer.py`)
54
+ - Trend monitoring: improving β†’ continue, plateau β†’ optimize, degrading β†’ rollback
55
+ - **Distillation Planning** (`optimization/lora_plan.py`)
56
+ - Dry-run plans (none/prompt_pack/lora/distill/pdq), no GPU required for planning
57
+
58
+ ### Added β€” Track E: Hardening
59
+ - **Robustness Benchmark** (`benchmark_v3.py`)
60
+ - 35+ checks: core loop, tools, immune, memory, routing, skills, events, protocols
61
+ - Runs with mock backend in CI
62
+
63
+ ### Backward Compatibility
64
+ - All v2.1.1 public imports still work
65
+ - `pa.purpose()`, `Team.run()`, `Spark`, `Flow`, `swarm`, `Council`, `Vault` unchanged
66
+ - Old names (`Agent`, `Graph`, `parallel`, `Conversation`, `KnowledgeStore`) are aliases
67
+ - `tests/compat/test_public_api_211.py` validates full API surface
68
+
69
+ ---
70
+
71
+ ## v2.1.1 (2026-04-30)
72
+ - Creative naming: Spark, Flow, swarm, Council, Vault
73
+ - 19/19 production test pass with Llama-3.3-70B
74
+ - Published on PyPI
75
+
76
+ ## v2.0.0 (2026-04-29)
77
+ - V2 kernel: evidence-gated memory, immune system, RunMode, trace, compiler
78
+ - 13 research papers implemented
79
+ - 6 breakthroughs: self-improving critic, MoH, hindsight, evolution, adversarial
80
+ - TOML prompts, robust_parser, secure tools
81
+ - 119/119 tests pass
82
+
83
+ ## v0.2.0 (2026-04-28)
84
+ - Initial release: Actor, Purpose Function, Experience Replay, Optimizer
85
+ - SLM-native backends, streaming, tools, multi-agent, HITL, evaluation
86
+ - Published on PyPI