Sprint 12: CHANGELOG.md + v3 migration guide
Browse files- CHANGELOG.md +86 -0
CHANGELOG.md
ADDED
|
@@ -0,0 +1,86 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
# CHANGELOG
|
| 2 |
+
|
| 3 |
+
## v3.0.0a1 (2026-05-01)
|
| 4 |
+
|
| 5 |
+
### Added β Track A: Foundation
|
| 6 |
+
- **Canonical Event Runtime** (`purpose_agent/runtime/`)
|
| 7 |
+
- `PAEvent` frozen dataclass with 30 event kinds
|
| 8 |
+
- `EventBus` async pub/sub with backpressure, replay, lane isolation
|
| 9 |
+
- `Visibility` levels (public/internal/debug) β no raw CoT leaks
|
| 10 |
+
- `streaming_v3.py` AG-UI compatible stream adapters
|
| 11 |
+
- **Durable Execution** (`runtime/checkpoint.py`, `runtime/state.py`)
|
| 12 |
+
- `RunState` with node tracking, idempotency keys, serialization
|
| 13 |
+
- `Checkpointer` protocol + InMemory/JSONL/SQLite implementations
|
| 14 |
+
- Resume from crash, HITL pause/resume
|
| 15 |
+
- **Memory Homeostasis** (`memory_homeostasis.py`)
|
| 16 |
+
- `MemoryBudget` hard limits (active cards, injected tokens, per-kind caps)
|
| 17 |
+
- `MemoryArchive` append-only cold storage
|
| 18 |
+
- `ConsolidationEngine` (cluster β merge β compress β hibernate)
|
| 19 |
+
- `QFunctionRetriever` multi-signal ranking under token budget
|
| 20 |
+
|
| 21 |
+
### Added β Track B: Interoperability
|
| 22 |
+
- **Protocol Stack** (`purpose_agent/protocols/`)
|
| 23 |
+
- `MCPToolBridge` β MCP server registration, tool wrapping, security policy
|
| 24 |
+
- `A2AClient` + `AgentCard` β agent-to-agent delegation with circuit breaker
|
| 25 |
+
- `AGUIAdapter` β PAEvent β AG-UI frontend streaming
|
| 26 |
+
- `parse_agents_md()` β repo-local instruction parser with precedence
|
| 27 |
+
- **Quorum Coordinator** (`quorum.py`)
|
| 28 |
+
- Agreement β merge, disagreement β escalate, critical risk β HITL
|
| 29 |
+
- `CriticEnsemble` (correctness/safety/efficiency/alignment)
|
| 30 |
+
|
| 31 |
+
### Added β Track C: Intelligence
|
| 32 |
+
- **LLM Call Router** (`routing.py`)
|
| 33 |
+
- `TaskComplexityClassifier` (simple/moderate/complex/critical)
|
| 34 |
+
- `ModelSelector` local-first, smallest-capable
|
| 35 |
+
- Cost budget hard cap
|
| 36 |
+
- **MAS Generator** (`mas_generator.py`)
|
| 37 |
+
- `generate(use_case)` β complete system (agents + flow + tools + evals + routing)
|
| 38 |
+
- 5 templates: code, security, research, data, operations
|
| 39 |
+
- **Skill Genome** (`purpose_agent/skills/`)
|
| 40 |
+
- `SkillCard` versioned, testable, evolvable
|
| 41 |
+
- `SkillGenome` lineage tracking + rollback
|
| 42 |
+
- `SkillCI` immune scan β fitness test β promote/archive + mutation
|
| 43 |
+
|
| 44 |
+
### Added β Track D: Optimization
|
| 45 |
+
- **Capability Fingerprint** (`optimization/fingerprint.py`)
|
| 46 |
+
- Domains, tool motifs, action patterns, failure modes, latency/cost
|
| 47 |
+
- **Dataset Builder** (`optimization/dataset.py`)
|
| 48 |
+
- Trace β filtered examples with immune scan + dedup + train/val/test split
|
| 49 |
+
- **Prompt Pack** (`optimization/prompt_pack.py`)
|
| 50 |
+
- Token-budget-compliant optimization artifact (epigenetics first)
|
| 51 |
+
- **Shadow Evaluator** (`optimization/shadow_eval.py`)
|
| 52 |
+
- Candidate vs baseline comparison, promote/rollback decisions
|
| 53 |
+
- **Agentic Optimizer** (`optimization/optimizer.py`)
|
| 54 |
+
- Trend monitoring: improving β continue, plateau β optimize, degrading β rollback
|
| 55 |
+
- **Distillation Planning** (`optimization/lora_plan.py`)
|
| 56 |
+
- Dry-run plans (none/prompt_pack/lora/distill/pdq), no GPU required for planning
|
| 57 |
+
|
| 58 |
+
### Added β Track E: Hardening
|
| 59 |
+
- **Robustness Benchmark** (`benchmark_v3.py`)
|
| 60 |
+
- 35+ checks: core loop, tools, immune, memory, routing, skills, events, protocols
|
| 61 |
+
- Runs with mock backend in CI
|
| 62 |
+
|
| 63 |
+
### Backward Compatibility
|
| 64 |
+
- All v2.1.1 public imports still work
|
| 65 |
+
- `pa.purpose()`, `Team.run()`, `Spark`, `Flow`, `swarm`, `Council`, `Vault` unchanged
|
| 66 |
+
- Old names (`Agent`, `Graph`, `parallel`, `Conversation`, `KnowledgeStore`) are aliases
|
| 67 |
+
- `tests/compat/test_public_api_211.py` validates full API surface
|
| 68 |
+
|
| 69 |
+
---
|
| 70 |
+
|
| 71 |
+
## v2.1.1 (2026-04-30)
|
| 72 |
+
- Creative naming: Spark, Flow, swarm, Council, Vault
|
| 73 |
+
- 19/19 production test pass with Llama-3.3-70B
|
| 74 |
+
- Published on PyPI
|
| 75 |
+
|
| 76 |
+
## v2.0.0 (2026-04-29)
|
| 77 |
+
- V2 kernel: evidence-gated memory, immune system, RunMode, trace, compiler
|
| 78 |
+
- 13 research papers implemented
|
| 79 |
+
- 6 breakthroughs: self-improving critic, MoH, hindsight, evolution, adversarial
|
| 80 |
+
- TOML prompts, robust_parser, secure tools
|
| 81 |
+
- 119/119 tests pass
|
| 82 |
+
|
| 83 |
+
## v0.2.0 (2026-04-28)
|
| 84 |
+
- Initial release: Actor, Purpose Function, Experience Replay, Optimizer
|
| 85 |
+
- SLM-native backends, streaming, tools, multi-agent, HITL, evaluation
|
| 86 |
+
- Published on PyPI
|