cgae-server / TODO.md
rb125
economy step function with temporal dynamics, snapshots, and ETH top-ups
42b28ae
# CGAE Development Checklist
## Phase 1: Complete CGAE Protocol (~4 commits, ~800 lines)
### Commit 1: Economy step() + temporal dynamics (~250 lines added to economy.py)
- [ ] `EconomySnapshot` dataclass
- [ ] `step()` β€” advance economy by one time step (decay, spot-audits, storage costs, expiry)
- [ ] `_take_snapshot()` + `export_state()`
- [ ] Test-ETH top-up mechanism (keeps agents solvent during simulation)
- [ ] Tests: step produces snapshots, top-ups work, insolvency suspends agents
**Verify:** `python3 -m pytest tests/ -q`
### Commit 2: Model configs + LLM agent (~440 lines)
- [ ] `models_config.py` β€” 11 contestants + 3 jury (Azure/Bedrock/Gemma)
- [ ] `llm_agent.py` β€” chat interface for Azure OpenAI, Azure AI Foundry, Bedrock Converse API
- [ ] Token tracking (input/output tokens, latency, cost)
- [ ] Test: agents instantiate with env vars
**Verify:** `python3 -c "from cgae_engine.models_config import CONTESTANT_MODELS, JURY_MODELS; print(f'{len(CONTESTANT_MODELS)} contestants, {len(JURY_MODELS)} jury')"`
### Commit 3: Synthetic runner (~500 lines)
- [ ] `server/runner.py` β€” full simulation loop with 5 strategy agents
- [ ] Metric tracking (safety, balances, contracts, tier distribution)
- [ ] Result export to JSON
- [ ] Test: 50-step simulation completes, safety > 0
**Verify:** `python3 -m server.runner --steps 50`
### Commit 4: Economy extensions β€” delegation + tier upgrades (~280 lines added to economy.py)
- [ ] `can_delegate()` β€” chain-level tier enforcement
- [ ] `request_tier_upgrade()` β€” scaling-gate upgrade flow
- [ ] `record_delegation()` β€” audit trail for delegated tasks
- [ ] `complete_contract()` with `verification_override` + `liability_agent_id`
- [ ] Tests: delegation blocked when chain tier insufficient, upgrades work
**Verify:** `python3 -m pytest tests/ -q`
---
## Phase 2: Real LLM Simulation (~3 commits, ~2700 lines)
### Commit 5: Framework clients + audit orchestrator (~1130 lines)
- [ ] `framework_clients.py` β€” CDCT/DDFT/EECT HTTP API callers
- [ ] `audit.py` β€” orchestrates all three frameworks, computes robustness vector
- [ ] Pre-computed score fallback when APIs unavailable
**Verify:** `python3 -c "from cgae_engine.audit import AuditOrchestrator; print('audit ok')"`
### Commit 6: Autonomous agent (~890 lines)
- [ ] `agents/autonomous.py` β€” EV/RAEV planning, accounting layer
- [ ] Strategy selection (growth, conservative, balanced)
- [ ] Self-verification before submission
**Verify:** `python3 -c "from agents.autonomous import AutonomousAgent; print('autonomous ok')"`
### Commit 7: Live runner (~1575 lines)
- [ ] `server/live_runner.py` β€” real LLM calls, jury verification, cost accounting
- [ ] Default robustness profiles per model
- [ ] Round-by-round execution with metric export
**Verify:** `python3 -m server.live_runner` (requires API keys in .env)
---
## Phase 3: ENS Certification (~2 commits, ~300 lines)
### Commit 8: ENS manager (~280 lines)
- [ ] `cgae_engine/ens.py` β€” create subnames on Sepolia, set/read text records
- [ ] Text records: cgae.tier, cgae.cc, cgae.er, cgae.as, cgae.ih, cgae.wallet, cgae.family
- [ ] Register all 11 agent subnames under cgaeprotocol.eth
**Verify:** `python3 -c "from cgae_engine.ens import ENSManager; ens = ENSManager(); print(ens.resolve_text('gpt-5-4.cgaeprotocol.eth', 'cgae.tier'))"`
### Commit 9: ENS-gated economy (~50 lines changed in economy.py)
- [ ] Wire ENS into `accept_contract()` β€” resolve tier from ENS before allowing
- [ ] Wire ENS into `register_agent()` β€” create subname on registration
- [ ] Wire ENS into `audit_agent()` β€” update text records on certification
- [ ] Test: agent without ENS identity rejected
**Verify:** `python3 -m pytest tests/ -q`
---
## Phase 4: 0G Integration (~3 commits, ~900 lines)
### Commit 10: Smart contracts (~600 lines Solidity + JS)
- [ ] `contracts/src/CGAERegistry.sol` β€” on-chain agent identity + gate function
- [ ] `contracts/src/CGAEEscrow.sol` β€” contract escrow + budget ceiling
- [ ] Hardhat config for 0G Galileo testnet
- [ ] Deploy script + deployed.json
**Verify:** `cd contracts && npx hardhat compile`
### Commit 11: 0G Storage + wallet (~500 lines)
- [ ] `storage/upload_to_0g.mjs` β€” Node.js 0G SDK uploader
- [ ] `storage/zg_store.py` β€” Python wrapper
- [ ] `cgae_engine/wallet.py` β€” per-agent ETH keypairs, treasury disbursements
- [ ] `cgae_engine/onchain.py` β€” write certifications to CGAERegistry
**Verify:** `python3 -c "from cgae_engine.wallet import WalletManager; wm = WalletManager(dry_run=True); w = wm.create_agent_wallet('test'); print(w.address)"`
### Commit 12: Wire 0G into audit pipeline (~50 lines changed)
- [ ] Audit certificates uploaded to 0G Storage after each assessment
- [ ] Merkle root hash stored on-chain via CGAERegistry.certify()
- [ ] On-chain bridge called after each certification
**Verify:** `python3 -c "from storage.zg_store import check_setup; print(check_setup())"`
---
## Phase 5: Dashboard (~3 commits)
### Commit 13: FastAPI backend (~60 lines)
- [ ] `dashboard-next/api.py` β€” serves economy data as JSON endpoints
**Verify:** `cd dashboard-next && uvicorn api:app --port 8000` then `curl localhost:8000/api/health`
### Commit 14: Next.js frontend (~400 lines)
- [ ] Dark ETH-native theme
- [ ] Overview tab (safety chart, earnings)
- [ ] Agents tab (ENS names, tiers, balances)
- [ ] Trades tab (expandable task details)
- [ ] On-chain tab (0G contracts + ENS registry)
**Verify:** `cd dashboard-next && npm run build`
### Commit 15: Polish + final README
- [ ] .env.example
- [ ] Full README with architecture, setup, design decisions
- [ ] Demo video link (when recorded)