Spaces:
Paused
Paused
| # CGAE Development Checklist | |
| ## Phase 1: Complete CGAE Protocol (~4 commits, ~800 lines) | |
| ### Commit 1: Economy step() + temporal dynamics (~250 lines added to economy.py) | |
| - [ ] `EconomySnapshot` dataclass | |
| - [ ] `step()` β advance economy by one time step (decay, spot-audits, storage costs, expiry) | |
| - [ ] `_take_snapshot()` + `export_state()` | |
| - [ ] Test-ETH top-up mechanism (keeps agents solvent during simulation) | |
| - [ ] Tests: step produces snapshots, top-ups work, insolvency suspends agents | |
| **Verify:** `python3 -m pytest tests/ -q` | |
| ### Commit 2: Model configs + LLM agent (~440 lines) | |
| - [ ] `models_config.py` β 11 contestants + 3 jury (Azure/Bedrock/Gemma) | |
| - [ ] `llm_agent.py` β chat interface for Azure OpenAI, Azure AI Foundry, Bedrock Converse API | |
| - [ ] Token tracking (input/output tokens, latency, cost) | |
| - [ ] Test: agents instantiate with env vars | |
| **Verify:** `python3 -c "from cgae_engine.models_config import CONTESTANT_MODELS, JURY_MODELS; print(f'{len(CONTESTANT_MODELS)} contestants, {len(JURY_MODELS)} jury')"` | |
| ### Commit 3: Synthetic runner (~500 lines) | |
| - [ ] `server/runner.py` β full simulation loop with 5 strategy agents | |
| - [ ] Metric tracking (safety, balances, contracts, tier distribution) | |
| - [ ] Result export to JSON | |
| - [ ] Test: 50-step simulation completes, safety > 0 | |
| **Verify:** `python3 -m server.runner --steps 50` | |
| ### Commit 4: Economy extensions β delegation + tier upgrades (~280 lines added to economy.py) | |
| - [ ] `can_delegate()` β chain-level tier enforcement | |
| - [ ] `request_tier_upgrade()` β scaling-gate upgrade flow | |
| - [ ] `record_delegation()` β audit trail for delegated tasks | |
| - [ ] `complete_contract()` with `verification_override` + `liability_agent_id` | |
| - [ ] Tests: delegation blocked when chain tier insufficient, upgrades work | |
| **Verify:** `python3 -m pytest tests/ -q` | |
| --- | |
| ## Phase 2: Real LLM Simulation (~3 commits, ~2700 lines) | |
| ### Commit 5: Framework clients + audit orchestrator (~1130 lines) | |
| - [ ] `framework_clients.py` β CDCT/DDFT/EECT HTTP API callers | |
| - [ ] `audit.py` β orchestrates all three frameworks, computes robustness vector | |
| - [ ] Pre-computed score fallback when APIs unavailable | |
| **Verify:** `python3 -c "from cgae_engine.audit import AuditOrchestrator; print('audit ok')"` | |
| ### Commit 6: Autonomous agent (~890 lines) | |
| - [ ] `agents/autonomous.py` β EV/RAEV planning, accounting layer | |
| - [ ] Strategy selection (growth, conservative, balanced) | |
| - [ ] Self-verification before submission | |
| **Verify:** `python3 -c "from agents.autonomous import AutonomousAgent; print('autonomous ok')"` | |
| ### Commit 7: Live runner (~1575 lines) | |
| - [ ] `server/live_runner.py` β real LLM calls, jury verification, cost accounting | |
| - [ ] Default robustness profiles per model | |
| - [ ] Round-by-round execution with metric export | |
| **Verify:** `python3 -m server.live_runner` (requires API keys in .env) | |
| --- | |
| ## Phase 3: ENS Certification (~2 commits, ~300 lines) | |
| ### Commit 8: ENS manager (~280 lines) | |
| - [ ] `cgae_engine/ens.py` β create subnames on Sepolia, set/read text records | |
| - [ ] Text records: cgae.tier, cgae.cc, cgae.er, cgae.as, cgae.ih, cgae.wallet, cgae.family | |
| - [ ] Register all 11 agent subnames under cgaeprotocol.eth | |
| **Verify:** `python3 -c "from cgae_engine.ens import ENSManager; ens = ENSManager(); print(ens.resolve_text('gpt-5-4.cgaeprotocol.eth', 'cgae.tier'))"` | |
| ### Commit 9: ENS-gated economy (~50 lines changed in economy.py) | |
| - [ ] Wire ENS into `accept_contract()` β resolve tier from ENS before allowing | |
| - [ ] Wire ENS into `register_agent()` β create subname on registration | |
| - [ ] Wire ENS into `audit_agent()` β update text records on certification | |
| - [ ] Test: agent without ENS identity rejected | |
| **Verify:** `python3 -m pytest tests/ -q` | |
| --- | |
| ## Phase 4: 0G Integration (~3 commits, ~900 lines) | |
| ### Commit 10: Smart contracts (~600 lines Solidity + JS) | |
| - [ ] `contracts/src/CGAERegistry.sol` β on-chain agent identity + gate function | |
| - [ ] `contracts/src/CGAEEscrow.sol` β contract escrow + budget ceiling | |
| - [ ] Hardhat config for 0G Galileo testnet | |
| - [ ] Deploy script + deployed.json | |
| **Verify:** `cd contracts && npx hardhat compile` | |
| ### Commit 11: 0G Storage + wallet (~500 lines) | |
| - [ ] `storage/upload_to_0g.mjs` β Node.js 0G SDK uploader | |
| - [ ] `storage/zg_store.py` β Python wrapper | |
| - [ ] `cgae_engine/wallet.py` β per-agent ETH keypairs, treasury disbursements | |
| - [ ] `cgae_engine/onchain.py` β write certifications to CGAERegistry | |
| **Verify:** `python3 -c "from cgae_engine.wallet import WalletManager; wm = WalletManager(dry_run=True); w = wm.create_agent_wallet('test'); print(w.address)"` | |
| ### Commit 12: Wire 0G into audit pipeline (~50 lines changed) | |
| - [ ] Audit certificates uploaded to 0G Storage after each assessment | |
| - [ ] Merkle root hash stored on-chain via CGAERegistry.certify() | |
| - [ ] On-chain bridge called after each certification | |
| **Verify:** `python3 -c "from storage.zg_store import check_setup; print(check_setup())"` | |
| --- | |
| ## Phase 5: Dashboard (~3 commits) | |
| ### Commit 13: FastAPI backend (~60 lines) | |
| - [ ] `dashboard-next/api.py` β serves economy data as JSON endpoints | |
| **Verify:** `cd dashboard-next && uvicorn api:app --port 8000` then `curl localhost:8000/api/health` | |
| ### Commit 14: Next.js frontend (~400 lines) | |
| - [ ] Dark ETH-native theme | |
| - [ ] Overview tab (safety chart, earnings) | |
| - [ ] Agents tab (ENS names, tiers, balances) | |
| - [ ] Trades tab (expandable task details) | |
| - [ ] On-chain tab (0G contracts + ENS registry) | |
| **Verify:** `cd dashboard-next && npm run build` | |
| ### Commit 15: Polish + final README | |
| - [ ] .env.example | |
| - [ ] Full README with architecture, setup, design decisions | |
| - [ ] Demo video link (when recorded) | |