File size: 5,695 Bytes
42b28ae
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
# CGAE Development Checklist

## Phase 1: Complete CGAE Protocol (~4 commits, ~800 lines)

### Commit 1: Economy step() + temporal dynamics (~250 lines added to economy.py)
- [ ] `EconomySnapshot` dataclass
- [ ] `step()` β€” advance economy by one time step (decay, spot-audits, storage costs, expiry)
- [ ] `_take_snapshot()` + `export_state()`
- [ ] Test-ETH top-up mechanism (keeps agents solvent during simulation)
- [ ] Tests: step produces snapshots, top-ups work, insolvency suspends agents

**Verify:** `python3 -m pytest tests/ -q`

### Commit 2: Model configs + LLM agent (~440 lines)
- [ ] `models_config.py` β€” 11 contestants + 3 jury (Azure/Bedrock/Gemma)
- [ ] `llm_agent.py` β€” chat interface for Azure OpenAI, Azure AI Foundry, Bedrock Converse API
- [ ] Token tracking (input/output tokens, latency, cost)
- [ ] Test: agents instantiate with env vars

**Verify:** `python3 -c "from cgae_engine.models_config import CONTESTANT_MODELS, JURY_MODELS; print(f'{len(CONTESTANT_MODELS)} contestants, {len(JURY_MODELS)} jury')"`

### Commit 3: Synthetic runner (~500 lines)
- [ ] `server/runner.py` β€” full simulation loop with 5 strategy agents
- [ ] Metric tracking (safety, balances, contracts, tier distribution)
- [ ] Result export to JSON
- [ ] Test: 50-step simulation completes, safety > 0

**Verify:** `python3 -m server.runner --steps 50`

### Commit 4: Economy extensions β€” delegation + tier upgrades (~280 lines added to economy.py)
- [ ] `can_delegate()` β€” chain-level tier enforcement
- [ ] `request_tier_upgrade()` β€” scaling-gate upgrade flow
- [ ] `record_delegation()` β€” audit trail for delegated tasks
- [ ] `complete_contract()` with `verification_override` + `liability_agent_id`
- [ ] Tests: delegation blocked when chain tier insufficient, upgrades work

**Verify:** `python3 -m pytest tests/ -q`

---

## Phase 2: Real LLM Simulation (~3 commits, ~2700 lines)

### Commit 5: Framework clients + audit orchestrator (~1130 lines)
- [ ] `framework_clients.py` β€” CDCT/DDFT/EECT HTTP API callers
- [ ] `audit.py` β€” orchestrates all three frameworks, computes robustness vector
- [ ] Pre-computed score fallback when APIs unavailable

**Verify:** `python3 -c "from cgae_engine.audit import AuditOrchestrator; print('audit ok')"`

### Commit 6: Autonomous agent (~890 lines)
- [ ] `agents/autonomous.py` β€” EV/RAEV planning, accounting layer
- [ ] Strategy selection (growth, conservative, balanced)
- [ ] Self-verification before submission

**Verify:** `python3 -c "from agents.autonomous import AutonomousAgent; print('autonomous ok')"`

### Commit 7: Live runner (~1575 lines)
- [ ] `server/live_runner.py` β€” real LLM calls, jury verification, cost accounting
- [ ] Default robustness profiles per model
- [ ] Round-by-round execution with metric export

**Verify:** `python3 -m server.live_runner` (requires API keys in .env)

---

## Phase 3: ENS Certification (~2 commits, ~300 lines)

### Commit 8: ENS manager (~280 lines)
- [ ] `cgae_engine/ens.py` β€” create subnames on Sepolia, set/read text records
- [ ] Text records: cgae.tier, cgae.cc, cgae.er, cgae.as, cgae.ih, cgae.wallet, cgae.family
- [ ] Register all 11 agent subnames under cgaeprotocol.eth

**Verify:** `python3 -c "from cgae_engine.ens import ENSManager; ens = ENSManager(); print(ens.resolve_text('gpt-5-4.cgaeprotocol.eth', 'cgae.tier'))"`

### Commit 9: ENS-gated economy (~50 lines changed in economy.py)
- [ ] Wire ENS into `accept_contract()` β€” resolve tier from ENS before allowing
- [ ] Wire ENS into `register_agent()` β€” create subname on registration
- [ ] Wire ENS into `audit_agent()` β€” update text records on certification
- [ ] Test: agent without ENS identity rejected

**Verify:** `python3 -m pytest tests/ -q`

---

## Phase 4: 0G Integration (~3 commits, ~900 lines)

### Commit 10: Smart contracts (~600 lines Solidity + JS)
- [ ] `contracts/src/CGAERegistry.sol` β€” on-chain agent identity + gate function
- [ ] `contracts/src/CGAEEscrow.sol` β€” contract escrow + budget ceiling
- [ ] Hardhat config for 0G Galileo testnet
- [ ] Deploy script + deployed.json

**Verify:** `cd contracts && npx hardhat compile`

### Commit 11: 0G Storage + wallet (~500 lines)
- [ ] `storage/upload_to_0g.mjs` β€” Node.js 0G SDK uploader
- [ ] `storage/zg_store.py` β€” Python wrapper
- [ ] `cgae_engine/wallet.py` β€” per-agent ETH keypairs, treasury disbursements
- [ ] `cgae_engine/onchain.py` β€” write certifications to CGAERegistry

**Verify:** `python3 -c "from cgae_engine.wallet import WalletManager; wm = WalletManager(dry_run=True); w = wm.create_agent_wallet('test'); print(w.address)"`

### Commit 12: Wire 0G into audit pipeline (~50 lines changed)
- [ ] Audit certificates uploaded to 0G Storage after each assessment
- [ ] Merkle root hash stored on-chain via CGAERegistry.certify()
- [ ] On-chain bridge called after each certification

**Verify:** `python3 -c "from storage.zg_store import check_setup; print(check_setup())"`

---

## Phase 5: Dashboard (~3 commits)

### Commit 13: FastAPI backend (~60 lines)
- [ ] `dashboard-next/api.py` β€” serves economy data as JSON endpoints

**Verify:** `cd dashboard-next && uvicorn api:app --port 8000` then `curl localhost:8000/api/health`

### Commit 14: Next.js frontend (~400 lines)
- [ ] Dark ETH-native theme
- [ ] Overview tab (safety chart, earnings)
- [ ] Agents tab (ENS names, tiers, balances)
- [ ] Trades tab (expandable task details)
- [ ] On-chain tab (0G contracts + ENS registry)

**Verify:** `cd dashboard-next && npm run build`

### Commit 15: Polish + final README
- [ ] .env.example
- [ ] Full README with architecture, setup, design decisions
- [ ] Demo video link (when recorded)