Ashira Pitchayapakayakul committed on
Commit a49e89a · 1 Parent(s): 266304a

docs: 100 must-have + 100 nice-to-have feature roadmap

Categorized: pipeline, data, models, infra, observability, PRD, memory,
self-improvement, datasets, tools, security, multi-modal, CLI UX, cloud,
codebase intelligence, training flywheel, agents, performance, compliance.

Tracks current cadence + verified-working items from today's commits.
Living document; update as features ship.

Files changed (1)
  1. FEATURES.md +350 -0
FEATURES.md ADDED
@@ -0,0 +1,350 @@
+ # Surrogate-1 Feature Roadmap
+
+ **Updated**: 2026-04-28
+ **Status legend**: ✅ shipped │ 🚧 in progress │ ⏳ planned │ 💡 idea
+
+ ---
+
+ ## 🟢 Already Shipped (Foundation)
+
+ ### Pipeline (parallel orchestrate)
+ - ✅ 6-stage chain: SA → [Architect ∥ QA-TDD] → DEV → [QA-Verify ∥ OPS] → Reviewer
+ - ✅ Direct LLM call (skip broken tool-loop)
+ - ✅ Marker-extraction → real code blocks → real files in cwd
+ - ✅ Auto-commit + git push on APPROVE
+ - ✅ 12-rung LLM ladder (Cerebras / Groq / Gemini × 2 / Samba / GH Models / Chutes / OR × 2 / **HF Router × 4**)
+
+ ### Data + Knowledge
+ - ✅ 26 public datasets covering all SDLC domains
+ - ✅ Training-pair feedback loop (every stage → ~/.surrogate/training-pairs.jsonl → HF dataset every 3 min)
+ - ✅ Web research preamble (DDG search → context for PRD/orchestrate)
+ - ✅ Agentic crawler (URL frontier + visited stamps + BFS link discovery, 6 workers)
+ - ✅ Skill synthesis daemon (3-min cycles → ~/.surrogate/skills/{cat}/SKILL.md)
+ - ✅ Continuous scrape (8 workers, 5-30s cool-down)
+
+ ### Models (Ollama on HF)
+ - ✅ qwen3-coder:30b-a3b (primary, 16GB MoE)
+ - ✅ devstral:24b (Mistral SWE-agent, 53.6% on SWE-bench Verified)
+ - ✅ qwen2.5-coder:14b (fallback)
+ - ✅ yi-coder:9b (128k context)
+ - ✅ nomic-embed-text (RAG embeddings)
+
+ ### Agent Roster (19 SDLC experts)
+ - ✅ solution-architect, tech-architect (design)
+ - ✅ dev-frontend, dev-backend, dev-mobile, dev-fullstack, dev-database (impl)
+ - ✅ qa-engineer, qa-perf, qa-security (test)
+ - ✅ devops, sre, cloud-architect (infra)
+ - ✅ devsecops, cloud-security (security)
+ - ✅ data-engineer, ml-engineer (data/ML)
+ - ✅ tech-writer, reviewer (docs/gate)
+
+ ### Infrastructure
+ - ✅ HF Space (CPU 16GB free) running 24/7
+ - ✅ /data persistent volume (state + logs + memory + skills + sessions + training-pairs)
+ - ✅ Backward-compat symlinks (~/.claude/* → ~/.surrogate/*)
+ - ✅ Mac CLI clean (20 essential files only, 118 daemons archived)
+ - ✅ Status server: /, /health, /logs/{name}, /logs-list
+
+ ---
+
+ ## 🔴 Must-Have (next 30 days)
+
+ ### Reliability + Observability
+ 1. ⏳ Heartbeat alarm → Discord webhook if HF Space down >5 min
+ 2. ⏳ Auto-retry on transient errors (provider 429/503 → wait + retry next rung)
+ 3. ⏳ Cost meter per stage (tokens × $/1M, alert >$1/day)
+ 4. ⏳ Regression test suite (run nightly: orchestrate test fixtures, expect APPROVE)
+ 5. ⏳ Dataset upload deduplication (md5 of slice → skip if same as last)
+ 6. ⏳ Token-pool health check (rotate to next when 429)
+ 7. ⏳ Disk usage alert (>80% /data → cleanup oldest scrape state)
+ 8. ⏳ Memory leak watchdog (kill daemon RSS >1.5GB, restart)
+ 9. ⏳ Crash recovery (auto-resume cron loop on SIGCHLD)
+ 10. ⏳ Snapshot scrape ledger to HF dataset weekly
+
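Item 2 in the Reliability list (on a provider 429/503, wait and retry on the next rung) could be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the project's implementation: `ProviderError`, `call_ladder`, the `wait_seconds` parameter, and the idea that each rung is a plain callable taking a prompt are all hypothetical names introduced for the example.

```python
import time
from typing import Callable, Optional, Sequence

# Statuses the roadmap item treats as transient.
TRANSIENT_STATUSES = {429, 503}


class ProviderError(Exception):
    """Hypothetical error type that carries the provider's HTTP status."""

    def __init__(self, status: int, message: str = "") -> None:
        super().__init__(f"{status} {message}".strip())
        self.status = status


def call_ladder(
    rungs: Sequence[Callable[[str], str]],
    prompt: str,
    wait_seconds: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Try each rung in order; on 429/503, wait and fall through to the next."""
    last_error: Optional[ProviderError] = None
    for rung in rungs:
        try:
            return rung(prompt)
        except ProviderError as err:
            if err.status not in TRANSIENT_STATUSES:
                raise  # non-transient errors are surfaced, not retried
            last_error = err
            sleep(wait_seconds)
    raise RuntimeError("all rungs exhausted") from last_error
```

Passing `sleep` as a parameter keeps the function testable without real delays; a production version would more likely use backoff with jitter than a fixed wait.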
+ ### PRD + Project bootstrap
+ 11. ⏳ Claude Projects-style PRD wizard (single description input → auto-extract → 1-3 follow-ups → PRD)
+ 12. ⏳ PRD template library (web app / API / CLI / mobile / data pipeline / ML)
+ 13. ⏳ Auto-detect existing repo → reverse-engineer surrogate.md
+ 14. ⏳ PRD versioning (v1, v2 with diff)
+ 15. ⏳ "Spec mode": refine PRD interactively before any code
+
+ ### Pipeline quality
+ 16. ⏳ Self-critique loop (after dev: model A reviews model B output → re-dev if NEEDS-WORK)
+ 17. ⏳ Regression test on touched files (re-run existing tests)
+ 18. ⏳ Lint + type-check + security scan in pipeline (ruff, mypy, semgrep)
+ 19. ⏳ Diff approval UI (show changes before commit, esp. yolo mode)
+ 20. ⏳ Search-replace block edits (Aider-style, less risky than full rewrite)
+
+ ### Domain expert routing
+ 21. ⏳ Auto-route DEV stage to specialist (frontend/backend/mobile/iac) based on task keywords
+ 22. ⏳ Multi-specialist parallel work (e.g., backend API + frontend UI in same task → spawn both)
+ 23. ⏳ Specialist-specific eval (frontend agent → check WCAG; backend → check N+1)
+
+ ### Memory + Context
+ 24. ⏳ Episodic memory (last 50 sessions retrieval for similar tasks)
+ 25. ⏳ Procedural memory (how-to library auto-generated from successful runs)
+ 26. ⏳ Project context cache (surrogate.md + repo-map persisted across sessions)
+ 27. ⏳ Cross-project pattern share (skill from project A → applicable to project B)
+ 28. ⏳ Long-term retention (key decisions → ADR auto-generation)
+
+ ### Self-improvement loop
+ 29. ⏳ Reflexion lessons → injected into next-similar-task prompt
+ 30. ⏳ Failed orchestrate → root-cause analysis → improvement queue
+ 31. ⏳ Weekly LoRA fine-tune trigger (on accumulated training pairs, autotrain)
+ 32. ⏳ A/B test prompts (variant A vs B, pick winner by APPROVE rate)
+ 33. ⏳ Voyager-style skill crystallization (pattern repeated 3+ times → permanent skill)
+
+ ### Datasets + Training
+ 34. ⏳ SRE postmortem corpus (scrape danluu/post-mortems → ~600 incidents → instruction pairs)
+ 35. ⏳ AWS Well-Architected synthetic Q/A (PDFs → distilabel pipeline → 5k pairs)
+ 36. ⏳ Internal axentx code → instruction pairs (commit messages + diffs)
+ 37. ⏳ Training pair quality scoring (filter low-quality before HF upload)
+ 38. ⏳ DPO preference pairs from reviewer (chosen/rejected from REWORK cycles)
+ 39. ⏳ Synthetic ADR generation (real OSS examples → expand via distilabel)
+
+ ### Tools + Integrations
+ 40. ⏳ MCP client support (Claude Desktop schema, to connect external tools)
+ 41. ⏳ ToolSearch lazy-load (don't blow context on full tool list)
+ 42. ⏳ Constitutional Critic from ~/.surrogate/agents/roster.json (auto-load)
+ 43. ⏳ Repo-map context (tree-sitter symbol graph → smarter file selection)
+ 44. ⏳ Tool-call traces saved as training data (every tool use → pair)
+
+ ### Security + Safety
+ 45. ⏳ Secret-scan pre-commit hook (gitleaks integration)
+ 46. ⏳ Rate limit per-IP (HF Space /chat endpoint)
+ 47. ⏳ Allowlist/denylist for git push (don't push to main without flag)
+ 48. ⏳ PII scrubber for training pairs (remove emails, IPs, names before upload)
+ 49. ⏳ Sandbox tool execution (no rm -rf, no curl | sh, no destructive ops)
+ 50. ⏳ Audit log for every orchestrate run (who/what/when/result)
+
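A first pass at the PII scrubber in item 48 might look like the sketch below. It is only an assumption about how such a scrubber could work: the regexes, the `<EMAIL>` and `<IP>` placeholders, and the `scrub_pii` name are hypothetical. It covers emails and IPv4 addresses only; names cannot be caught reliably with regexes and would need something like a named-entity recognizer.

```python
import re

# Deliberately simple patterns; real-world email and IP matching has more edge cases.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
IPV4_RE = re.compile(r"\b(?:\d{1,3}\.){3}\d{1,3}\b")


def scrub_pii(text: str) -> str:
    """Replace email addresses and IPv4-looking strings with placeholders."""
    text = EMAIL_RE.sub("<EMAIL>", text)
    return IPV4_RE.sub("<IP>", text)


if __name__ == "__main__":
    print(scrub_pii("mail bob@example.com from 10.0.0.1"))
```

The IPv4 pattern also matches four-part version strings such as `1.2.3.4`, so a stricter check may be needed if training pairs contain many of those.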
+ ### Multi-modal + I/O
+ 51. ⏳ Voice input (Whisper transcribe → surrogate)
+ 52. ⏳ Image input (architectural diagrams → analysis)
+ 53. ⏳ Screen recording → video → tutorial agent
+ 54. ⏳ Discord voice channel (TTS responses)
+
+ ### CLI UX
+ 55. ⏳ /resume <session-id> (continue past session)
+ 56. ⏳ /diff (show pending changes before commit)
+ 57. ⏳ /undo (rollback last orchestrate via git stash)
+ 58. ⏳ /share (publish session as gist for review)
+ 59. ⏳ Tab autocomplete for slash commands
+ 60. ⏳ Cost-meter live in statusline (running $ this session)
+
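The cost meter in items 3 and 60 reduces to the arithmetic `tokens × $/1M`, summed per stage and compared against the $1/day alert threshold. A minimal sketch, assuming a single flat price for all providers (a simplification, since real rungs are priced differently); `CostMeter` and its method names are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class CostMeter:
    usd_per_million_tokens: float          # assumed flat price across providers
    daily_alert_usd: float = 1.0           # the ">$1/day" threshold from item 3
    tokens_by_stage: Dict[str, int] = field(default_factory=dict)

    def record(self, stage: str, tokens: int) -> None:
        """Accumulate token usage for one pipeline stage."""
        self.tokens_by_stage[stage] = self.tokens_by_stage.get(stage, 0) + tokens

    def stage_cost(self, stage: str) -> float:
        """Cost in USD for one stage: tokens × $/1M."""
        return self.tokens_by_stage.get(stage, 0) * self.usd_per_million_tokens / 1_000_000

    def total_cost(self) -> float:
        """Cost in USD across all recorded stages."""
        total_tokens = sum(self.tokens_by_stage.values())
        return total_tokens * self.usd_per_million_tokens / 1_000_000

    def over_budget(self) -> bool:
        """True once the running total exceeds the alert threshold."""
        return self.total_cost() > self.daily_alert_usd

    def statusline(self) -> str:
        """Short string suitable for a CLI statusline."""
        return f"${self.total_cost():.4f}"
```

For example, recording 250,000 tokens each for two stages at $2 per million tokens gives a total of exactly $1.00, which does not yet trip the alert because the threshold is strictly "greater than".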
+ ### Cloud / multi-region
+ 61. ⏳ Mirror to Cloudflare Workers AI (free tier backup)
+ 62. ⏳ Egress whitelist for Discord on HF Pro tier
+ 63. ⏳ HF Space upgrade auto-scale (when load > 80%)
+ 64. ⏳ Backup strategy: weekly snapshot of /data → HF dataset
+
+ ### Codebase intelligence
+ 65. ⏳ Symbol search (tree-sitter index, not just text grep)
+ 66. ⏳ Cross-file refactor (rename across project safely)
+ 67. ⏳ Type-aware code completion (LSP integration)
+ 68. ⏳ Dead code detection (vulture, ts-prune)
+ 69. ⏳ Dependency graph viz (per-project)
+
+ ### Training data flywheel
+ 70. ⏳ Trace storage on HF (axentx/surrogate-1-traces dataset)
+ 71. ⏳ Auto-tag training pairs by domain (frontend/backend/etc)
+ 72. ⏳ Quality gate before training pair upload (≥ N tokens, well-formed)
+ 73. ⏳ Weekly eval on SWE-bench-Lite (track improvement)
+ 74. ⏳ DPO data generation (REWORK cycles → preference pairs)
+
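The quality gate in item 72 (at least N tokens, well-formed) could be approximated per JSONL line as sketched below. The roadmap does not specify the pair schema or the tokenizer, so the `prompt` and `response` field names, the whitespace-based token count, and the default `min_tokens` value are all assumptions made for illustration.

```python
import json


def passes_quality_gate(line: str, min_tokens: int = 20) -> bool:
    """Return True if a JSONL line is a well-formed pair with a long-enough response."""
    try:
        pair = json.loads(line)
    except json.JSONDecodeError:
        return False  # not valid JSON, so not well-formed
    if not isinstance(pair, dict):
        return False
    prompt, response = pair.get("prompt"), pair.get("response")
    if not (isinstance(prompt, str) and isinstance(response, str)):
        return False
    if not prompt.strip() or not response.strip():
        return False
    # Whitespace split is a rough stand-in for a real tokenizer.
    return len(response.split()) >= min_tokens
```

A filter such as `[l for l in lines if passes_quality_gate(l)]` could then run over the pending slice before each upload.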
+ ### Discord + notifications
+ 75. ⏳ Discord webhook for every commit (axentx repo notifications)
+ 76. ⏳ Daily digest webhook (commits + pairs + scrape stats)
+ 77. ⏳ Failure alerts (orchestrate fail → ping)
+ 78. ⏳ Slash commands `/orchestrate "task"` from Discord
+
+ ### HF integrations
+ 79. ⏳ TEI server (text-embeddings-inference) for RAG
+ 80. ⏳ TGI server (text-generation-inference) for self-hosted LLM
+ 81. ⏳ autotrain weekly LoRA on training pairs
+ 82. ⏳ HF Inference Providers as primary (paid bypass)
+ 83. ⏳ HF Spaces gradio UI (visualize chain status)
+
+ ### Agent quality
+ 84. ⏳ Specialist eval per agent (e.g., dev-backend on RealWorld benchmark)
+ 85. ⏳ Multi-model consensus on critical decisions (architecture, security)
+ 86. ⏳ Constitutional rules (no hard-coded secrets, validate input)
+ 87. ⏳ Tool use tracking per agent (which tools each agent calls)
+ 88. ⏳ Persona consistency check (review for tone/style mid-thread)
+
+ ### Project management
+ 89. ⏳ Burndown chart per surrogate.md plan
+ 90. ⏳ Story-point estimation from PRD
+ 91. ⏳ Auto-create GitHub issues from `- [ ]` plan items
+ 92. ⏳ PR description auto-write from commit list
+ 93. ⏳ Sprint retrospective auto-summary
+
+ ### Performance
+ 94. ⏳ Profile + optimize orchestrate cycle time (target < 90s p50)
+ 95. ⏳ Streaming responses (LLM tokens flow live, don't wait for the full response)
+ 96. ⏳ Local cache for repeated identical prompts
+ 97. ⏳ Parallel model calls (race fastest-first, kill rest)
+ 98. ⏳ Edge inference (qwen3-coder on Cerebras WaferScale via API)
+
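Item 97 (race parallel model calls, take the fastest, kill the rest) maps naturally onto `asyncio.wait` with `FIRST_COMPLETED`. The sketch below is one possible shape, not the project's code: `race_first_success` is a hypothetical name, and the choice to skip failed calls and keep waiting for a successful one is an assumption beyond what the roadmap states.

```python
import asyncio
from typing import Awaitable, Callable, Optional, Sequence


async def race_first_success(calls: Sequence[Callable[[], Awaitable[str]]]) -> str:
    """Start all calls at once, return the first successful result, cancel the rest."""
    pending = {asyncio.ensure_future(call()) for call in calls}
    last_error: Optional[BaseException] = None
    try:
        while pending:
            done, pending = await asyncio.wait(
                pending, return_when=asyncio.FIRST_COMPLETED
            )
            for task in done:
                if task.exception() is None:
                    return task.result()
                last_error = task.exception()  # remember the failure, keep racing
        raise RuntimeError("every call failed") from last_error
    finally:
        # "Kill rest": cancel whatever is still running when we leave.
        for task in pending:
            task.cancel()
```

Cancelling a task only stops the local coroutine; whether the remote provider stops generating (and billing) depends on the provider's API, so the cost impact of racing is worth measuring before enabling it by default.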
+ ### Compliance + Governance
+ 99. ⏳ License audit per file generated (OSS license compatibility)
+ 100. ⏳ Commit signing (gpg/sigstore)
+
+ ---
+
+ ## 💡 Nice-to-Have (future)
+
+ ### Multi-agent collaboration
+ 1. 💡 MoA (Mixture of Agents): 3 LLMs propose, judge picks best
+ 2. 💡 Debate mode (2 agents argue, third synthesizes)
+ 3. 💡 Tournament-style code review (3 reviewers, majority verdict)
+ 4. 💡 Hierarchical agents (manager → workers → reporter)
+ 5. 💡 Autonomous research squad (3 agents split topics, merge findings)
+
+ ### UI / UX
+ 6. 💡 Web dashboard (real-time pipeline status, training pair count, model health)
+ 7. 💡 VS Code extension (`surrogate /auto` from editor)
+ 8. 💡 IntelliJ plugin
+ 9. 💡 Mobile app (iOS/Android) for on-the-go orchestrate
+ 10. 💡 Apple Watch glance (current task status)
+
+ ### Voice + Audio
+ 11. 💡 Whisper realtime transcription
+ 12. 💡 ElevenLabs TTS for status reports
+ 13. 💡 Daily audio briefing podcast
+ 14. 💡 Voice clone of user for replies
+
+ ### Visual
+ 15. 💡 Architecture diagram auto-generation (mermaid → SVG)
+ 16. 💡 Dependency graph live render
+ 17. 💡 Heat map of code changes per file
+ 18. 💡 3D codebase visualization (gource-style)
+
+ ### Integrations
+ 19. 💡 Linear / Jira sync (pull tickets, update status)
+ 20. 💡 Slack bot
+ 21. 💡 Microsoft Teams bot
+ 22. 💡 Notion sync (PRD ↔ Notion page)
+ 23. 💡 Figma plugin (design → code via DEV agent)
+ 24. 💡 Storybook integration (component dev)
+ 25. 💡 Sentry integration (errors → fix queue)
+ 26. 💡 PagerDuty integration (incident → SRE agent)
+ 27. 💡 GitHub Copilot bridge (delegate to Surrogate for complex tasks)
+ 28. 💡 Cursor IDE integration
+
+ ### ML / Self-improvement
+ 29. 💡 RLHF from APPROVE/REWORK signals
+ 30. 💡 RLAIF (AI feedback on agent outputs)
+ 31. 💡 Continual pre-training on axentx code corpus
+ 32. 💡 Distillation (qwen-coder-30B → 7B for edge)
+ 33. 💡 Quantization-aware fine-tuning
+ 34. 💡 Speculative decoding for faster inference
+ 35. 💡 Mixture-of-experts custom training
+
+ ### Datasets
+ 36. 💡 Real-time scrape of GitHub trending (every 1h)
+ 37. 💡 Scrape Hacker News top stories daily
+ 38. 💡 Scrape Reddit r/programming weekly
+ 39. 💡 Scrape Twitter dev threads (X API tier 1 = $100/month, skip)
+ 40. 💡 Curated YouTube transcripts (developer talks, RustConf, KubeCon)
+ 41. 💡 Scrape arxiv-sanity for AI papers
+ 42. 💡 Crawl AWS/GCP/Azure docs nightly
+ 43. 💡 PR diff archive (axentx own PRs as training)
+ 44. 💡 Stack Overflow accepted answers (dump filter)
+ 45. 💡 GitHub issue resolutions (closed issue → PR linkage)
+
+ ### Cloud / Deployment
+ 46. 💡 Multi-region HF Spaces (ap-southeast + us-east + eu-west)
+ 47. 💡 K8s deployment manifests (move beyond HF when scale demands)
+ 48. 💡 Kubernetes operator for axentx orchestration
+ 49. 💡 Lambda@Edge for global low-latency inference
+ 50. 💡 IPFS publish of PRDs (decentralized)
+
+ ### Privacy + Security
+ 51. 💡 E2E encryption for Discord chat
+ 52. 💡 Air-gapped mode (Mac-only, no cloud)
+ 53. 💡 Federated learning (multiple users contribute, no central data)
+ 54. 💡 Zero-knowledge proofs for code provenance
+ 55. 💡 Confidential computing (Intel SGX) for sensitive code
+ 56. 💡 GDPR compliance toolkit (PII scrub, right-to-delete)
+ 57. 💡 SOC 2 Type II readiness checklist
+ 58. 💡 ISO 27001 audit prep
+
+ ### Specialty agents
+ 59. 💡 Compiler engineer (LLVM, optimization passes)
+ 60. 💡 Embedded systems (microcontroller code, real-time)
+ 61. 💡 Game dev (Unity, Unreal, Godot)
+ 62. 💡 Blockchain (Solidity, smart contracts, security)
+ 63. 💡 Quantum computing (Qiskit, circuits)
+ 64. 💡 Robotics (ROS, motion planning)
+ 65. 💡 Bioinformatics (BLAST, sequence analysis)
+ 66. 💡 Quantitative finance (backtesting, risk)
+ 67. 💡 Climate modeling
+ 68. 💡 Legal tech (contract review)
+
+ ### Education
+ 69. 💡 Teach mode (explain decisions step-by-step for learners)
+ 70. 💡 Pair programming mode (turn-taking with user)
+ 71. 💡 Code review school (annotated learning examples)
+ 72. 💡 Daily challenge generator (LeetCode-style, personalized)
+ 73. 💡 Concept explainer (DDD, hexagonal, CAP theorem on demand)
+
+ ### Productivity
+ 74. 💡 Calendar integration (block focus time when in flow)
+ 75. 💡 Pomodoro mode
+ 76. 💡 Energy/mood tracker (suggest break when fatigued)
+ 77. 💡 Distraction blocker (no Twitter when Surrogate active)
+ 78. 💡 Focus music generator (lo-fi via Suno API)
+
+ ### Emerging tech
+ 79. 💡 ASI safety guardrails (per Anthropic Constitutional AI)
+ 80. 💡 World model simulation (test ideas in synth environment)
+ 81. 💡 Causal reasoning (vs correlation)
+ 82. 💡 Theorem prover integration (Lean, Coq for verified code)
+ 83. 💡 Differential privacy in training
+ 84. 💡 Explainable AI for code reviews
+
+ ### Localization
+ 85. 💡 Thai-native pipeline (code and comments in Thai)
+ 86. 💡 Japanese, Korean, Chinese support
+ 87. 💡 RTL languages (Arabic, Hebrew)
+ 88. 💡 Local LLM Thai-fluent (typhoon, openthaigpt)
+ 89. 💡 Cultural code review (idioms per locale)
+
+ ### Marketing + community
+ 90. 💡 Public Surrogate-1 demo Space (read-only)
+ 91. 💡 Twitter bot posts daily Surrogate-1 wins
+ 92. 💡 GitHub Discussions for community
+ 93. 💡 Discord server for users
+ 94. 💡 Newsletter (weekly improvements)
+ 95. 💡 Blog (axentx engineering)
+
+ ### Speculative
+ 96. 💡 Surrogate-2 (full local inference, no cloud dep)
+ 97. 💡 Custom silicon (qwen-coder optimized FPGA)
+ 98. 💡 BCI integration (Neuralink-style direct intent)
+ 99. 💡 Physical robot (Boston Dynamics + Surrogate brain)
+ 100. 💡 ASI alignment research collaboration
+
+ ---
+
+ ## Current Cadence (auto-running on HF)
+
+ | Task | Frequency | Status |
+ |---|---|---|
+ | Continuous scrape | 8 workers, 5-30s cool-down | ✅ |
+ | Agentic crawler | 6 workers, BFS frontier | ✅ |
+ | Skill synthesis | every 3 min | ✅ |
+ | surrogate-dev-loop | every 2 min | ✅ |
+ | work-queue producer | every 5 min | ✅ |
+ | training-pair push to HF | every 3 min | ✅ |
+ | auto-orchestrate-loop | every 20 min | ✅ |
+ | research-apply | every 30 min | ✅ |
+ | keyword tuner | every 60 min | ✅ |
+ | research-loop | every 6h | ✅ |
+ | dataset-enrich | every 12h | ✅ |
+
+ ## Verified working (2026-04-28)
+ - 5 commits to HF dataset in 12 min (~4047 pairs uploaded)
+ - Pipeline produces real Python/Go code with DDD patterns
+ - Reviewer issues APPROVE / REWORK / REJECT verdicts
+ - Training feedback loop closing (every stage → HF)