Spaces:
Running
Running
Ashira Pitchayapakayakul commited on
Commit Β·
a49e89a
1
Parent(s): 266304a
docs: 100 must-have + 100 nice-to-have feature roadmap
Browse filesCategorized: pipeline, data, models, infra, observability, PRD, memory,
self-improvement, datasets, tools, security, multi-modal, CLI UX, cloud,
codebase intelligence, training flywheel, agents, performance, compliance.
Tracks current cadence + verified-working items from today's commits.
Living document β update as features ship.
- FEATURES.md +350 -0
FEATURES.md
ADDED
|
@@ -0,0 +1,350 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
# Surrogate-1 Feature Roadmap
|
| 2 |
+
|
| 3 |
+
**Updated**: 2026-04-28
|
| 4 |
+
**Status legend**: β
shipped β π§ in progress β β³ planned β π‘ idea
|
| 5 |
+
|
| 6 |
+
---
|
| 7 |
+
|
| 8 |
+
## π’ Already Shipped (Foundation)
|
| 9 |
+
|
| 10 |
+
### Pipeline (parallel orchestrate)
|
| 11 |
+
- β
6-stage chain: SA β [Architect β₯ QA-TDD] β DEV β [QA-Verify β₯ OPS] β Reviewer
|
| 12 |
+
- β
Direct LLM call (skip broken tool-loop)
|
| 13 |
+
- β
Marker-extraction β real code blocks β real files in cwd
|
| 14 |
+
- β
Auto-commit + git push on APPROVE
|
| 15 |
+
- β
12-rung LLM ladder (Cerebras / Groq / Gemini Γ 2 / Samba / GH Models / Chutes / OR Γ 2 / **HF Router Γ 4**)
|
| 16 |
+
|
| 17 |
+
### Data + Knowledge
|
| 18 |
+
- β
26 public datasets covering all SDLC domains
|
| 19 |
+
- β
Training-pair feedback loop (every stage β ~/.surrogate/training-pairs.jsonl β HF dataset every 3 min)
|
| 20 |
+
- β
Web research preamble (DDG search β context for PRD/orchestrate)
|
| 21 |
+
- β
Agentic crawler (URL frontier + visited stamps + BFS link discovery, 6 workers)
|
| 22 |
+
- β
Skill synthesis daemon (3-min cycles β ~/.surrogate/skills/{cat}/SKILL.md)
|
| 23 |
+
- β
Continuous scrape (8 workers, 5-30s cool-down)
|
| 24 |
+
|
| 25 |
+
### Models (Ollama on HF)
|
| 26 |
+
- β
qwen3-coder:30b-a3b (primary, 16GB MoE)
|
| 27 |
+
- β
devstral:24b (Mistral SWE-agent, 53.6% SWE-bench)
|
| 28 |
+
- β
qwen2.5-coder:14b (fallback)
|
| 29 |
+
- β
yi-coder:9b (128k context)
|
| 30 |
+
- β
nomic-embed-text (RAG embeddings)
|
| 31 |
+
|
| 32 |
+
### Agent Roster (19 SDLC experts)
|
| 33 |
+
- β
solution-architect, tech-architect (design)
|
| 34 |
+
- β
dev-frontend, dev-backend, dev-mobile, dev-fullstack, dev-database (impl)
|
| 35 |
+
- β
qa-engineer, qa-perf, qa-security (test)
|
| 36 |
+
- β
devops, sre, cloud-architect (infra)
|
| 37 |
+
- β
devsecops, cloud-security (security)
|
| 38 |
+
- β
data-engineer, ml-engineer (data/ML)
|
| 39 |
+
- β
tech-writer, reviewer (docs/gate)
|
| 40 |
+
|
| 41 |
+
### Infrastructure
|
| 42 |
+
- β
HF Space (CPU 16GB free) running 24/7
|
| 43 |
+
- β
/data persistent volume (state + logs + memory + skills + sessions + training-pairs)
|
| 44 |
+
- β
Backward-compat symlinks (~/.claude/* β ~/.surrogate/*)
|
| 45 |
+
- β
Mac CLI clean (20 essential files only, 118 daemons archived)
|
| 46 |
+
- β
Status server: /, /health, /logs/{name}, /logs-list
|
| 47 |
+
|
| 48 |
+
---
|
| 49 |
+
|
| 50 |
+
## π΄ Must-Have (next 30 days)
|
| 51 |
+
|
| 52 |
+
### Reliability + Observability
|
| 53 |
+
1. β³ Heartbeat alarm β Discord webhook if HF Space down >5 min
|
| 54 |
+
2. β³ Auto-retry on transient errors (provider 429/503 β wait + retry next rung)
|
| 55 |
+
3. β³ Cost meter per stage (tokens Γ $/1M, alert >$1/day)
|
| 56 |
+
4. β³ Regression test suite (run nightly: orchestrate test fixtures, expect APPROVE)
|
| 57 |
+
5. β³ Dataset upload deduplication (md5 of slice β skip if same as last)
|
| 58 |
+
6. β³ Token-pool health check (rotate to next when 429)
|
| 59 |
+
7. β³ Disk usage alert (>80% /data β cleanup oldest scrape state)
|
| 60 |
+
8. β³ Memory leak watchdog (kill daemon RSS >1.5GB, restart)
|
| 61 |
+
9. β³ Crash recovery (auto-resume cron loop on SIGCHLD)
|
| 62 |
+
10. β³ Snapshot scrape ledger to HF dataset weekly
|
| 63 |
+
|
| 64 |
+
### PRD + Project bootstrap
|
| 65 |
+
11. β³ Claude Projects-style PRD wizard (single description input β auto-extract β 1-3 follow-ups β PRD)
|
| 66 |
+
12. β³ PRD template library (web app / API / CLI / mobile / data pipeline / ML)
|
| 67 |
+
13. β³ Auto-detect existing repo β reverse-engineer surrogate.md
|
| 68 |
+
14. β³ PRD versioning (v1, v2 with diff)
|
| 69 |
+
15. β³ "Spec mode" β refine PRD interactively before any code
|
| 70 |
+
|
| 71 |
+
### Pipeline quality
|
| 72 |
+
16. β³ Self-critique loop (after dev: model A reviews model B output β re-dev if NEEDS-WORK)
|
| 73 |
+
17. β³ Regression test on touched files (re-run existing tests)
|
| 74 |
+
18. β³ Lint + type-check + security scan in pipeline (ruff, mypy, semgrep)
|
| 75 |
+
19. β³ Diff approval UI (show changes before commit, esp. yolo mode)
|
| 76 |
+
20. β³ Search-replace block edits (Aider-style, less risky than full rewrite)
|
| 77 |
+
|
| 78 |
+
### Domain expert routing
|
| 79 |
+
21. β³ Auto-route DEV stage to specialist (frontend/backend/mobile/iac) based on task keywords
|
| 80 |
+
22. β³ Multi-specialist parallel work (e.g., backend API + frontend UI in same task β spawn both)
|
| 81 |
+
23. β³ Specialist-specific eval (frontend agent β check WCAG; backend β check N+1)
|
| 82 |
+
|
| 83 |
+
### Memory + Context
|
| 84 |
+
24. β³ Episodic memory (last 50 sessions retrieval for similar tasks)
|
| 85 |
+
25. β³ Procedural memory (how-to library auto-generated from successful runs)
|
| 86 |
+
26. β³ Project context cache (surrogate.md + repo-map persisted across sessions)
|
| 87 |
+
27. β³ Cross-project pattern share (skill from project A β applicable to project B)
|
| 88 |
+
28. β³ Long-term retention (key decisions β ADR auto-generation)
|
| 89 |
+
|
| 90 |
+
### Self-improvement loop
|
| 91 |
+
29. β³ Reflexion lessons β injected into next-similar-task prompt
|
| 92 |
+
30. β³ Failed orchestrate β root-cause analysis β improvement queue
|
| 93 |
+
31. β³ Weekly LoRA fine-tune trigger (on accumulated training pairs, autotrain)
|
| 94 |
+
32. β³ A/B test prompts (variant A vs B, pick winner by APPROVE rate)
|
| 95 |
+
33. β³ Voyager-style skill crystallization (pattern repeated 3+ times β permanent skill)
|
| 96 |
+
|
| 97 |
+
### Datasets + Training
|
| 98 |
+
34. β³ SRE postmortem corpus (scrape danluu/post-mortems β ~600 incident β instruction-pair)
|
| 99 |
+
35. β³ AWS Well-Architected synthetic Q/A (PDFs β distilabel pipeline β 5k pairs)
|
| 100 |
+
36. β³ Internal axentx code β instruction pairs (commit messages + diffs)
|
| 101 |
+
37. οΏ½οΏ½οΏ½ Training pair quality scoring (filter low-quality before HF upload)
|
| 102 |
+
38. β³ DPO preference pairs from reviewer (chosen/rejected from REWORK cycles)
|
| 103 |
+
39. β³ Synthetic ADR generation (real OSS examples β expand via distilabel)
|
| 104 |
+
|
| 105 |
+
### Tools + Integrations
|
| 106 |
+
40. β³ MCP client support (Claude Desktop schema β connect external tools)
|
| 107 |
+
41. β³ ToolSearch lazy-load (don't blow context on full tool list)
|
| 108 |
+
42. β³ Constitutional Critic from ~/.surrogate/agents/roster.json (auto-load)
|
| 109 |
+
43. β³ Repo-map context (tree-sitter symbol graph β smarter file selection)
|
| 110 |
+
44. β³ Tool-call traces saved as training data (every tool use β pair)
|
| 111 |
+
|
| 112 |
+
### Security + Safety
|
| 113 |
+
45. β³ Secret-scan pre-commit hook (gitleaks integration)
|
| 114 |
+
46. β³ Rate limit per-IP (HF Space /chat endpoint)
|
| 115 |
+
47. β³ Allowlist/denylist for git push (don't push to main without flag)
|
| 116 |
+
48. β³ PII scrubber for training pairs (remove emails, IPs, names before upload)
|
| 117 |
+
49. β³ Sandbox tool execution (no rm -rf, no curl |sh, no destructive ops)
|
| 118 |
+
50. β³ Audit log for every orchestrate run (who/what/when/result)
|
| 119 |
+
|
| 120 |
+
### Multi-modal + I/O
|
| 121 |
+
51. β³ Voice input (Whisper transcribe β surrogate)
|
| 122 |
+
52. β³ Image input (architectural diagrams β analysis)
|
| 123 |
+
53. β³ Screen recording β video β tutorial agent
|
| 124 |
+
54. β³ Discord voice channel (TTS responses)
|
| 125 |
+
|
| 126 |
+
### CLI UX
|
| 127 |
+
55. β³ /resume <session-id> (continue past session)
|
| 128 |
+
56. β³ /diff (show pending changes before commit)
|
| 129 |
+
57. β³ /undo (rollback last orchestrate via git stash)
|
| 130 |
+
58. β³ /share (publish session as gist for review)
|
| 131 |
+
59. β³ Tab autocomplete for slash commands
|
| 132 |
+
60. β³ Cost-meter live in statusline (running $ this session)
|
| 133 |
+
|
| 134 |
+
### Cloud / multi-region
|
| 135 |
+
61. β³ Mirror to Cloudflare Workers AI (free tier backup)
|
| 136 |
+
62. β³ Egress whitelist for Discord on HF Pro tier
|
| 137 |
+
63. β³ HF Space upgrade auto-scale (when load > 80%)
|
| 138 |
+
64. β³ Backup strategy: weekly snapshot of /data β HF dataset
|
| 139 |
+
|
| 140 |
+
### Codebase intelligence
|
| 141 |
+
65. β³ Symbol search (tree-sitter index, not just text grep)
|
| 142 |
+
66. β³ Cross-file refactor (rename across project safely)
|
| 143 |
+
67. β³ Type-aware code completion (LSP integration)
|
| 144 |
+
68. β³ Dead code detection (vulture, ts-prune)
|
| 145 |
+
69. β³ Dependency graph viz (per-project)
|
| 146 |
+
|
| 147 |
+
### Training data flywheel
|
| 148 |
+
70. β³ Trace storage on HF (axentx/surrogate-1-traces dataset)
|
| 149 |
+
71. β³ Auto-tag training pairs by domain (frontend/backend/etc)
|
| 150 |
+
72. β³ Quality gate before training pair upload (β₯ N tokens, well-formed)
|
| 151 |
+
73. β³ Weekly eval on SWE-bench-Lite (track improvement)
|
| 152 |
+
74. β³ DPO data generation (REWORK cycles β preference pairs)
|
| 153 |
+
|
| 154 |
+
### Discord + notifications
|
| 155 |
+
75. β³ Discord webhook for every commit (axentx repo notifications)
|
| 156 |
+
76. β³ Daily digest webhook (commits + pairs + scrape stats)
|
| 157 |
+
77. β³ Failure alerts (orchestrate fail β ping)
|
| 158 |
+
78. β³ Slash commands `/orchestrate "task"` from Discord
|
| 159 |
+
|
| 160 |
+
### HF integrations
|
| 161 |
+
79. β³ TEI server (text-embeddings-inference) for RAG
|
| 162 |
+
80. β³ TGI server (text-generation-inference) for self-hosted LLM
|
| 163 |
+
81. β³ autotrain weekly LoRA on training pairs
|
| 164 |
+
82. β³ HF Inference Providers as primary (paid bypass)
|
| 165 |
+
83. β³ HF Spaces gradio UI (visualize chain status)
|
| 166 |
+
|
| 167 |
+
### Agent quality
|
| 168 |
+
84. β³ Specialist eval per agent (e.g., dev-backend on RealWorld benchmark)
|
| 169 |
+
85. β³ Multi-model consensus on critical decisions (architecture, security)
|
| 170 |
+
86. β³ Constitutional rules (no hard-coded secrets, validate input)
|
| 171 |
+
87. β³ Tool use tracking per agent (which tools each agent calls)
|
| 172 |
+
88. β³ Persona consistency check (review for tone/style mid-thread)
|
| 173 |
+
|
| 174 |
+
### Project management
|
| 175 |
+
89. β³ Burndown chart per surrogate.md plan
|
| 176 |
+
90. β³ Story-point estimation from PRD
|
| 177 |
+
91. β³ Auto-create GitHub issues from `- [ ]` plan items
|
| 178 |
+
92. β³ PR description auto-write from commit list
|
| 179 |
+
93. β³ Sprint retrospective auto-summary
|
| 180 |
+
|
| 181 |
+
### Performance
|
| 182 |
+
94. β³ Profile + optimize orchestrate cycle time (target < 90s p50)
|
| 183 |
+
95. β³ Streaming responses (LLM tokens flow live, don't wait for full)
|
| 184 |
+
96. β³ Local cache for repeated identical prompts
|
| 185 |
+
97. β³ Parallel model calls (race fastest-first, kill rest)
|
| 186 |
+
98. β³ Edge inference (qwen3-coder on Cerebras WaferScale via API)
|
| 187 |
+
|
| 188 |
+
### Compliance + Governance
|
| 189 |
+
99. β³ License audit per file generated (OSS license compatibility)
|
| 190 |
+
100. β³ Commit signing (gpg/sigstore)
|
| 191 |
+
|
| 192 |
+
---
|
| 193 |
+
|
| 194 |
+
## π‘ Nice-to-Have (future)
|
| 195 |
+
|
| 196 |
+
### Multi-agent collaboration
|
| 197 |
+
1. π‘ MoA (Mixture of Agents) β 3 LLMs propose, judge picks best
|
| 198 |
+
2. π‘ Debate mode (2 agents argue, third synthesizes)
|
| 199 |
+
3. π‘ Tournament-style code review (3 reviewers, majority verdict)
|
| 200 |
+
4. π‘ Hierarchical agents (manager β workers β reporter)
|
| 201 |
+
5. π‘ Autonomous research squad (3 agents split topics, merge findings)
|
| 202 |
+
|
| 203 |
+
### UI / UX
|
| 204 |
+
6. π‘ Web dashboard (real-time pipeline status, training pair count, model health)
|
| 205 |
+
7. π‘ VSCode extension (`surrogate /auto` from editor)
|
| 206 |
+
8. π‘ IntelliJ plugin
|
| 207 |
+
9. π‘ Mobile app (iOS/Android) for on-the-go orchestrate
|
| 208 |
+
10. π‘ Apple Watch glance (current task status)
|
| 209 |
+
|
| 210 |
+
### Voice + Audio
|
| 211 |
+
11. π‘ Whisper realtime transcription
|
| 212 |
+
12. π‘ ElevenLabs TTS for status reports
|
| 213 |
+
13. π‘ Daily audio briefing podcast
|
| 214 |
+
14. π‘ Voice clone of user for replies
|
| 215 |
+
|
| 216 |
+
### Visual
|
| 217 |
+
15. π‘ Architecture diagram auto-generation (mermaid β SVG)
|
| 218 |
+
16. π‘ Dependency graph live render
|
| 219 |
+
17. π‘ Heat map of code changes per file
|
| 220 |
+
18. π‘ 3D codebase visualization (gource-style)
|
| 221 |
+
|
| 222 |
+
### Integrations
|
| 223 |
+
19. π‘ Linear / Jira sync (pull tickets, update status)
|
| 224 |
+
20. π‘ Slack bot
|
| 225 |
+
21. π‘ Microsoft Teams bot
|
| 226 |
+
22. π‘ Notion sync (PRD β Notion page)
|
| 227 |
+
23. π‘ Figma plugin (design β code via DEV agent)
|
| 228 |
+
24. π‘ Storybook integration (component dev)
|
| 229 |
+
25. π‘ Sentry integration (errors β fix queue)
|
| 230 |
+
26. π‘ PagerDuty integration (incident β SRE agent)
|
| 231 |
+
27. π‘ GitHub Copilot bridge (delegate to Surrogate for complex)
|
| 232 |
+
28. π‘ Cursor IDE integration
|
| 233 |
+
|
| 234 |
+
### ML / Self-improvement
|
| 235 |
+
29. π‘ RLHF from APPROVE/REWORK signals
|
| 236 |
+
30. π‘ RLAIF (AI feedback on agent outputs)
|
| 237 |
+
31. π‘ Continual pre-training on axentx code corpus
|
| 238 |
+
32. π‘ Distillation (qwen-coder-30B β 7B for edge)
|
| 239 |
+
33. π‘ Quantization-aware fine-tuning
|
| 240 |
+
34. π‘ Speculative decoding for faster inference
|
| 241 |
+
35. π‘ Mixture-of-experts custom training
|
| 242 |
+
|
| 243 |
+
### Datasets
|
| 244 |
+
36. π‘ Real-time scrape of GitHub trending (every 1h)
|
| 245 |
+
37. π‘ Scrape Hacker News top stories daily
|
| 246 |
+
38. π‘ Scrape Reddit r/programming weekly
|
| 247 |
+
39. π‘ Scrape Twitter dev threads (X API tier 1 = $100/m, skip)
|
| 248 |
+
40. π‘ Curated YouTube transcripts (developer talks, RustConf, KubeCon)
|
| 249 |
+
41. π‘ Scrape arxiv-sanity for AI papers
|
| 250 |
+
42. π‘ Crawl AWS/GCP/Azure docs nightly
|
| 251 |
+
43. π‘ PR diff archive (axentx own PRs as training)
|
| 252 |
+
44. π‘ Stack Overflow accepted answers (dump filter)
|
| 253 |
+
45. π‘ GitHub issue resolutions (closed issue β PR linkage)
|
| 254 |
+
|
| 255 |
+
### Cloud / Deployment
|
| 256 |
+
46. π‘ Multi-region HF Spaces (ap-southeast + us-east + eu-west)
|
| 257 |
+
47. π‘ K8s deployment manifests (move beyond HF when scale demands)
|
| 258 |
+
48. π‘ Kubernetes operator for axentx orchestration
|
| 259 |
+
49. π‘ Lambda@Edge for global low-latency inference
|
| 260 |
+
50. π‘ IPFS publish of PRDs (decentralized)
|
| 261 |
+
|
| 262 |
+
### Privacy + Security
|
| 263 |
+
51. π‘ E2E encryption for Discord chat
|
| 264 |
+
52. π‘ Air-gapped mode (Mac-only, no cloud)
|
| 265 |
+
53. π‘ Federated learning (multiple users contribute, no central data)
|
| 266 |
+
54. π‘ Zero-knowledge proofs for code provenance
|
| 267 |
+
55. π‘ Confidential computing (Intel SGX) for sensitive code
|
| 268 |
+
56. π‘ GDPR compliance toolkit (PII scrub, right-to-delete)
|
| 269 |
+
57. π‘ SOC 2 Type II readiness checklist
|
| 270 |
+
58. π‘ ISO 27001 audit prep
|
| 271 |
+
|
| 272 |
+
### Specialty agents
|
| 273 |
+
59. π‘ Compiler engineer (LLVM, optimization passes)
|
| 274 |
+
60. π‘ Embedded systems (microcontroller code, real-time)
|
| 275 |
+
61. π‘ Game dev (Unity, Unreal, Godot)
|
| 276 |
+
62. π‘ Blockchain (Solidity, smart contracts, security)
|
| 277 |
+
63. π‘ Quantum computing (Qiskit, circuits)
|
| 278 |
+
64. π‘ Robotics (ROS, motion planning)
|
| 279 |
+
65. π‘ Bioinformatics (BLAST, sequence analysis)
|
| 280 |
+
66. π‘ Quantitative finance (backtesting, risk)
|
| 281 |
+
67. π‘ Climate modeling
|
| 282 |
+
68. π‘ Legal tech (contract review)
|
| 283 |
+
|
| 284 |
+
### Education
|
| 285 |
+
69. π‘ Teach mode (explain decisions step-by-step for learners)
|
| 286 |
+
70. π‘ Pair programming mode (turn-taking with user)
|
| 287 |
+
71. π‘ Code review school (annotated learning examples)
|
| 288 |
+
72. π‘ Daily challenge generator (LeetCode-style, personalized)
|
| 289 |
+
73. π‘ Concept explainer (DDD, hexagonal, CAP theorem on demand)
|
| 290 |
+
|
| 291 |
+
### Productivity
|
| 292 |
+
74. π‘ Calendar integration (block focus time when in flow)
|
| 293 |
+
75. π‘ Pomodoro mode
|
| 294 |
+
76. π‘ Energy/mood tracker (suggest break when fatigued)
|
| 295 |
+
77. π‘ Distraction blocker (no Twitter when Surrogate active)
|
| 296 |
+
78. π‘ Focus music generator (lo-fi via Suno API)
|
| 297 |
+
|
| 298 |
+
### Emerging tech
|
| 299 |
+
79. π‘ ASI safety guardrails (per Anthropic Constitutional AI)
|
| 300 |
+
80. π‘ World model simulation (test ideas in synth environment)
|
| 301 |
+
81. π‘ Causal reasoning (vs correlation)
|
| 302 |
+
82. π‘ Theorem prover integration (Lean, Coq for verified code)
|
| 303 |
+
83. π‘ Differential privacy in training
|
| 304 |
+
84. π‘ Explainable AI for code reviews
|
| 305 |
+
|
| 306 |
+
### Localization
|
| 307 |
+
85. π‘ Thai-native pipeline (ΰΉΰΈΰΉΰΈΰΉΰΈ₯ΰΈ° comments ΰΉΰΈΰΉΰΈΰΉΰΈΰΈ’)
|
| 308 |
+
86. π‘ Japanese, Korean, Chinese support
|
| 309 |
+
87. π‘ RTL languages (Arabic, Hebrew)
|
| 310 |
+
88. π‘ Local LLM Thai-fluent (typhoon, openthaigpt)
|
| 311 |
+
89. π‘ Cultural code review (idioms per locale)
|
| 312 |
+
|
| 313 |
+
### Marketing + community
|
| 314 |
+
90. π‘ Public Surrogate-1 demo Space (read-only)
|
| 315 |
+
91. π‘ Twitter bot posts daily Surrogate-1 wins
|
| 316 |
+
92. π‘ GitHub discussions for community
|
| 317 |
+
93. π‘ Discord server for users
|
| 318 |
+
94. π‘ Newsletter (weekly improvements)
|
| 319 |
+
95. π‘ Blog (axentx engineering)
|
| 320 |
+
|
| 321 |
+
### Speculative
|
| 322 |
+
96. π‘ Surrogate-2 (full local inference, no cloud dep)
|
| 323 |
+
97. π‘ Custom silicon (qwen-coder optimized FPGA)
|
| 324 |
+
98. π‘ BCI integration (Neuralink-style direct intent)
|
| 325 |
+
99. π‘ Physical robot (Boston Dynamics + Surrogate brain)
|
| 326 |
+
100. π‘ ASI alignment research collaboration
|
| 327 |
+
|
| 328 |
+
---
|
| 329 |
+
|
| 330 |
+
## Current Cadence (auto-running on HF)
|
| 331 |
+
|
| 332 |
+
| Task | Frequency | Status |
|
| 333 |
+
|---|---|---|
|
| 334 |
+
| Continuous scrape | 8 workers, 5-30s cool-down | β
|
|
| 335 |
+
| Agentic crawler | 6 workers, BFS frontier | β
|
|
| 336 |
+
| Skill synthesis | every 3 min | β
|
|
| 337 |
+
| surrogate-dev-loop | every 2 min | β
|
|
| 338 |
+
| work-queue producer | every 5 min | β
|
|
| 339 |
+
| training-pair push to HF | every 3 min | β
|
|
| 340 |
+
| auto-orchestrate-loop | every 20 min | β
|
|
| 341 |
+
| research-apply | every 30 min | β
|
|
| 342 |
+
| keyword tuner | every 60 min | β
|
|
| 343 |
+
| research-loop | every 6h | β
|
|
| 344 |
+
| dataset-enrich | every 12h | β
|
|
| 345 |
+
|
| 346 |
+
## Verified working (2026-04-28)
|
| 347 |
+
- 5 commits to HF dataset in 12 min (~4047 pairs uploaded)
|
| 348 |
+
- Pipeline produces real Python/Go code with DDD patterns
|
| 349 |
+
- Reviewer issues APPROVE / REWORK / REJECT verdicts
|
| 350 |
+
- Training feedback loop closing (every stage β HF)
|