Aubm Roadmap

This document outlines the strategic evolution of Aubm, moving from a robust orchestration core to an enterprise-ready multi-agent operating layer.

Autonomous Agent Execution: Multi-provider support (OpenAI, Groq, Gemini, etc.).
Project Orchestration: Intelligent task scheduling and dependency management (DAG).
Human-in-the-Loop: Approval and rejection workflows for agent outputs.
Semantic RAG: Contextual memory injection across project tasks.
Real-time Logs: Streaming agent thoughts and actions via SSE.
Cost Control: Token-based budgeting and execution blocking.
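
The dependency-aware scheduling and token budgeting listed above can be illustrated with a minimal sketch. It relies on Python's standard `graphlib.TopologicalSorter`; the task names, token costs, and the `plan_execution` helper are hypothetical and are not taken from Aubm's codebase.

```python
from graphlib import TopologicalSorter


def plan_execution(dependencies, token_costs, token_budget):
    """Split tasks into (runnable, blocked), preserving dependency order.

    dependencies maps each task to the set of tasks it depends on.
    A task is blocked when running it would exceed the token budget,
    or when any of its dependencies is already blocked.
    """
    order = TopologicalSorter(dependencies).static_order()
    spent = 0
    runnable, blocked = [], []
    for task in order:
        deps_blocked = any(dep in blocked for dep in dependencies.get(task, ()))
        cost = token_costs.get(task, 0)
        if deps_blocked or spent + cost > token_budget:
            blocked.append(task)
            continue
        spent += cost
        runnable.append(task)
    return runnable, blocked


# Hypothetical project: "report" depends on "research" and "pricing".
deps = {
    "research": set(),
    "pricing": {"research"},
    "report": {"research", "pricing"},
}
costs = {"research": 400, "pricing": 300, "report": 500}  # assumed token estimates

runnable, blocked = plan_execution(deps, costs, token_budget=800)
print(runnable, blocked)
```

With an 800-token budget, the first two tasks fit and `report` is blocked before it runs. A real scheduler would likely track actual usage per provider rather than fixed estimates.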

Multi-Agent Debates: Allow agents to cross-verify each other's outputs before human review.
Extended Toolbelt:
- Web Browser Tool (via Playwright) for live data fetching.
- Code Sandbox for executing and testing generated snippets.
- File Generation (Excel, Word, and advanced PDF layouts).
Collaborative Editing: Real-time collaborative refinement of agent outputs by human reviewers.
Mobile Experience: Capacitor-based mobile app for project monitoring (initialized).

Fine-tuning Loop: Feedback loop (Like/Dislike) implemented for data collection.
Recursive Project Decomposition: Agents that can spawn sub-tasks and manage them.
Enterprise Security:
- SSO Integration (Google, GitHub via Supabase).
- Advanced RLS for granular team permissions.
- Audit logs for every LLM interaction.
Agent Marketplace: Community-driven agent templates and specialized skill sets.

Self-Healing Infrastructure: Agents that can monitor health and apply safe patches.
Voice Interaction: Control navigation and hear project/task status updates via browser voice APIs.
VR/AR Dashboard: Spatial DAG viewer scaffold for layered project/task visualization.

Operations Monitoring: Backend health summary endpoint and frontend monitoring dashboard with Supabase fallback.
Deployment Hardening: Dockerized backend/runtime profile and production CORS configuration.
Error Tracking: Sentry-compatible error reporting hooks for backend and frontend.
Performance Budgeting: Frontend code splitting and bundle-size targets.

Recursive Project Decomposition: Agents that can automatically break down goals.
Numerical Consistency (Semantic Backprop): Enforce that absolute figures stay consistent across tasks.
Visual Tooling: Integrated support for charts and AI illustrations.
Vercel Deployment: Monorepo serverless configuration.
Heuristic Output Guardrails: Prompt hardening, reviewer checks, and final-report filtering for placeholders, unsupported claims, and low-quality sections.
Asynchronous Task Queue: Dedicated background workers (worker.py).
Vectorized Long-term Memory: Cross-project semantic retrieval.
Self-Optimizing Agents: Meta-prompting loops based on human feedback.
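
As a rough illustration of the asynchronous task queue item above, the following sketch runs a small pool of background workers over an in-process `asyncio.Queue`. It is not the contents of `worker.py`; the `worker` and `run_queue` functions, the worker names, and the task IDs are assumptions made for the example, and a production setup would more likely use a persistent queue.

```python
import asyncio


async def worker(name, queue, results):
    """Pull task IDs off the queue until the worker is cancelled."""
    while True:
        task_id = await queue.get()
        try:
            await asyncio.sleep(0)  # placeholder for the real agent call
            results.append((name, task_id))
        finally:
            # Always mark the item done so queue.join() can return.
            queue.task_done()


async def run_queue(task_ids, worker_count=2):
    """Process all task IDs with a fixed number of background workers."""
    queue = asyncio.Queue()
    results = []
    for task_id in task_ids:
        queue.put_nowait(task_id)

    workers = [
        asyncio.create_task(worker(f"worker-{i}", queue, results))
        for i in range(worker_count)
    ]
    await queue.join()  # wait until every queued task is processed

    # Workers loop forever, so cancel them once the queue is drained.
    for w in workers:
        w.cancel()
    await asyncio.gather(*workers, return_exceptions=True)
    return results


results = asyncio.run(run_queue(["t1", "t2", "t3"]))
print(results)
```

Each result records which worker handled which task, which is the kind of information a monitoring dashboard could surface.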

Strict JSON Task Schemas: Enforce structured outputs per task type instead of free-form text.
Mandatory source_url per Claim: Require evidence links for competitor, pricing, release, benchmark, and market claims.
Entity Normalization Layer: Canonicalize entity names, merge aliases, and separate direct competitors from adjacent tools before final reporting.
Semantic Deduplication: Collapse equivalent claims written differently across tasks.
Evidence-Aware Final Report: Build the final report from normalized entities and validated claims only.
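
The evidence pipeline described by the items above could look roughly like the sketch below: a structured claim record with a required `source_url`, alias-based entity normalization, deduplication, and a final list built only from validated claims. Everything except the `source_url` field name is hypothetical (the `Claim` fields, the `ALIASES` table, the helper names, and the sample data). Deduplication here is exact matching on normalized text, a simple stand-in for semantic deduplication, which would need embeddings or a similar similarity measure.

```python
from dataclasses import dataclass
from urllib.parse import urlparse

# Hypothetical alias table; a real one would be curated per project.
ALIASES = {"acme corp": "Acme", "acme inc.": "Acme", "acme": "Acme"}


@dataclass(frozen=True)
class Claim:
    entity: str
    claim_type: str  # e.g. "pricing", "benchmark", "release"
    statement: str
    source_url: str


def normalize_entity(name):
    """Map known aliases to one canonical entity name."""
    key = " ".join(name.lower().split())
    return ALIASES.get(key, name.strip())


def has_valid_source(claim):
    """Accept only claims that carry an http(s) evidence link."""
    parsed = urlparse(claim.source_url)
    return parsed.scheme in ("http", "https") and bool(parsed.netloc)


def build_report_claims(raw_claims):
    """Normalize, validate, and deduplicate raw claim dicts."""
    seen = set()
    accepted = []
    for raw in raw_claims:
        claim = Claim(
            entity=normalize_entity(raw["entity"]),
            claim_type=raw["claim_type"],
            statement=raw["statement"].strip(),
            source_url=raw.get("source_url", ""),
        )
        if not has_valid_source(claim):
            continue  # drop unsupported claims
        key = (claim.entity, claim.claim_type, claim.statement.lower())
        if key in seen:
            continue  # drop exact duplicates after normalization
        seen.add(key)
        accepted.append(claim)
    return accepted


# Hypothetical task outputs: two duplicates and one claim without evidence.
raw = [
    {"entity": "Acme Corp", "claim_type": "pricing",
     "statement": "Starter plan is billed monthly",
     "source_url": "https://example.com/pricing"},
    {"entity": "acme", "claim_type": "pricing",
     "statement": "starter plan is billed monthly ",
     "source_url": "https://example.com/pricing"},
    {"entity": "Widgetly", "claim_type": "benchmark",
     "statement": "Fastest in class", "source_url": ""},
]
claims = build_report_claims(raw)
print(claims)
```

Only one claim survives: the two Acme entries collapse into a single record, and the claim with no evidence link is filtered out before reporting.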

Last updated: May 6, 2026