# Aubm Roadmap

This document outlines the strategic evolution of Aubm, moving from a robust orchestration core to an enterprise-ready multi-agent operating layer.

## Phase 1: Core Foundation (Completed)
- [x] Autonomous Agent Execution: Multi-provider support (OpenAI, Groq, Gemini, etc.).
- [x] Project Orchestration: Intelligent task scheduling and dependency management (DAG).
- [x] Human-in-the-Loop: Approval and rejection workflows for agent outputs.
- [x] Semantic RAG: Contextual memory injection across project tasks.
- [x] Real-time Logs: Streaming agent thoughts and actions via SSE.
- [x] Cost Control: Token-based budgeting and execution blocking.
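The dependency-aware scheduling listed under Project Orchestration can be illustrated with a minimal sketch. It is not Aubm's actual scheduler: the function `execution_batches` and the task names are hypothetical, and it assumes each task simply declares the set of tasks it depends on. It uses the standard-library `graphlib` module to group tasks into batches that could run in parallel.

```python
from graphlib import TopologicalSorter


def execution_batches(dependencies: dict[str, set[str]]) -> list[list[str]]:
    """Group tasks into batches whose dependencies are all already completed.

    `dependencies` maps a task name to the set of task names it depends on.
    """
    sorter = TopologicalSorter(dependencies)
    sorter.prepare()  # raises graphlib.CycleError if the graph has a cycle
    batches: list[list[str]] = []
    while sorter.is_active():
        ready = sorted(sorter.get_ready())  # sorted only for stable output
        batches.append(ready)
        sorter.done(*ready)  # unlocks tasks that were waiting on this batch
    return batches


# Hypothetical project: two analysis tasks depend on research,
# and the report depends on both analyses.
tasks = {
    "research": set(),
    "pricing": {"research"},
    "competitors": {"research"},
    "report": {"pricing", "competitors"},
}
print(execution_batches(tasks))
# [['research'], ['competitors', 'pricing'], ['report']]
```

Tasks within one batch have no dependencies on each other, so a scheduler along these lines could dispatch them to agents concurrently and wait for the batch to finish before moving on.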

## Phase 2: Advanced Collaboration and Tools (Completed)
- [x] Multi-Agent Debates: Allow agents to cross-verify each other's outputs before human review.
- [x] Extended Toolbelt:
  - [x] Web Browser Tool (via Playwright) for live data fetching.
  - [x] Code Sandbox for executing and testing generated snippets.
  - [x] File Generation (Excel, Word, and advanced PDF layouts).
- [x] Collaborative Editing: Real-time collaborative refinement of agent outputs by human reviewers.
- [x] Mobile Experience: Capacitor-based mobile app for project monitoring (initialized).

## Phase 3: Intelligence and Scale (Completed)
- [x] Fine-tuning Loop: Like/Dislike feedback captured to collect fine-tuning data.
- [x] Recursive Project Decomposition: Agents that can spawn sub-tasks and manage them.
- [x] Enterprise Security:
  - [x] SSO Integration (Google, GitHub via Supabase).
  - [x] Advanced RLS for granular team permissions.
  - [x] Audit logs for every LLM interaction.
- [x] Agent Marketplace: Community-driven agent templates and specialized skill sets.

## Phase 4: Autonomy and Beyond (Completed)
- [x] Self-Healing Infrastructure: Agents that can monitor health and apply safe patches.
- [x] Voice Interaction: Control navigation and hear project/task status updates via browser voice APIs.
- [x] VR/AR Dashboard: Spatial DAG viewer scaffold for layered project/task visualization.

## Phase 5: Production Operations (Completed)
- [x] Operations Monitoring: Backend health summary endpoint and frontend monitoring dashboard with Supabase fallback.
- [x] Deployment Hardening: Dockerized backend/runtime profile and production CORS configuration.
- [x] Error Tracking: Sentry-compatible error reporting hooks for backend and frontend.
- [x] Performance Budgeting: Frontend code splitting and bundle-size targets.

## Phase 6: Distributed Scale and Intelligence (In Progress)
- [x] Recursive Project Decomposition: Agents that can automatically break down goals.
- [x] Numerical Consistency (Semantic Backprop): Enforce consistent absolute figures across tasks.
- [x] Visual Tooling: Integrated support for charts and AI illustrations.
- [x] Vercel Deployment: Monorepo serverless configuration.
- [x] Heuristic Output Guardrails: Prompt hardening, reviewer checks, and final-report filtering for placeholders, unsupported claims, and low-quality sections.
- [ ] Asynchronous Task Queue: Dedicated background workers (`worker.py`).
- [ ] Vectorized Long-term Memory: Cross-project semantic retrieval.
- [ ] Self-Optimizing Agents: Meta-prompting loops based on human feedback.
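As a rough illustration of the planned Asynchronous Task Queue, the sketch below shows the worker pattern using only the standard-library `asyncio.Queue`. It is an in-process stand-in under stated assumptions, not the contents of `worker.py`: the names `worker` and `run_queue` are hypothetical, the agent execution is a placeholder, and a dedicated background worker would more likely sit behind a persistent broker.

```python
import asyncio


async def worker(name: str, queue: asyncio.Queue, results: list) -> None:
    """Pull task ids off the queue until cancelled."""
    while True:
        task_id = await queue.get()
        try:
            await asyncio.sleep(0)  # placeholder for real agent execution
            results.append((name, task_id))
        finally:
            queue.task_done()  # lets queue.join() know this item is finished


async def run_queue(task_ids: list[str], worker_count: int = 2) -> list:
    """Process every task id with a small pool of concurrent workers."""
    queue: asyncio.Queue = asyncio.Queue()
    results: list = []
    for task_id in task_ids:
        queue.put_nowait(task_id)

    workers = [
        asyncio.create_task(worker(f"worker-{i}", queue, results))
        for i in range(worker_count)
    ]
    await queue.join()  # wait until every queued task is marked done
    for w in workers:
        w.cancel()  # workers loop forever, so stop them explicitly
    await asyncio.gather(*workers, return_exceptions=True)
    return results


done = asyncio.run(run_queue(["t1", "t2", "t3"]))
print(sorted(task_id for _, task_id in done))
# ['t1', 't2', 't3']
```

The point of the pattern is that task submission returns immediately while execution happens in the worker pool, which is what moving agent runs out of the request path would require.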

## Phase 7: Structured Evidence and Entity Integrity (Next)
- [ ] Strict JSON Task Schemas: Enforce structured outputs per task type instead of free-form text.
- [ ] Mandatory `source_url` per Claim: Require evidence links for competitor, pricing, release, benchmark, and market claims.
- [ ] Entity Normalization Layer: Canonicalize entity names, merge aliases, and separate direct competitors from adjacent tools before final reporting.
- [ ] Semantic Deduplication: Collapse equivalent claims written differently across tasks.
- [ ] Evidence-Aware Final Report: Build the final report from normalized entities and validated claims only.
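The Phase 7 items can be read as one pipeline: validate each claim against a schema, require a `source_url`, canonicalize the entity, drop duplicates, and report only what survives. The sketch below is one possible shape for that pipeline, not a committed design. The `Claim` fields other than `source_url`, the `ALIASES` table, the entity names, and all function names are hypothetical. Deduplication here is exact matching after normalization, a deliberately simpler stand-in for the semantic deduplication the roadmap describes.

```python
from dataclasses import dataclass
from urllib.parse import urlparse

# Hypothetical alias table: lowercase alias -> canonical entity name.
ALIASES = {"acme ai": "Acme", "acme inc.": "Acme"}


@dataclass(frozen=True)
class Claim:
    entity: str
    statement: str
    source_url: str


def canonical_entity(name: str) -> str:
    """Map known aliases to one canonical name; leave unknown names as-is."""
    key = " ".join(name.lower().split())
    return ALIASES.get(key, name.strip())


def parse_claim(raw: dict) -> Claim:
    """Validate a raw claim (string fields assumed) and normalize its entity."""
    missing = [f for f in ("entity", "statement", "source_url") if not raw.get(f)]
    if missing:
        raise ValueError(f"missing required fields: {missing}")
    parsed = urlparse(raw["source_url"])
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"source_url is not an http(s) URL: {raw['source_url']!r}")
    return Claim(canonical_entity(raw["entity"]), raw["statement"].strip(), raw["source_url"])


def validated_claims(raw_claims: list[dict]) -> list[Claim]:
    """Keep only valid claims, dropping exact duplicates after normalization."""
    kept: list[Claim] = []
    seen: set[tuple[str, str]] = set()
    for raw in raw_claims:
        try:
            claim = parse_claim(raw)
        except ValueError:
            continue  # unsupported claims never reach the final report
        key = (claim.entity, claim.statement.lower())
        if key not in seen:
            seen.add(key)
            kept.append(claim)
    return kept


raw = [
    {"entity": "Acme AI", "statement": "Offers a free tier.", "source_url": "https://example.com/a"},
    {"entity": "Acme Inc.", "statement": "offers a free tier.", "source_url": "https://example.com/b"},
    {"entity": "Globex", "statement": "Cheapest per token."},  # no source_url
]
for claim in validated_claims(raw):
    print(claim.entity, "|", claim.statement, "|", claim.source_url)
# Acme | Offers a free tier. | https://example.com/a
```

In this example the two Acme entries collapse into one because their aliases and statements normalize to the same key, and the Globex claim is dropped for lacking evidence.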

---

*Last updated: May 6, 2026*