Spaces:
Running
Running
| title: NEXUS OS v4.0 | |
| emoji: 🔥 | |
| colorFrom: red | |
| colorTo: purple | |
| sdk: gradio | |
| sdk_version: 6.14.0 | |
| app_file: app.py | |
| pinned: false | |
| tags: | |
| - ml-intern | |
| # NEXUS OS v4.0 — Intelligent Multi-Provider Router | |
| **COMPLETELY self-contained** — zero external dependencies except gradio + stdlib. | |
| No torch, no pinecone, no package imports that crash on startup. | |
| ## How It Works | |
| ### Intelligent Routing (Auto-Detected) | |
| The app queries ALL configured providers in parallel, measures health + latency, | |
| and picks the best one automatically. Falls back through the chain if any fail. | |
| | Priority | Provider | Free Tier | Strength | | |
| |----------|----------|-----------|----------| | |
| | **1** | **HF Inference Providers** | $0.10/mo credits | Auto-routing, single HF token | | |
| | **2** | **Groq** | Generous | Fastest inference (LPU chips) | | |
| | **3** | **DeepSeek** | 5M tokens | Best reasoning models | | |
| | **4** | **OpenRouter** | 25+ free models | Most model variety | | |
| | **5** | **Together AI** | Rate-limited 70B | Large models, slow | | |
| | **6** | **Ollama Relay** | Your local models | Via ngrok tunnel | | |
| | **7** | **Mock** | Always works | Simulated for testing | | |
| ### Setup | |
| **No setup needed for mock mode.** To get real inference, add API keys as Space secrets: | |
| | Secret | Provider | Get Key At | | |
| |--------|----------|------------| | |
| | `HF_TOKEN` | HF Inference Providers | Already active in Spaces | | |
| | `GROQ_API_KEY` | Groq | https://console.groq.com | | |
| | `DEEPSEEK_API_KEY` | DeepSeek | https://platform.deepseek.com | | |
| | `OPENROUTER_API_KEY` | OpenRouter | https://openrouter.ai | | |
| | `TOGETHER_API_KEY` | Together AI | https://api.together.xyz | | |
| | `OLLAMA_RELAY_URL` | Your local Ollama | `ngrok http 11434` | | |
| ## Features | |
| - **37+ real models** in registry | |
| - **Thermodynamic telemetry**: EEP, PTI, NEWI hallucination signals | |
| - **VRAM-aware filtering**: only shows models that fit your budget | |
| - **Per-token risk scoring**: hallucination detection simulation | |
| ## What's New in v4.0 | |
| - **Self-contained**: no `nexus_os_v2/` imports, no torch/pinecone dependencies | |
| - **5 real providers**: HF Router, Groq, DeepSeek, OpenRouter, Together AI | |
| - **Removed**: Kilocode (IDE plugin), OpenCode (IDE plugin), NVIDIA NIM (trial only), Fireworks ($1 credit) | |
| - **Intelligent routing**: parallel health checks, capability-based model selection | |
| ## Repository | |
| [specimba/nexus-os-v2](https://huggingface.co/datasets/specimba/nexus-os-v2) | |