nexus-os-space / README.md
specimba's picture
Update ML Intern artifact metadata
5b65a76 verified
---
title: NEXUS OS v4.0
emoji: 🔥
colorFrom: red
colorTo: purple
sdk: gradio
sdk_version: 6.14.0
app_file: app.py
pinned: false
tags:
- ml-intern
---
# NEXUS OS v4.0 — Intelligent Multi-Provider Router
**COMPLETELY self-contained** — zero external dependencies except gradio + stdlib.
No torch, no pinecone, no package imports that crash on startup.
## How It Works
### Intelligent Routing (Auto-Detected)
The app queries ALL configured providers in parallel, measures health + latency,
and picks the best one automatically. Falls back through the chain if any fail.
| Priority | Provider | Free Tier | Strength |
|----------|----------|-----------|----------|
| **1** | **HF Inference Providers** | $0.10/mo credits | Auto-routing, single HF token |
| **2** | **Groq** | Generous | Fastest inference (LPU chips) |
| **3** | **DeepSeek** | 5M tokens | Best reasoning models |
| **4** | **OpenRouter** | 25+ free models | Most model variety |
| **5** | **Together AI** | Rate-limited 70B | Large models, slow |
| **6** | **Ollama Relay** | Your local models | Via ngrok tunnel |
| **7** | **Mock** | Always works | Simulated for testing |
### Setup
**No setup needed for mock mode.** To get real inference, add API keys as Space secrets:
| Secret | Provider | Get Key At |
|--------|----------|------------|
| `HF_TOKEN` | HF Inference Providers | Already active in Spaces |
| `GROQ_API_KEY` | Groq | https://console.groq.com |
| `DEEPSEEK_API_KEY` | DeepSeek | https://platform.deepseek.com |
| `OPENROUTER_API_KEY` | OpenRouter | https://openrouter.ai |
| `TOGETHER_API_KEY` | Together AI | https://api.together.xyz |
| `OLLAMA_RELAY_URL` | Your local Ollama | `ngrok http 11434` |
## Features
- **37+ real models** in registry
- **Thermodynamic telemetry**: EEP, PTI, NEWI hallucination signals
- **VRAM-aware filtering**: only shows models that fit your budget
- **Per-token risk scoring**: hallucination detection simulation
## What's New in v4.0
- **Self-contained**: no `nexus_os_v2/` imports, no torch/pinecone dependencies
- **5 real providers**: HF Router, Groq, DeepSeek, OpenRouter, Together AI
- **Removed**: Kilocode (IDE plugin), OpenCode (IDE plugin), NVIDIA NIM (trial only), Fireworks ($1 credit)
- **Intelligent routing**: parallel health checks, capability-based model selection
## Repository
[specimba/nexus-os-v2](https://huggingface.co/datasets/specimba/nexus-os-v2)