Spaces:

specimba
/

nexus-os-space

Running

App Files Files Community

nexus-os-space / README.md

specimba

Update ML Intern artifact metadata

5b65a76 verified 5 days ago

preview code

raw

history blame contribute delete

2.41 kB

	---
	title: NEXUS OS v4.0
	emoji: 🔥
	colorFrom: red
	colorTo: purple
	sdk: gradio
	sdk_version: 6.14.0
	app_file: app.py
	pinned: false
	tags:
	- ml-intern
	---

	# NEXUS OS v4.0 — Intelligent Multi-Provider Router

	COMPLETELY self-contained — zero external dependencies except gradio + stdlib.
	No torch, no pinecone, no package imports that crash on startup.

	## How It Works

	### Intelligent Routing (Auto-Detected)
	The app queries ALL configured providers in parallel, measures health + latency,
	and picks the best one automatically. Falls back through the chain if any fail.

	\| Priority \| Provider \| Free Tier \| Strength \|
	\|----------\|----------\|-----------\|----------\|
	\| 1 \| HF Inference Providers \| $0.10/mo credits \| Auto-routing, single HF token \|
	\| 2 \| Groq \| Generous \| Fastest inference (LPU chips) \|
	\| 3 \| DeepSeek \| 5M tokens \| Best reasoning models \|
	\| 4 \| OpenRouter \| 25+ free models \| Most model variety \|
	\| 5 \| Together AI \| Rate-limited 70B \| Large models, slow \|
	\| 6 \| Ollama Relay \| Your local models \| Via ngrok tunnel \|
	\| 7 \| Mock \| Always works \| Simulated for testing \|

	### Setup

	No setup needed for mock mode. To get real inference, add API keys as Space secrets:

	\| Secret \| Provider \| Get Key At \|
	\|--------\|----------\|------------\|
	\| `HF_TOKEN` \| HF Inference Providers \| Already active in Spaces \|
	\| `GROQ_API_KEY` \| Groq \| https://console.groq.com \|
	\| `DEEPSEEK_API_KEY` \| DeepSeek \| https://platform.deepseek.com \|
	\| `OPENROUTER_API_KEY` \| OpenRouter \| https://openrouter.ai \|
	\| `TOGETHER_API_KEY` \| Together AI \| https://api.together.xyz \|
	\| `OLLAMA_RELAY_URL` \| Your local Ollama \| `ngrok http 11434` \|

	## Features
	- 37+ real models in registry
	- Thermodynamic telemetry: EEP, PTI, NEWI hallucination signals
	- VRAM-aware filtering: only shows models that fit your budget
	- Per-token risk scoring: hallucination detection simulation

	## What's New in v4.0
	- Self-contained: no `nexus_os_v2/` imports, no torch/pinecone dependencies
	- 5 real providers: HF Router, Groq, DeepSeek, OpenRouter, Together AI
	- Removed: Kilocode (IDE plugin), OpenCode (IDE plugin), NVIDIA NIM (trial only), Fireworks ($1 credit)
	- Intelligent routing: parallel health checks, capability-based model selection

	## Repository
	[specimba/nexus-os-v2](https://huggingface.co/datasets/specimba/nexus-os-v2)

	---
	title: NEXUS OS v4.0
	emoji: 🔥
	colorFrom: red
	colorTo: purple
	sdk: gradio
	sdk_version: 6.14.0
	app_file: app.py
	pinned: false
	tags:
	- ml-intern
	---

	# NEXUS OS v4.0 — Intelligent Multi-Provider Router

	COMPLETELY self-contained — zero external dependencies except gradio + stdlib.
	No torch, no pinecone, no package imports that crash on startup.

	## How It Works

	### Intelligent Routing (Auto-Detected)
	The app queries ALL configured providers in parallel, measures health + latency,
	and picks the best one automatically. Falls back through the chain if any fail.

	\| Priority \| Provider \| Free Tier \| Strength \|
	\|----------\|----------\|-----------\|----------\|
	\| 1 \| HF Inference Providers \| $0.10/mo credits \| Auto-routing, single HF token \|
	\| 2 \| Groq \| Generous \| Fastest inference (LPU chips) \|
	\| 3 \| DeepSeek \| 5M tokens \| Best reasoning models \|
	\| 4 \| OpenRouter \| 25+ free models \| Most model variety \|
	\| 5 \| Together AI \| Rate-limited 70B \| Large models, slow \|
	\| 6 \| Ollama Relay \| Your local models \| Via ngrok tunnel \|
	\| 7 \| Mock \| Always works \| Simulated for testing \|

	### Setup

	No setup needed for mock mode. To get real inference, add API keys as Space secrets:

	\| Secret \| Provider \| Get Key At \|
	\|--------\|----------\|------------\|
	\| `HF_TOKEN` \| HF Inference Providers \| Already active in Spaces \|
	\| `GROQ_API_KEY` \| Groq \| https://console.groq.com \|
	\| `DEEPSEEK_API_KEY` \| DeepSeek \| https://platform.deepseek.com \|
	\| `OPENROUTER_API_KEY` \| OpenRouter \| https://openrouter.ai \|
	\| `TOGETHER_API_KEY` \| Together AI \| https://api.together.xyz \|
	\| `OLLAMA_RELAY_URL` \| Your local Ollama \| `ngrok http 11434` \|

	## Features
	- 37+ real models in registry
	- Thermodynamic telemetry: EEP, PTI, NEWI hallucination signals
	- VRAM-aware filtering: only shows models that fit your budget
	- Per-token risk scoring: hallucination detection simulation

	## What's New in v4.0
	- Self-contained: no `nexus_os_v2/` imports, no torch/pinecone dependencies
	- 5 real providers: HF Router, Groq, DeepSeek, OpenRouter, Together AI
	- Removed: Kilocode (IDE plugin), OpenCode (IDE plugin), NVIDIA NIM (trial only), Fireworks ($1 credit)
	- Intelligent routing: parallel health checks, capability-based model selection

	## Repository
	[specimba/nexus-os-v2](https://huggingface.co/datasets/specimba/nexus-os-v2)