Inference Providers
Active filters: modelopt
Ex0bit/Qwen3-VLTO-32B-Instruct-NVFP4
Text Generation
• 17B • Updated • 1.01k
• 1
mdavidson83/Qwen3-Embedding-4B_nvfp4_hf
Ex0bit/Qwen3-VLTO-32B-Instruct-NVFP4-256K
Text Generation
• 17B • Updated • 2.27k
• 1
Image-Text-to-Text
• 13B • Updated • 9
lukealonso/MiniMax-M2-NVFP4
115B • Updated • 12
• 14
Text Generation
• 7B • Updated • 8
• 1
leatan95/Tongyi-DeepResearch-30B-A3B-NVFP4
16B • Updated DataSnake/Wayfarer-12B-NVFP4
Text Generation
• 7B • Updated • 3
• 1
DataSnake/Wayfarer-2-12B-NVFP4
Text Generation
• 7B • Updated • 4
• 2
Ex0bit/OLMo-3-7B-Instruct-NVFP4-1M
Text Generation
• 4B • Updated • 21
• 2
wangqia0309/Captain-Eris_Violet-V0.420-12B-FP8-KV-modelopt
12B • Updated • 30
rahtml/Qwen3-Coder-30B-A3B-Instruct-NVFP4
16B • Updated nvidia/Kimi-K2-Thinking-NVFP4
Text Generation
• Updated • 25.3k
• 30
eousphoros/DeepSeek-V3.2-NVFP4
Text Generation
• 387B • Updated • 25
• 5
zhuyksir/qwen3_30b_a3b_nvfp4_baseline
16B • Updated • 3
zhuyksir/qwen3_30b_a3b_nvfp4_qat
16B • Updated alphatozeta/sglang_glm_4_6_fp4_modelopt
177B • Updated ericlewis/Nemotron-Orchestrator-8B-NVFP4
Text Generation
• 5B • Updated • 4
trithemius/Velvet-14B-nvfp4
8B • Updated • 2
OPENZEKA/Qwen3-4B-Instruct-2507-NVFP4
2B • Updated • 189
Z841973620/Qwen3-30B-A3B-NVFP4
Text Generation
• 16B • Updated • 3
Z841973620/Qwen3-30B-A3B-FP8
Text Generation
• 31B • Updated • 1
OPENZEKA/Qwen3-Coder-30B-A3B-Instruct-NVFP4
Text Generation
• 16B • Updated • 9.57k
josephdowling10/Mixtral-8x7B-Instruct-v0.1-NVFP4
Text Generation
• 23B • Updated • 26
taharmasmaliyev07/Llama-2-7b-hf-fp8
7B • Updated OPENZEKA/Qwen3-Coder-480B-A35B-Instruct-NVFP4
241B • Updated • 3
Shifusen/Llama-3.3-70B-Instruct-abliterated-NVFP4-modelopt
36B • Updated • 4
taharmasmaliyev07/Mistral-7B-v0.1-fp8
7B • Updated taharmasmaliyev07/Llama-3.1-8B-fp8
8B • Updated taharmasmaliyev07/gemma-2-9b-it-fp8
9B • Updated