Inference Providers
Active filters: FP4
NVFP4/Qwen3-235B-A22B-Instruct-2507-FP4
Text Generation
• 118B • Updated • 219
• 4
NVFP4/Qwen3-Coder-480B-A35B-Instruct-FP4
Text Generation
• 241B • Updated • 401
• 2
NVFP4/Qwen3-235B-A22B-Thinking-2507-FP4
Text Generation
• 118B • Updated • 5
• 2
BitPhinix/DeepSeek-V3-0324-FP4
Text Generation
• 397B • Updated • 3
NVFP4/Qwen3-30B-A3B-Instruct-2507-FP4
Text Generation
• 16B • Updated • 570
• 12
NVFP4/Qwen3-30B-A3B-Thinking-2507-FP4
Text Generation
• 16B • Updated • 99
• 4
NVFP4/Qwen3-Coder-30B-A3B-Instruct-FP4
Text Generation
• 16B • Updated • 9.77k
• 23
Text Generation
• 0.4B • Updated • 311
• 1
nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1-FP4-QAD
Image-Text-to-Text
• 6B • Updated • 158
• 14
nvidia/NVIDIA-Nemotron-Nano-12B-v2-VL-NVFP4-QAD
Image-Text-to-Text
• Updated • 14.2k
• 25
nvidia/Kimi-K2-Thinking-NVFP4
Text Generation
• Updated • 25.4k
• 30
nvidia/Qwen3-235B-A22B-Thinking-2507-FP4-Eagle3
Text Generation
• Updated • 48
Text Generation
• 5B • Updated • 484
• 1
surogate/Qwen3-30B-A3B-NVFP4
Text Generation
• 16B • Updated • 3
Text Generation
• 17B • Updated • 3
Text Generation
• 8B • Updated • 5
Cirrascale/Kimi-K2.5-NVFP4
Text Generation
• Updated • 112
Cirrascale/Qwen3-Coder-480B-A35B-Instruct-NVFP4
Text Generation
• 241B • Updated • 5
Cirrascale/Qwen3.5-397B-A17B-NVFP4
Text Generation
• Updated • 7
txn545/Qwen3.5-35B-A3B-NVFP4
Text Generation
• Updated • 12.7k
• 5
txn545/Qwen3.5-122B-A10B-NVFP4
Text Generation
• 64B • Updated • 96.6k
• 24
Text Generation
• 17B • Updated • 1.52k
• 1
mmangkad/Qwen3.5-27B-NVFP4
Text Generation
• 20B • Updated • 428
fsgfn/Qwen3.5-122B-A10B-NVFP4
Text Generation
• 64B • Updated • 23
Text Generation
• Updated • 107
Text Generation
• Updated • 1.97k
mmangkad/Qwen3.5-35B-A3B-NVFP4-V2
Text Generation
• Updated • 368
mmangkad/Qwen3.5-122B-A10B-NVFP4-V2
Text Generation
• 65B • Updated • 256
vivien118899/Llama-3.1-8B-Instruct-NVFP4
5B • Updated