Inference Providers
Active filters: W4A16
ModelCloud/QwQ-32B-Preview-gptqmodel-4bit-vortex-v2
Text Generation
• 33B • Updated • 11
• 16
ModelCloud/QwQ-32B-Preview-gptqmodel-4bit-vortex-v3
Text Generation
• 33B • Updated • 7
• 14
ModelCloud/Falcon3-10B-Instruct-gptqmodel-4bit-vortex-v1
Text Generation
• 10B • Updated • 5
• 3
ModelCloud/Qwen2.5-0.5B-Instruct-gptqmodel-w4a16
Text Generation
• 0.5B • Updated • 31
• 1
ModelCloud/DeepSeek-R1-Distill-Qwen-7B-gptqmodel-4bit-vortex-v1
Text Generation
• 8B • Updated • 4
• 6
ModelCloud/DeepSeek-R1-Distill-Qwen-7B-gptqmodel-4bit-vortex-v2
Text Generation
• 8B • Updated • 187
• 8
RedHatAI/phi-4-quantized.w4a16
Text Generation
• 3B • Updated • 174
• 5
RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-quantized.w4a16
Image-Text-to-Text
• 5B • Updated • 3.06k
• 10
RedHatAI/Llama-4-Scout-17B-16E-Instruct-quantized.w4a16
Image-Text-to-Text
• 20B • Updated • 25.8k
• 12
pyrymikko/nomic-embed-code-W4A16-AWQ
1B • Updated • 228
tcclaviger/Minimax-M2-Thrift-GPTQ-W4A16-AMD
Text Generation
• 24B • Updated • 15
• 1
TevunahAi/granite-34b-code-instruct-8k-Ultra-Hybrid
Text Generation
• 11B • Updated • 6
TevunahAi/Llama-3.1-70B-Instruct-Ultra-Hybrid
Text Generation
• 22B • Updated • 4
Vishva007/Qwen3-4B-Instruct-2507-W4A16-AutoRound
Text Generation
• 0.9B • Updated • 40
Vishva007/Qwen3-VL-8B-Instruct-W4A16-AutoRound
Image-Text-to-Text
• 2B • Updated • 5
Vishva007/Qwen3-VL-2B-Instruct-W4A16-AutoRound
Image-Text-to-Text
• 0.9B • Updated • 7
Vishva007/Qwen3-VL-2B-Instruct-W4A16-AutoRound-GPTQ
Image-Text-to-Text
• 2B • Updated • 7
Vishva007/Qwen3-VL-2B-Instruct-W4A16-AutoRound-AWQ
Image-Text-to-Text
• 2B • Updated • 13
Vishva007/Qwen3-VL-4B-Instruct-W4A16-AutoRound
Image-Text-to-Text
• 1B • Updated • 3
Vishva007/Qwen3-VL-4B-Instruct-W4A16-AutoRound-GPTQ
Image-Text-to-Text
• 4B • Updated • 69
Vishva007/Qwen3-VL-4B-Instruct-W4A16-AutoRound-AWQ
Image-Text-to-Text
• 4B • Updated • 53
• 1
embedl/Cosmos-Reason2-2B-W4A16
Image-Text-to-Text
• 2B • Updated • 572
• 7
bg-digitalservices/Gemma-4-26B-A4B-it-NVFP4A16
Text Generation
• 15B • Updated • 5.62k
• 4
bg-digitalservices/Apertus-8B-2509-NVFP4A16
Text Generation
• 5B • Updated • 286
bg-digitalservices/Apertus-8B-Instruct-2509-NVFP4A16
Text Generation
• 5B • Updated • 278
bg-digitalservices/Apertus-70B-2509-NVFP4A16
Text Generation
• 36B • Updated • 308
bg-digitalservices/Apertus-70B-Instruct-2509-NVFP4A16
Text Generation
• 36B • Updated • 310
bg-digitalservices/Gemma-4-E2B-NVFP4A16
Text Generation
• 4B • Updated • 9.73k
bg-digitalservices/Gemma-4-E2B-it-NVFP4A16
Text Generation
• 4B • Updated • 30.5k
bg-digitalservices/Gemma-4-E4B-it-NVFP4A16
Text Generation
• 6B • Updated • 365