Inference Providers
Active filters: vLLM
JunHowie/Qwen3-32B-GPTQ-Int4
Text Generation
• 33B • Updated • 27.9k
• 4
JunHowie/Qwen3-32B-GPTQ-Int8
Text Generation
• 33B • Updated • 205
• 4
JunHowie/Qwen3-30B-A3B-GPTQ-Int4
Text Generation
• 5B • Updated • 46
• 1
JunHowie/Qwen3-14B-GPTQ-Int8
Text Generation
• 15B • Updated • 128
• 1
JunHowie/Qwen3-14B-GPTQ-Int4
Text Generation
• 15B • Updated • 4.85k
• 4
JunHowie/Qwen3-8B-GPTQ-Int8
Text Generation
• 8B • Updated • 147
JunHowie/Qwen3-8B-GPTQ-Int4
Text Generation
• 8B • Updated • 745
• 4
JunHowie/Qwen3-4B-GPTQ-Int4
Text Generation
• 4B • Updated • 740
• 1
JunHowie/Qwen3-4B-GPTQ-Int8
Text Generation
• 4B • Updated • 125
JunHowie/Qwen3-30B-A3B-GPTQ-Int8
Text Generation
• 8B • Updated • 438
QuantTrio/Qwen3-235B-A22B-GPTQ-Int8
Text Generation
• 235B • Updated • 5
BeastyZ/Qwen2.5-3B-ConvSearch-R1-TopiOCQA
3B • Updated • 7
QuantTrio/DeepSeek-R1-0528-Qwen3-8B-GPTQ-Int4-Int8Mix
Text Generation
• 11B • Updated • 9
• 4
QuantTrio/DeepSeek-R1-0528-GPTQ-Int4-Int8Mix-Lite
Text Generation
• 721B • Updated • 89
• 2
QuantTrio/DeepSeek-R1-0528-GPTQ-Int4-Int8Mix-Compact
Text Generation
• 847B • Updated • 11
• 5
QuantTrio/DeepSeek-R1-0528-GPTQ-Int4-Int8Mix-Medium
Text Generation
• 912B • Updated • 23
• 1
brandonbeiler/InternVL3-38B-FP8-Dynamic
Image-Text-to-Text
• 38B • Updated • 9
brandonbeiler/InternVL3-78B-FP8-Dynamic
Image-Text-to-Text
• 78B • Updated • 164
brandonbeiler/InternVL3-8B-FP8-Dynamic
Image-Text-to-Text
• 8B • Updated • 24
• 2
dengcao/GLM-4.1V-9B-Thinking-GPTQ-Int4-Int8Mix
Image-Text-to-Text
• 15B • Updated • 16
• 2
dengcao/GLM-4.1V-9B-Thinking-AWQ
Image-Text-to-Text
• 10B • Updated • 216k
• 1
brandonbeiler/Skywork-R1V3-38B-FP8-Dynamic
Image-Text-to-Text
• 38B • Updated • 16
• 2
koushd/Qwen3-235B-A22B-Instruct-2507-AWQ
Text Generation
• 235B • Updated • 1.59k
• 4
QuantTrio/Qwen3-235B-A22B-Instruct-2507-GPTQ-Int4-Int8Mix
Text Generation
• 248B • Updated • 45
• 4
QuantTrio/Qwen3-235B-A22B-Instruct-2507-AWQ
Text Generation
• 235B • Updated • 3.77k
• 10
QuantTrio/GLM-4.1V-9B-Thinking-GPTQ-Int4-Int8Mix
Text Generation
• 15B • Updated • 12
• 1
QuantTrio/GLM-4.1V-9B-Thinking-AWQ
Text Generation
• 10B • Updated • 470
QuantTrio/Qwen3-Coder-480B-A35B-Instruct-AWQ
Text Generation
• 480B • Updated • 1.65k
• 8
QuantTrio/Qwen3-Coder-480B-A35B-Instruct-GPTQ-Int4-Int8Mix
Text Generation
• 534B • Updated • 177
• 7
QuantTrio/Qwen3-235B-A22B-Thinking-2507-AWQ
Text Generation
• 235B • Updated • 293
• 6