Inference Providers
Active filters: GPTQ
JunHowie/Qwen3-0.6B-GPTQ-Int8
Text Generation
• 0.6B • Updated • 10
JunHowie/Qwen3-1.7B-GPTQ-Int4
Text Generation
• 2B • Updated • 157
• 1
JunHowie/Qwen3-1.7B-GPTQ-Int8
Text Generation
• 2B • Updated • 12
JunHowie/Qwen3-32B-GPTQ-Int4
Text Generation
• 33B • Updated • 27.9k
• 4
JunHowie/Qwen3-32B-GPTQ-Int8
Text Generation
• 33B • Updated • 213
• 4
JunHowie/Qwen3-30B-A3B-GPTQ-Int4
Text Generation
• 5B • Updated • 46
• 1
JunHowie/Qwen3-14B-GPTQ-Int8
Text Generation
• 15B • Updated • 126
• 1
JunHowie/Qwen3-14B-GPTQ-Int4
Text Generation
• 15B • Updated • 4.46k
• 4
JunHowie/Qwen3-8B-GPTQ-Int8
Text Generation
• 8B • Updated • 148
JunHowie/Qwen3-8B-GPTQ-Int4
Text Generation
• 8B • Updated • 759
• 4
JunHowie/Qwen3-4B-GPTQ-Int4
Text Generation
• 4B • Updated • 750
• 1
JunHowie/Qwen3-4B-GPTQ-Int8
Text Generation
• 4B • Updated • 127
JunHowie/Qwen3-30B-A3B-GPTQ-Int8
Text Generation
• 8B • Updated • 437
iqbalamo93/Phi-4-mini-instruct-GPTQ-4bit
Text Generation
• 4B • Updated • 383
iqbalamo93/Phi-4-mini-instruct-GPTQ-8bit
Text Generation
• 4B • Updated • 21
• 2
GusPuffy/Legion-V2.1-LLaMa-70B-GPTQ
Text Generation
• 11B • Updated • 2
QuantTrio/DeepSeek-R1-0528-Qwen3-8B-GPTQ-Int4-Int8Mix
Text Generation
• 11B • Updated • 8
• 4
RedHatAI/DeepSeek-R1-0528-quantized.w4a16
Text Generation
• 104B • Updated • 767
• 13
QuantTrio/DeepSeek-R1-0528-GPTQ-Int4-Int8Mix-Lite
Text Generation
• 721B • Updated • 88
• 2
QuantTrio/DeepSeek-R1-0528-GPTQ-Int4-Int8Mix-Compact
Text Generation
• 847B • Updated • 11
• 5
AXERA-TECH/Qwen2.5-0.5B-Instruct-CTX-Int8
QuantTrio/DeepSeek-R1-0528-GPTQ-Int4-Int8Mix-Medium
Text Generation
• 912B • Updated • 20
• 1
kxdw2580/DeepSeek-R1-0528-Qwen3-8B-catgirl-v2.5-gptqv2-8bit
Text Generation
• 8B • Updated • 10
kxdw2580/DeepSeek-R1-0528-Qwen3-8B-catgirl-v2.5-gptqv2-4bit
Text Generation
• 8B • Updated • 10
dengcao/GLM-4.1V-9B-Thinking-GPTQ-Int4-Int8Mix
Image-Text-to-Text
• 15B • Updated • 15
• 2
RedHatAI/Kimi-K2-Instruct-quantized.w4a16
Text Generation
• 1T • Updated • 581
• 12
GusPuffy/BlackSheep-24B-GPTQ
Text Generation
• 4B • Updated • 9
QuantTrio/Qwen3-235B-A22B-Instruct-2507-GPTQ-Int4-Int8Mix
Text Generation
• 248B • Updated • 75
• 4
QuantTrio/GLM-4.1V-9B-Thinking-GPTQ-Int4-Int8Mix
Text Generation
• 15B • Updated • 13
• 1
QuantTrio/Qwen3-Coder-480B-A35B-Instruct-GPTQ-Int4-Int8Mix
Text Generation
• 534B • Updated • 178
• 7