Inference Providers
Active filters: GPTQ
QuantTrio/Qwen3-235B-A22B-Thinking-2507-GPTQ-Int4-Int8Mix
Text Generation
• 253B • Updated • 8
• 4
QuantTrio/Qwen3-30B-A3B-Instruct-2507-GPTQ-Int8
Text Generation
• 31B • Updated • 185
• 9
QuantTrio/GLM-4.5-GPTQ-Int4-Int8Mix
Text Generation
• 55B • Updated • 85
• 5
QuantTrio/Qwen3-30B-A3B-Thinking-2507-GPTQ-Int8
Text Generation
• 31B • Updated • 111
• 2
QuantTrio/Qwen3-Coder-30B-A3B-Instruct-GPTQ-Int8
Text Generation
• 31B • Updated • 857
• 8
QuantTrio/Seed-OSS-36B-Instruct-GPTQ-Int8
Text Generation
• 36B • Updated • 63
• 4
QuantTrio/Seed-OSS-36B-Instruct-GPTQ-Int4
Text Generation
• 36B • Updated • 72
• 5
QuantTrio/Seed-OSS-36B-Instruct-GPTQ-Int3
Text Generation
• 34B • Updated • 6
• 3
QuantTrio/DeepSeek-V3.1-AWQ
Text Generation
• 485B • Updated • 300
• 5
QuantTrio/DeepSeek-V3.1-AWQ-Fp16Mix
Text Generation
• 286B • Updated • 12
• 1
JunHowie/Qwen3-4B-Instruct-2507-GPTQ-Int8
Text Generation
• 4B • Updated • 151
JunHowie/Qwen3-4B-Thinking-2507-GPTQ-Int4
Text Generation
• 4B • Updated • 40
• 1
JunHowie/Qwen3-4B-Thinking-2507-GPTQ-Int8
Text Generation
• 4B • Updated • 50
• 2
JunHowie/Qwen3-30B-A3B-Instruct-2507-GPTQ-Int4
Text Generation
• 31B • Updated • 21.5k
JunHowie/Qwen3-30B-A3B-Instruct-2507-GPTQ-Int8
Text Generation
• 31B • Updated • 7
JunHowie/Qwen3-30B-A3B-Thinking-2507-GPTQ-Int4
Text Generation
• 31B • Updated • 15
JunHowie/Qwen2-7B-Instruct-GPTQ-Int4
Text Generation
• 8B • Updated • 5
JunHowie/Qwen2-7B-Instruct-GPTQ-Int8
Text Generation
• 8B • Updated • 3
JunHowie/Qwen3-30B-A3B-Thinking-2507-GPTQ-Int8
Text Generation
• 31B • Updated • 2
JunHowie/Seed-OSS-36B-Instruct-GPTQ-Int4
Text Generation
• 36B • Updated • 5
JunHowie/Seed-OSS-36B-Instruct-GPTQ-Int8
Text Generation
• 36B • Updated • 4
QuantTrio/GLM-4.6-GPTQ-Int4-Int8Mix
Text Generation
• 69B • Updated • 13
• 4
QuantTrio/KAT-Dev-GPTQ-Int4
Text Generation
• 33B • Updated • 4
• 1
QuantTrio/KAT-Dev-GPTQ-Int8
Text Generation
• 33B • Updated • 3
• 1
QuantTrio/Kimi-Dev-72B-GPTQ-Int4
Text Generation
• 73B • Updated • 36
• 2
QuantTrio/Kimi-Dev-72B-GPTQ-Int8
Text Generation
• 73B • Updated • 13
• 2
AXERA-TECH/Qwen3-VL-2B-Instruct-GPTQ-Int4
Image-Text-to-Text
• Updated • 53
• 1
AXERA-TECH/Qwen3-VL-4B-Instruct-GPTQ-Int4
Image-Text-to-Text
• Updated • 37
AXERA-TECH/Qwen3-VL-8B-Instruct-GPTQ-Int4
Image-Text-to-Text
• Updated • 24
• 1
AXERA-TECH/Qwen3-VL-8B-Instruct
Image-Text-to-Text
• Updated • 5