Inference Providers
Active filters: gptq
rinna/llama-3-youko-8b-instruct-gptq
Text Generation
• 8B • Updated • 6
• 1
rinna/llama-3-youko-70b-gptq
Text Generation
• 71B • Updated • 4
rinna/llama-3-youko-70b-instruct-gptq
Text Generation
• 71B • Updated • 7
Xu-Ouyang/pythia-1.4b-deduped-int3-step14000-GPTQ-wikitext2
Text Generation
• 1B • Updated • 1
Xu-Ouyang/pythia-1.4b-deduped-int3-step29000-GPTQ-wikitext2
Text Generation
• 1B • Updated • 1
Xu-Ouyang/pythia-1.4b-deduped-int3-step43000-GPTQ-wikitext2
Text Generation
• 1B • Updated • 1
Xu-Ouyang/pythia-1.4b-deduped-int3-step57000-GPTQ-wikitext2
Text Generation
• 1B • Updated • 1
ChenMnZ/Llama-2-13b-EfficientQAT-w2g128-GPTQ
Text Generation
• 13B • Updated • 2
ChenMnZ/Llama-2-13b-EfficientQAT-w2g128-BitBLAS
Text Generation
• 51B • Updated • 2
ChenMnZ/Llama-2-13b-EfficientQAT-w2g64-BitBLAS
Text Generation
• 51B • Updated • 2
ChenMnZ/Llama-2-13b-EfficientQAT-w2g64-GPTQ
Text Generation
• 13B • Updated • 2
ChenMnZ/Llama-2-13b-EfficientQAT-w4g128-BitBLAS
Text Generation
• 51B • Updated • 5
Xu-Ouyang/pythia-2.8b-deduped-int4-step129000-GPTQ-wikitext2
Text Generation
• 3B • Updated • 1
ChenMnZ/Llama-2-13b-EfficientQAT-w4g128-GPTQ
Text Generation
• 13B • Updated • 1
ChenMnZ/Llama-2-70b-EfficientQAT-w2g128-BitBLAS
Text Generation
• 274B • Updated • 3
ChenMnZ/Llama-2-70b-EfficientQAT-w2g128-GPTQ
Text Generation
• 69B • Updated • 1
ChenMnZ/Llama-2-70b-EfficientQAT-w2g64-GPTQ
Text Generation
• 69B • Updated • 1
ChenMnZ/Llama-2-70b-EfficientQAT-w4g128-BitBLAS
Text Generation
• 275B • Updated • 1
ChenMnZ/Llama-2-70b-EfficientQAT-w4g128-GPTQ
Text Generation
• 69B • Updated • 2
Xu-Ouyang/pythia-2.8b-deduped-int3-step14000-GPTQ-wikitext2
Text Generation
• 3B • Updated • 2
Xu-Ouyang/pythia-12b-deduped-int3-step14000-GPTQ-wikitext2
Text Generation
• 11B • Updated • 5
ChenMnZ/Llama-2-7b-EfficientQAT-w2g128-GPTQ
Text Generation
• 7B • Updated • 2
ChenMnZ/Llama-2-7b-EfficientQAT-w2g64-GPTQ
Text Generation
• 7B • Updated • 10
• 1
Xu-Ouyang/pythia-2.8b-deduped-int3-step29000-GPTQ-wikitext2
Text Generation
• 3B • Updated • 7
ModelCloud/gemma-2-27b-it-gptq-4bit
Text Generation
• 28B • Updated • 124
• 12
ChenMnZ/Llama-2-7b-EfficientQAT-w4g128-GPTQ
Text Generation
• 7B • Updated • 7
ChenMnZ/Llama-3-70b-EfficientQAT-w2g128-GPTQ
Text Generation
• 71B • Updated • 2
ChenMnZ/Llama-3-70b-EfficientQAT-w2g64-GPTQ
Text Generation
• 71B • Updated • 2
ChenMnZ/Llama-3-70b-EfficientQAT-w4g128-GPTQ
Text Generation
• 71B • Updated • 20
Xu-Ouyang/pythia-2.8b-deduped-int3-step43000-GPTQ-wikitext2
Text Generation
• 3B • Updated • 1