Models (7,514)

Active filters: gptq

rinna/llama-3-youko-8b-instruct-gptq

Text Generation • 8B • Updated Mar 23, 2025 • 6 • 1

rinna/llama-3-youko-70b-gptq

Text Generation • 71B • Updated Mar 23, 2025 • 4

rinna/llama-3-youko-70b-instruct-gptq

Text Generation • 71B • Updated Mar 23, 2025 • 7

Xu-Ouyang/pythia-1.4b-deduped-int3-step14000-GPTQ-wikitext2

Text Generation • 1B • Updated Jul 21, 2024 • 1

Xu-Ouyang/pythia-1.4b-deduped-int3-step29000-GPTQ-wikitext2

Text Generation • 1B • Updated Jul 21, 2024 • 1

Xu-Ouyang/pythia-1.4b-deduped-int3-step43000-GPTQ-wikitext2

Text Generation • 1B • Updated Jul 22, 2024 • 1

Xu-Ouyang/pythia-1.4b-deduped-int3-step57000-GPTQ-wikitext2

Text Generation • 1B • Updated Jul 22, 2024 • 1

ChenMnZ/Llama-2-13b-EfficientQAT-w2g128-GPTQ

Text Generation • 13B • Updated Jul 22, 2024 • 2

ChenMnZ/Llama-2-13b-EfficientQAT-w2g128-BitBLAS

Text Generation • 51B • Updated Jul 22, 2024 • 2

ChenMnZ/Llama-2-13b-EfficientQAT-w2g64-BitBLAS

Text Generation • 51B • Updated Jul 22, 2024 • 2

ChenMnZ/Llama-2-13b-EfficientQAT-w2g64-GPTQ

Text Generation • 13B • Updated Jul 22, 2024 • 2

ChenMnZ/Llama-2-13b-EfficientQAT-w4g128-BitBLAS

Text Generation • 51B • Updated Jul 22, 2024 • 5

Xu-Ouyang/pythia-2.8b-deduped-int4-step129000-GPTQ-wikitext2

Text Generation • 3B • Updated Jul 22, 2024 • 1

ChenMnZ/Llama-2-13b-EfficientQAT-w4g128-GPTQ

Text Generation • 13B • Updated Jul 22, 2024 • 1

ChenMnZ/Llama-2-70b-EfficientQAT-w2g128-BitBLAS

Text Generation • 274B • Updated Jul 22, 2024 • 3

ChenMnZ/Llama-2-70b-EfficientQAT-w2g128-GPTQ

Text Generation • 69B • Updated Jul 22, 2024 • 1

ChenMnZ/Llama-2-70b-EfficientQAT-w2g64-GPTQ

Text Generation • 69B • Updated Jul 22, 2024 • 1

ChenMnZ/Llama-2-70b-EfficientQAT-w4g128-BitBLAS

Text Generation • 275B • Updated Jul 22, 2024 • 1

ChenMnZ/Llama-2-70b-EfficientQAT-w4g128-GPTQ

Text Generation • 69B • Updated Jul 22, 2024 • 2

Xu-Ouyang/pythia-2.8b-deduped-int3-step14000-GPTQ-wikitext2

Text Generation • 3B • Updated Jul 22, 2024 • 2

Xu-Ouyang/pythia-12b-deduped-int3-step14000-GPTQ-wikitext2

Text Generation • 11B • Updated Jul 22, 2024 • 5

ChenMnZ/Llama-2-7b-EfficientQAT-w2g128-GPTQ

Text Generation • 7B • Updated Jul 22, 2024 • 2

ChenMnZ/Llama-2-7b-EfficientQAT-w2g64-GPTQ

Text Generation • 7B • Updated Jul 22, 2024 • 10 • 1

Xu-Ouyang/pythia-2.8b-deduped-int3-step29000-GPTQ-wikitext2

Text Generation • 3B • Updated Jul 22, 2024 • 7

ModelCloud/gemma-2-27b-it-gptq-4bit

Text Generation • 28B • Updated Jul 23, 2024 • 124 • 12

ChenMnZ/Llama-2-7b-EfficientQAT-w4g128-GPTQ

Text Generation • 7B • Updated Jul 22, 2024 • 7

ChenMnZ/Llama-3-70b-EfficientQAT-w2g128-GPTQ

Text Generation • 71B • Updated Jul 22, 2024 • 2

ChenMnZ/Llama-3-70b-EfficientQAT-w2g64-GPTQ

Text Generation • 71B • Updated Jul 22, 2024 • 2

ChenMnZ/Llama-3-70b-EfficientQAT-w4g128-GPTQ

Text Generation • 71B • Updated Jul 22, 2024 • 20

Xu-Ouyang/pythia-2.8b-deduped-int3-step43000-GPTQ-wikitext2

Text Generation • 3B • Updated Jul 22, 2024 • 1