Quantized Models (GGUF, IQ, Imatrix)
Collection
Various GGUF quantizations of small models. Models with a "checkmark" are personal favorites. An "orange arrow" means it's being uploaded. • 97 items • Updated • 71
"Lllama 3.1 experiments."
Q4_0 ARM/Mobile quants here: L3.1-8B-Niitama-v1.1-GGUF-ARM-Imatrix-Supplementary.
My GGUF-IQ-Imatrix quants for Sao10K/L3.1-8B-Niitama-v1.1.
3-bit
4-bit
5-bit
6-bit
8-bit
16-bit
Base model
Sao10K/L3.1-8B-Niitama-v1.1