Quantized Models (GGUF, IQ, Imatrix)
Collection
Various GGUF quantizations of small models. Models marked with a "checkmark" are personal favorites. An "orange arrow" means the model is being uploaded. • 97 items
Updated!
Please grab the "v2" quants, which were remade with the new tokenizer settings to fix the endless generation issues.
SillyTavern
The complete AIO recommended preset:
v2-SillyTavern-Presets-AIO-2024-12-28.json
3-bit
4-bit
5-bit
6-bit
8-bit
16-bit