Quantized Models (GGUF, IQ, Imatrix)
Collection
Various GGUF quantizations of small models. Models with a "checkmark" are personal favorites. An "orange arrow" means it's being uploaded. • 97 items • Updated • 71
My GGUF-IQ-Imatrix quants for Sao10K/MN-BackyardAI-Party-12B-v1.
"For best results, set both <|im_end|> and [INST] as stopping strings. Recommended Temperature is <1 , min_p of at least 0.1."
"This does require a lot of tinkering to fit within SillyTavern / other frontends."
Prompting:
- Similar to Mistral for group chats (please read the original model page for information on this)
- ChatML for one-on-one chats
3-bit
4-bit
5-bit
6-bit
8-bit
16-bit
Base model
Sao10K/MN-BackyardAI-Party-12B-v1