Quantized Models (GGUF, IQ, Imatrix)
Collection
Various GGUF quantizations of small models. Models with a "checkmark" are personal favorites. An "orange arrow" means it's being uploaded. • 97 items • Updated • 71
This repository hosts GGUF-Imatrix quantizations for ChaoticNeutrals/Eris_Floramix_DPO_7B.
Base⇢ GGUF(F16)⇢ Imatrix-Data(F16)⇢ GGUF(Imatrix-Quants)
quantization_options = [
"Q3_K_M", "Q4_K_M", "Q5_K_M", "Q6_K",
"Q8_0", "IQ4_XS", "IQ3_XXS"
]
This is experimental.
For imatrix data generation, kalomaze's groups_merged.txt with added roleplay chats was used, you can find it here.
The goal is to measure the (hopefully positive) impact of this data for consistent formatting in roleplay chatting scenarios.
Image:
Original model information:
This is a mix between Eris Remix DPO and Flora DPO, a finetune of the original Eris Remix on the Synthetic_Soul_1k dataset.
Applied this DPO dataset: https://huggingface.co/datasets/athirdpath/DPO_Pairs-Roleplay-Alpaca-NSFW-v1-SHUFFLED
3-bit
4-bit
5-bit
6-bit
8-bit
16-bit