Miquella 120B GGUF
GGUF quantized weights for miquella-120b. Contains all quants.
I used Importance Matrices generated from Q8_0 quant of the model. The dataset used for that was random junk for optimal quality.
Due to the limitations of HF's file size, the larger files were split into multiple chunks. Instructions below.
Linux
Example uses Q3_K_L. Replace the names appropriately for your quant of choice.
cat miquella-120b.Q3_K_L.gguf_part_* > miquella-120b.Q3_K_L.gguf && rm miquella-120b.Q3_K_L.gguf_part_*
Windows
Example uses Q3_K_L. Replace the names appropriately for your quant of choice.
COPY /B miquella-120b.Q3_K_L.gguf_part_aa + miquella-120b.Q3_K_L.gguf_part_ab miquella-120b.Q3_K_L.gguf
Then delete the two splits.
- Downloads last month
- 17
Hardware compatibility
Log In to add your hardware
2-bit
3-bit
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐ Ask for provider support