GGUF Mistral-Nemo-2407-Instruct OQ8_0.EF32 IQuants
Collection
Custom GGUF quants of Mistral-Nemo-2407-Instruct, where the Output Tensors are quantized to Q8_0 while the Embeddings are kept at F32. ๐ง ๐ฅ๐ โข 1 item โข Updated
YAML Metadata Warning:empty or missing yaml metadata in repo card
Check out the documentation for more information.
Custom GGUF quants of Mistral-Nemo-Instruct-2407, where the Output Tensors are quantized to Q8_0 while the Embeddings are kept at F32. ๐ง ๐ฅ๐
4-bit
6-bit
8-bit