Non GGUF quantizations of Voxtral

by quasoft2 - opened Mar 9

Mar 9

•

There are almost no quantizations of Voxtral models. Is it possible to convert any of your GGUF quants for this model to format supported by transformers?

For example, I really like the Q6_K_L quantization, but have no idea how to make a similar one or convert it for use with transformers.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment