Request: oQ4 (MLX) quantization for Apple Silicon?

by 0xtimi2233 - opened about 15 hours ago

Hi,

Great model! Is anyone planning to convert this model to the oQ4 (MLX) format using omlx?
(Ref: https://github.com/jundot/omlx/blob/main/docs/oQ_Quantization.md)

This format would be perfect for running it locally on Mac.

Thanks!

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment