Question on quantization

by Bercraft - opened 21 days ago

Hi what tool you used to quantize the model to gguf, since whisper.cpp only supports .bin fles. I am trying to quantize it but i cannot find a way. Could you tell me what tools you used? Thank you

FerrisMind

Oxide Lab org 21 days ago

•

edited 20 days ago

Hi!

Whisper.cpp supports both the bin format and the old-style GGUF/GGML, which has been abandoned in Llama.cpp. In order to use it, you simply need to rename your resulting bin file to GGUF. And that's it

FerrisMind changed discussion status to closed 21 days ago

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment