Can it be updated for the new, faster, llama.cpp implementation

#3
by juanml82 - opened

A recent llama.cpp update makes Qwen 3 Next faster (https://github.com/ggml-org/llama.cpp/pull/18683), but it requires updated GGUFs. Unsloth has already released updated GGUFs for the regular model. Would you be able to release updated GGUFs of this abliterated model as well, so it benefits from the speedup?

Due to the large number of models we host on hf.co, our available storage has become limited. You can either quantize and upload these models to hf.co yourself, or test them locally.
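For anyone wanting to do this themselves, a rough sketch of the usual llama.cpp workflow is below. All paths and repo names are placeholders, and this assumes a local llama.cpp checkout that already includes the PR linked above (build it with CMake so `llama-quantize` is available):

```shell
# Download the abliterated model weights to a local directory (placeholder repo name)
huggingface-cli download your-username/the-abliterated-model --local-dir ./model-dir

# Convert the HF safetensors checkpoint to a GGUF file using llama.cpp's converter
python convert_hf_to_gguf.py ./model-dir --outfile model-f16.gguf --outtype f16

# Quantize the f16 GGUF down to a smaller format, e.g. Q4_K_M
./build/bin/llama-quantize model-f16.gguf model-Q4_K_M.gguf Q4_K_M

# Upload the result to your own hf.co repo
huggingface-cli upload your-username/your-gguf-repo model-Q4_K_M.gguf
```

Since the linked PR changes the GGUF layout for Qwen 3 Next, the conversion step has to be run with a llama.cpp version that contains that change; GGUFs produced by older converters won't pick up the speedup.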
