Any fix for this, is the quant broken or something?
Nevermind, it was an issue with the llama-server build I was using.
Β· Sign up or log in to comment