Q3_K_XL outputs EOS token immediately; every message results in a 1-token response that ends at once

#3
by Goldkoron - opened

Is there any fix for this? Is the quant broken or something?

Never mind, it was an issue with the llama-server build I was using.

Goldkoron changed discussion status to closed
