GPTQ vs Q4 GGUF

#2
by ciprianv - opened

Thank you for providing these GPTQ versions. How does the accuracy compare to a standard Q4 GGUF? Was there any post-quantization training or quantization-aware training (QAT)? And in vLLM/SGLang, does this have advantages over AWQ?

I also wonder about this, but from my testing this model does very well. I imagine that using the original training set as the calibration dataset would produce a good quantized model.
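To see why the calibration set matters, here is a toy sketch (not the actual GPTQ algorithm, which uses a Hessian-based weight update): 4-bit round-to-nearest quantization of one weight vector, where the quantization scale is chosen by a small grid search to minimize the layer's *output* error on calibration inputs rather than the raw weight error. All names and the grid-search heuristic are illustrative assumptions.

```python
import random

def quantize(w, scale, bits=4):
    # Symmetric round-to-nearest int quantization at a given scale,
    # clamped to the signed int4 range [-8, 7], then dequantized.
    qmax = 2 ** (bits - 1) - 1
    q = [max(-qmax - 1, min(qmax, round(x / scale))) for x in w]
    return [v * scale for v in q]

def calib_error(w, w_hat, xs):
    # Mean squared error of the layer *outputs* on calibration inputs --
    # the quantity calibration-based methods try to minimize,
    # as opposed to plain weight-space error.
    err = 0.0
    for x in xs:
        y = sum(wi * xi for wi, xi in zip(w, x))
        y_hat = sum(wi * xi for wi, xi in zip(w_hat, x))
        err += (y - y_hat) ** 2
    return err / len(xs)

random.seed(0)
w = [random.gauss(0, 1) for _ in range(64)]              # one "layer row"
xs = [[random.gauss(0, 1) for _ in range(64)] for _ in range(128)]  # calibration inputs

# Naive absmax scale vs. a grid search guided by the calibration data.
naive_scale = max(abs(x) for x in w) / 7
best_scale = min((naive_scale * (0.5 + 0.05 * i) for i in range(21)),
                 key=lambda s: calib_error(w, quantize(w, s), xs))

# The calibrated scale is never worse, since the grid includes the naive one.
print(calib_error(w, quantize(w, best_scale), xs)
      <= calib_error(w, quantize(w, naive_scale), xs))  # True
```

The better the calibration inputs match the model's real data distribution (ideally drawn from the original training set), the better this kind of search tracks the error that actually matters at inference time.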
