Any chance for same 4bit quants but in AWQ format?

#1
by MDSExpro - opened

vLLM version would be amazing!

Owner

vLLM version would be amazing!

Thanks for the tip! I didn't know MLX-LM also supports AWQ. I will give it a shot if they support MiniMax

Owner
β€’
edited Feb 20

vLLM version would be amazing!

I can't seem to dequant using Transformers lib on mps unfortunately, and mlx doesnt support M2 yet 😭
Will try again tomorrow.

Sign up or log in to comment