Nice work with this model. Can we get a 4-bit GPTQ version? I tried to create one with GPTQModel, but the process got OOM-killed.