Could you please add a Q3 quant? 12GB of VRAM doesn't seem to be enough :(

#2
by fawogin598 - opened

Thank you in advance :)

But why? My 3080 10G can run the Q5 model too, and even Q8, although it is very slow.

You should be using sageattention to make this chug along quicker. By the way, any chance of a Q6? I think it would be worth it for the quality boost, at least.

For me, on 12GB, this raises an out-of-memory error (screenshot: image.png).

Could you please share your workflow? Or the model part of it? @AC116