Could you please add the Q3? Cause 12GB VRAM doesn't seem to be enough :(

by fawogin598 - opened Jul 22, 2025

Discussion

fawogin598

Jul 22, 2025

Thank you in advance :)

hanzogak

Owner Jul 23, 2025

I don't recommend using Q3_K_M, but it will work.
https://huggingface.co/hanzogak/Wan2.1-Anisora-14B-I2V-480P-GGUF/blob/main/Wan2_1-Anisora-I2V-480P-14B_Q3_K_M.gguf

AC116

Jul 24, 2025

但为什么呢？我的3080 10G 也可以跑Q5模型，甚至Q8，虽然很慢

AC116

Jul 24, 2025

I don't recommend using Q3_K_M, but it will work.
https://huggingface.co/hanzogak/Wan2.1-Anisora-14B-I2V-480P-GGUF/blob/main/Wan2_1-Anisora-I2V-480P-14B_Q3_K_M.gguf

有什么差距呢

MarcoZolo

Jul 28, 2025

You should be using sageattention to make this chug along quicker. By the way any chance of Q6? I think for the quality boost at least

fawogin598

Aug 15, 2025

For me, on 12GB, this raises out of memory:

fawogin598

Aug 15, 2025

但为什么呢？我的3080 10G 也可以跑Q5模型，甚至Q8，虽然很慢

Could you please share your workflow? Or the model part of it? @AC116

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment