Why it gives me OOM with 21gb gguf model but gens fine with 27gb fp8 safetensor?

#12
by Zuzuus - opened

I'm using the same workflow, same parameters, the only change is for the gguf loader and gguf text encoder.
I have 16gb vram and 32gb ram

Sign up or log in to comment