other versions

by michel-io - opened Jan 31

Discussion

michel-io

Jan 31

works great but slow on limited vram. can you make fp8 versions? and fp16 versions for older cards?

BennyDaBall

Owner Jan 31

https://huggingface.co/mradermacher/Qwen3-4b-Z-Image-Turbo-AbliteratedV1-GGUF --static

https://huggingface.co/mradermacher/Qwen3-4b-Z-Image-Turbo-AbliteratedV1-i1-GGUF --imatrix
@mradermacher beat me making quants!

works great but slow on limited vram. can you make fp8 versions? and fp16 versions for older cards?
Thank you! I will upload some GGUF format Q2-Q8. thank you!

BennyDaBall changed discussion status to closed Feb 7

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment