request for generation steps

by kartikvyas1 - opened Mar 17

Mar 17

Hi, this is really cool. Could you please explain how this was generated from the bf16 model? I would love to learn. Thanks in advance.
For example, how many sample inputs were used? Did you also use the deepcompressor framework shared by the SVDQuant authors? If so, how should it be tweaked for qwen2511?

ovedrive

Owner Mar 19

Quantization is not the same as training. You can check the Hugging Face docs on quantization and diffusers. No other model or external LoRA layers were added; it is still the same original Qwen.
If you mean qwen-2512, I have already provided the NF4 version in another repo.
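
To illustrate why this kind of quantization needs no training and no sample inputs, here is a minimal sketch of blockwise 4-bit absmax quantization in plain Python. It is not the procedure used for this repo: the function names and the block size are made up for illustration, and it uses a uniform 15-level grid, whereas NF4 uses a non-uniform codebook. The point is only that the existing weights are re-encoded one block at a time, with nothing added to the model.

```python
from typing import List, Tuple


def quantize_4bit(weights: List[float], block_size: int = 4) -> Tuple[List[int], List[float]]:
    """Hypothetical helper: encode each weight as a 4-bit code plus one scale per block."""
    codes: List[int] = []
    scales: List[float] = []
    for start in range(0, len(weights), block_size):
        block = weights[start:start + block_size]
        # Absmax scale for the block; fall back to 1.0 for an all-zero block.
        scale = max(abs(w) for w in block) or 1.0
        scales.append(scale)
        for w in block:
            # Map [-scale, scale] onto integer levels -7..7, then shift to 0..14.
            level = round(w / scale * 7)
            codes.append(level + 7)
    return codes, scales


def dequantize_4bit(codes: List[int], scales: List[float], block_size: int = 4) -> List[float]:
    """Hypothetical helper: recover approximate weights from codes and block scales."""
    return [(c - 7) / 7 * scales[i // block_size] for i, c in enumerate(codes)]


if __name__ == "__main__":
    original = [0.5, -1.0, 0.25, 0.0, 2.0, -0.5, 1.0, 0.1]
    codes, scales = quantize_4bit(original)
    restored = dequantize_4bit(codes, scales)
    print(codes)
    print(restored)
```

The number of weights is unchanged after the round trip; only their precision drops. Calibration-based methods such as SVDQuant work differently and do use sample inputs.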

ovedrive changed discussion status to closed Mar 19
