Vision support

by raulalonsoctic - opened Feb 24

Feb 24

Any chance you could add the vision decoder to this model? This is the only NVFP4 quantization that works reliably under vLLM

raulalonsoctic changed discussion status to closed Feb 24

raulalonsoctic changed discussion status to open Feb 24

Sehyo

Owner Feb 24

The vision encoder is in the model in shard 5, it is unquantized (therefore in ignore list) and it should work.

Sehyo

Owner Feb 24

I have uploaded a new revision that includes some configs that were missing for multi-modal purposes. Can you try with that and see if vision works? Thanks.

raulalonsoctic

Feb 24

Working great on my first tests. Thank you very much!!!

Sehyo

Owner Feb 25

Thanks, Im uploading the nvfp4 for the 122B version as we speak, with all configs etc.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment