Qwen 3.5

by Trilogix1 - opened Mar 2

Discussion

Trilogix1

Mar 2

Did you bench it to see how much accuracy it loses, and do you have any plans to do the same to Qwen 3.5?

null-space

Owner Mar 9

I updated the model cards with an MMLU benchmark: 86.2% (-1.6% from baseline). I haven't dug into 3.5 yet, but based on my experience with the Qwen3 VL model, the multimodal models are much harder to abliterate because they have higher-order refusals. That said, I am always looking for good VL models in this size range, so maybe it's worth a shot.
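
For anyone following along who hasn't seen the term: abliteration is usually described as estimating a "refusal direction" in the model's activations and projecting it out of the weights that write to them. Here is a rough sketch of that single-direction projection, not the exact procedure used for this model. The function names, the toy 2x2 matrix, and the direction vector are all made up for illustration; real pipelines work on the model's actual weight tensors.

```python
import math

def normalize(v):
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(x * x for x in v))
    if norm == 0:
        raise ValueError("direction must be non-zero")
    return [x / norm for x in v]

def ablate_direction(weight, direction):
    """Return W' = (I - r r^T) W for a d_out x d_in matrix W.

    After this, W' @ x has no component along the unit vector r,
    whatever the input x is. `direction` is a hypothetical refusal
    direction of length d_out.
    """
    r = normalize(direction)
    d_out, d_in = len(weight), len(weight[0])
    # r^T W: how strongly each input column writes along r
    proj = [sum(r[i] * weight[i][j] for i in range(d_out)) for j in range(d_in)]
    # Subtract the rank-1 component r (r^T W) from W
    return [[weight[i][j] - r[i] * proj[j] for j in range(d_in)]
            for i in range(d_out)]

def matvec(weight, x):
    """Plain matrix-vector product."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in weight]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

# Toy example: the ablated layer's output is orthogonal to the direction
W = [[1.0, 2.0], [3.0, 4.0]]
r = [1.0, 1.0]
W_ablated = ablate_direction(W, r)
y = matvec(W_ablated, [0.5, -2.0])
print(abs(dot(normalize(r), y)) < 1e-9)
```

A single projection like this removes only one direction, which may be part of why refusals that aren't captured by one direction are harder to get rid of.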
