I was wondering if the z-lab/Qwen3.5-27B-PARO model supports speculative decoding(mtp). Could you please clarify?
+1 on this question, would like to know how to add dflash to this.
· Sign up or log in to comment