MTP results with vLLM inside

#2
by unoid - opened

Did this for 122b, now here 27B
vLLM Qwen3.5 27B Benchmark Results
FP8 Quantization - Speculative Decoding - March 3, 2026

RTX Pro 6000 Blackwell - 300W - vLLM 0.16.1rc1.dev173+g8fa68a8ce

image

image

unoid changed discussion status to closed

Sign up or log in to comment