2x Nvidia 6000 Pros

#2
by mtcl - opened

Will this work on 2x Nvidia 6000 Pros? Can you please share a working vllm/sglang command for that?

I've tried it, it won't work. Still depending on DeepGEMM to support sm120 or reimplement in triton.
checkout https://github.com/vllm-project/vllm/pull/40991

Same here. Doesn't work.

Sign up or log in to comment