2x Nvidia 6000 Pros
#2
by mtcl - opened
Will this work on 2x Nvidia 6000 Pros? Can you please share a working vllm/sglang command for that?
I've tried it, it won't work. Still depending on DeepGEMM to support sm120 or reimplement in triton.
checkout https://github.com/vllm-project/vllm/pull/40991
Same here. Doesn't work.