Support SM120

by darkstar3537 - opened Feb 22

•

Dear Nvidia, stop being assholes and provide better support for SM120 cards. These cards cost a great deal of money and are used by professionals in the field. The lack of support around nvfp4 in this space is a joke. Do better. I know datacenter cards let Jensen buy more leather jackets, but I think he's good for the moment.

aabbccddwasd

Feb 23

vllm already supported.
trt-llm is trash running qwen3.5 even on B200, according to github

jukingjack1

Feb 23

care to actually back that up with your config? vLLM does not support it as far as I can tell

aabbccddwasd

Feb 24

care to actually back that up with your config? vLLM does not support it as far as I can tell

because bad quant config
add .experts.mlp.gate and mtp.fc* to ignore list

jukingjack1

Feb 25

care to actually back that up with your config? vLLM does not support it as far as I can tell

because bad quant config
add .experts.mlp.gate and mtp.fc* to ignore list

Can you share your config because MTP is not compatible with NFP4 unless I am missing something?

aabbccddwasd

Feb 26

care to actually back that up with your config? vLLM does not support it as far as I can tell

because bad quant config
add .experts.mlp.gate and mtp.fc* to ignore list

Can you share your config because MTP is not compatible with NFP4 unless I am missing something?

sorry but I have problems uploading files.
you can look at my comment in this https://huggingface.co/vincentzed-hf/Qwen3.5-397B-A17B-NVFP4/discussions/1

josephbreda

Mar 8

It's astonishing to me that a company with a $4T+ market cap, virtually unlimited cash and 40,000 employees so badly botched:
-Day ONE complete software support for Blackwell
-TRT-LLM support
-Introduction of NVFP4, so breathlessly hyped to by optimized for Blackwell but so far has been a complete turd in actual use.

These are signs of a company getting arrogant for want of competition. I hope they see some soon.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment