Parameter model.layers.15.mlp.gate_gate_up_proj.weight_scale_inv not found in params_dict
When run with sglang@0.5.10.post1:
Multi-thread loading shards: 0% Completed | 0/66 [00:00<?, ?it/s][2026-04-23 02:48:49] Parameter model.layers.15.mlp.gate_gate_up_proj.weight_scale_inv not found in params_dict
[2026-04-23 02:48:50] Parameter model.layers.15.mlp.gate_up_proj.weight_scale_inv not found in params_dict
Multi-thread loading shards: 2% Completed | 1/66 [00:01<01:10, 1.09s/it][2026-04-23 02:48:50] Parameter model.layers.23.mlp.gate_gate_up_proj.weight_scale_inv not found in params_dict
[2026-04-23 02:48:51] Parameter model.layers.23.mlp.gate_up_proj.weight_scale_inv not found in params_dict
Multi-thread loading shards: 3% Completed | 2/66 [00:02<01:10, 1.10s/it][2026-04-23 02:48:52] Parameter model.layers.52.mlp.gate_gate_up_proj.weight_scale_inv not found in params_dict
[2026-04-23 02:48:52] Parameter model.layers.52.mlp.gate_up_proj.weight_scale_inv not found in params_dict
Multi-thread loading shards: 5% Completed | 3/66 [00:03<01:09, 1.11s/it][2026-04-23 02:48:53] Parameter model.layers.19.mlp.gate_gate_up_proj.weight_scale_inv not found in params_dict
...
Is this expected behaviour?
It seems the model broke completely:
edListicc书城feretzDW问答 Lair辙锵 fair公平公平公平公平公平的公平公平公平公平公平公平公平公平公平公平公平公平公平公平的公平的公平公平公平公平公平公平的公平公平的公平的公平的公平的公平的公平的公平公平公平公平公平公平公平公平公平公平公平公平公平公平的fair公平公平公平公平公平公平公平公平公平公平公平公平公平公平公平公平公平公平的公平公平的公平公平公平公平的公平公平的公平公平公平公平公平公平公平公平公平的公平公平公平公平公平公平公平的公平的公平的公平的公平的公平公平公平公平的公平公平公平的公平公平公平公平公平公平公平公平公平公平公平公平公平公平公平公平公平公平公平公平的公平公平的公平公平公平公平
Confirmed!
Got same issue with sglang
Update: also working fine with vLLM 0.19
apply this commit https://github.com/sgl-project/sglang/commit/4323fce82a091fab154bf36baa5820659ec0fd16
apply this commit
https://github.com/sgl-project/sglang/commit/4323fce82a091fab154bf36baa5820659ec0fd16
Thanks, it works! I tried to use vllm, but sglang decodes ~2x faster on my setup