XuebinWang
XuebinWang
·
AI & ML interests
None yet
Recent Activity
new activity 21 days ago
amd/DeepSeek-R1-0528-MXFP4-MTP-MoEFP4:Update README about vllm reproduction scripts new activity about 1 month ago
amd/MiniMax-M2.1-MXFP4:Mismatch model shape new activity 2 months ago
amd/gpt-oss-20b-WFP8-AFP8-KVFP8:Amd R9700 - vLLM crashes on startupOrganizations
Update README about vllm reproduction scripts
#3 opened 21 days ago
by
XuebinWang
Mismatch model shape
10
#1 opened 2 months ago
by
twinsen123
Amd R9700 - vLLM crashes on startup
2
#6 opened 2 months ago
by
jmander11
update readme
#5 opened 3 months ago
by
XuebinWang
Update models and readme with accuracy number and disclaimer
#7 opened 3 months ago
by
XuebinWang
update readme with disclaimer
#4 opened 3 months ago
by
XuebinWang
update readme
#3 opened 3 months ago
by
XuebinWang
update README (results etc) and upload LICENSE and USAGE_POLICY
#2 opened 3 months ago
by
XuebinWang
Update README and upload original files
#6 opened 5 months ago
by
XuebinWang
Use self_attn in config.json
#5 opened 5 months ago
by
XuebinWang
KV cache quantization in FP8
#1 opened 5 months ago
by
XuebinWang
Change to FP8 customized attention quantization, and update README
#4 opened 5 months ago
by
XuebinWang
update model with several fixings
#5 opened 5 months ago
by
XuebinWang
Update README (NOT ready to use)
#3 opened 6 months ago
by
XuebinWang
Update README (NOT ready to use)
#3 opened 6 months ago
by
XuebinWang
Update README
#2 opened 6 months ago
by
XuebinWang
Initial commit to be used in vllm PR#27334
#1 opened 6 months ago
by
XuebinWang
update readme and upload LICENSE file
#2 opened 7 months ago
by
XuebinWang
update readme and upload LICENSE file
#2 opened 7 months ago
by
XuebinWang
copy needed files from original meta-llama
#4 opened 7 months ago
by
XuebinWang