Zhiyu Cheng
AI & ML interests
None yet
Recent Activity
new activity about 14 hours ago
nvidia/Gemma-4-31B-IT-NVFP4:Update chat_template.jinja new activity about 14 hours ago
nvidia/Gemma-4-31B-IT-NVFP4:Update tokenizer_config.json updated a model 3 days ago
nvidia/MiniMax-M2.5-NVFP4Organizations
Update chat_template.jinja
#11 opened 1 day ago
by
zhiyucheng
Update tokenizer_config.json
#12 opened 1 day ago
by
zhiyucheng
Update README.md
#10 opened 3 days ago
by
tstarkey-nvidia
Update README.md
1
#5 opened 12 days ago
by
kaihangj
Update model card with evaluation results
#6 opened 24 days ago
by
jingyux-nv
Fix: add .model after language_model in quantization ignore/exclude_modules
#5 opened about 2 months ago
by
zhiyucheng
Fix: add .model after language_model in quantization ignore/exclude_modules
#4 opened about 2 months ago
by
zhiyucheng
Transformers v5 support
#3 opened 2 months ago
by
nv-fszarwacki
update config for exclude modules
#3 opened 4 months ago
by
shengliangx
update config for exclude modules
#1 opened 4 months ago
by
shengliangx
update config for exclude modules
#3 opened 4 months ago
by
shengliangx
Use actual module path in ignore
2
#2 opened 4 months ago
by
shengliangx
Update README.md
#3 opened 6 months ago
by
alejandrar
Update README.md
#2 opened 6 months ago
by
alejandrar
Update README.md
1
#2 opened 6 months ago
by
alejandrar
Update README.md
#1 opened 7 months ago
by
huizimao
Update README.md
1
#2 opened 12 months ago
by
RestingCodeFace
Update README.md
#1 opened 12 months ago
by
omrialmog
Request for Detailed Benchmarking Setup with TensorRT-LLM on B200
➕ 4
1
#6 opened about 1 year ago
by
StardusterLiu
Benchmark results compared to orig fp8 / int4 quants etc?
➕ 15
6
#1 opened about 1 year ago
by
CHNtentes