Zihan Liu

zihanliu

https://zliucr.github.io/

zliucr

AI & ML interests

None yet

Recent Activity

updated a model 8 days ago

nvidia/Nemotron-Cascade-2-30B-A3B

updated a collection 29 days ago

Nemotron-Cascade 2

upvoted a paper 29 days ago

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

upvoted a paper 29 days ago

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Paper • 2603.19220 • Published 29 days ago • 66

upvoted a collection 29 days ago

Nemotron-Cascade 2

Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation • 4 items • Updated 3 days ago • 48

upvoted 2 collections 4 months ago

NVIDIA Nemotron v3

Open, Production-ready Enterprise Models • 15 items • Updated 3 days ago • 268

Nemotron-Cascade

Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • 14 items • Updated 3 days ago • 54

upvoted a collection 8 months ago

NVIDIA Nemotron V2

Open, Production-ready Enterprise Models. Nvidia Open Model license. • 9 items • Updated 3 days ago • 104

upvoted a paper 10 months ago

AceReason-Nemotron 1.1: Advancing Math and Code Reasoning through SFT and RL Synergy

Paper • 2506.13284 • Published Jun 16, 2025 • 26

upvoted a collection 10 months ago

AceReason

Math and Code reasoning model trained through reinforcement learning (RL) • 7 items • Updated 3 days ago • 21

upvoted a paper 10 months ago

AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning

Paper • 2505.16400 • Published May 22, 2025 • 37

upvoted a collection 12 months ago

AceMath-RL

Math reasoning models trained through reinforcement learning (RL) • 1 item • Updated 3 days ago • 6

upvoted a collection over 1 year ago

AceMath

We are releasing math instruction models, math reward models, general instruction models, all training datasets, and a math reward benchmark. • 11 items • Updated 3 days ago • 17

upvoted a paper over 1 year ago

NVLM: Open Frontier-Class Multimodal LLMs

Paper • 2409.11402 • Published Sep 17, 2024 • 74

upvoted a collection almost 2 years ago

Llama3-ChatQA-1.5

Llama3-ChatQA-1.5 models excel at conversational question answering (QA) and retrieval-augmented generation (RAG). • 6 items • Updated 3 days ago • 46