Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation Paper • 2603.19220 • Published 29 days ago • 66
Nemotron-Cascade 2 Collection Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation • 4 items • Updated 3 days ago • 48
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 15 items • Updated 3 days ago • 268
Nemotron-Cascade Collection Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • 14 items • Updated 3 days ago • 54
NVIDIA Nemotron V2 Collection Open, Production-ready Enterprise Models. NVIDIA Open Model License. • 9 items • Updated 3 days ago • 104
AceReason-Nemotron 1.1: Advancing Math and Code Reasoning through SFT and RL Synergy Paper • 2506.13284 • Published Jun 16, 2025 • 26
AceReason Collection Math and Code reasoning models trained through reinforcement learning (RL) • 7 items • Updated 3 days ago • 21
AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning Paper • 2505.16400 • Published May 22, 2025 • 37
AceMath-RL Collection Math reasoning models trained through reinforcement learning (RL) • 1 item • Updated 3 days ago • 6
AceMath Collection We are releasing math instruction models, math reward models, general instruction models, all training datasets, and a math reward benchmark. • 11 items • Updated 3 days ago • 17
Llama3-ChatQA-1.5 Collection Llama3-ChatQA-1.5 models excel at conversational question answering (QA) and retrieval-augmented generation (RAG). • 6 items • Updated 3 days ago • 46