Collections

Collections including paper arxiv:2603.19220

Nemotron-Cascade 2

Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

nvidia/Nemotron-Cascade-2-30B-A3B

Text Generation • 32B • Updated 9 days ago • 317k • 476
nvidia/Nemotron-Cascade-2-RL-data

Viewer • Updated about 1 month ago • 55.7k • 1.41k • 47
nvidia/Nemotron-Cascade-2-SFT-Data

Viewer • Updated about 1 month ago • 15.9M • 18.6k • 54
Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Paper • 2603.19220 • Published about 1 month ago • 66

General interest

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Paper • 2603.19220 • Published about 1 month ago • 66

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Paper • 2603.19220 • Published about 1 month ago • 66

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Paper • 2603.19220 • Published about 1 month ago • 66

Allanatrix/ProtienBank

Viewer • Updated Jun 27, 2025 • 100k • 13
shuyuej/French-MMLU-Medical-Genetics-Benchmark

Viewer • Updated Jun 8, 2024 • 100 • 3 • 1
Wang4XD/gemma-3-neuroscience-1b

1.0B • Updated Apr 23, 2025 • 2 • 2
nvidia/Nemotron-Cascade-2-30B-A3B

Text Generation • 32B • Updated 9 days ago • 317k • 476

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Paper • 2603.19220 • Published about 1 month ago • 66
CHIMERA: Compact Synthetic Data for Generalizable LLM Reasoning

Paper • 2603.00889 • Published Mar 1 • 56

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Paper • 2603.19220 • Published about 1 month ago • 66

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Paper • 2603.19220 • Published about 1 month ago • 66
Fish Audio S2 Technical Report

Paper • 2603.08823 • Published Mar 9 • 37

MapTrace: Scalable Data Generation for Route Tracing on Maps

Paper • 2512.19609 • Published Dec 22, 2025 • 3
Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Paper • 2603.19220 • Published about 1 month ago • 66
nyu-mll/glue

Viewer • Updated Jan 30, 2024 • 1.49M • 403k • 489
Grounding World Simulation Models in a Real-World Metropolis

Paper • 2603.15583 • Published Mar 16 • 153

microsoft/bitnet-b1.58-2B-4T

Text Generation • 0.8B • Updated Dec 17, 2025 • 15.5k • 1.43k
M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models

Paper • 2504.10449 • Published Apr 14, 2025 • 15
nvidia/Llama-3.1-Nemotron-8B-UltraLong-2M-Instruct

Text Generation • 8B • Updated Apr 17, 2025 • 96 • 17
ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

Paper • 2504.11536 • Published Apr 15, 2025 • 63
