view article Article How we OCR'ed 30,000 papers using Codex, open OCR models and Jobs 5 days ago • 40
view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 11 days ago • 822
GPT-1900 Collection Pre-1900 LLMs for physics reasoning. RL models are physics-only; use the SFT model for general chat. Tune temperature (0.6-0.7). • 11 items • Updated 10 days ago • 6
Nemotron-Post-Training-v3 Collection Collection of datasets used in the post-training phase of Nemotron Nano and Super v3. • 28 items • Updated 6 days ago • 119
view article Article Nemotron 3 Nano 4B: A Compact Hybrid Model for Efficient Local AI 26 days ago • 62
Nemotron-Cascade 2 Collection Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation • 4 items • Updated 6 days ago • 47
FlashSampling: Fast and Memory-Efficient Exact Sampling Paper • 2603.15854 • Published 27 days ago • 9
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 15 items • Updated 6 days ago • 265
view article Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 Mar 10 • 124
Qwen3.5 Collection Qwen3.5 is Qwen's new model family including Qwen3.5 Small: 0.8B, 2B, 4B, 9B and Qwen3.5 Medium: 35B-A3B, 27B, 122B-A10B and 397B-A17B. • 25 items • Updated 8 days ago • 140
view article Article OpenEnv in Practice: Evaluating Tool-Using Agents in Real-World Environments +3 Feb 12 • 32
view article Article IBM and UC Berkeley Diagnose Why Enterprise Agents Fail Using IT-Bench and MAST Feb 18 • 18
Nemotron-Terminal Collection We are releasing Nemotron-Terminal models and training datasets. • 5 items • Updated 6 days ago • 34
view article Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 Feb 20 • 501
NL2Repo-Bench: Towards Long-Horizon Repository Generation Evaluation of Coding Agents Paper • 2512.12730 • Published Dec 14, 2025 • 52