Nemotron-Cascade 2 Collection Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation • 4 items • Updated 2 days ago • 48
OmniScience: A Large-scale Multi-modal Dataset for Scientific Image Understanding Paper • 2602.13758 • Published Feb 14 • 6
view article Article LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family Jan 19 • 91
RxnBench: A Multimodal Benchmark for Evaluating Large Language Models on Chemical Reaction Understanding from Scientific Literature Paper • 2512.23565 • Published Dec 29, 2025 • 1
view article Article NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks Aug 11, 2025 • 76
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper • 2503.11576 • Published Mar 14, 2025 • 156
MolParser: End-to-end Visual Recognition of Molecule Structures in the Wild Paper • 2411.11098 • Published Nov 17, 2024 • 1