Story2Proposal: A Scaffold for Structured Scientific Paper Writing Paper • 2603.27065 • Published 21 days ago • 22
Density-aware Soft Context Compression with Semi-Dynamic Compression Ratio Paper • 2603.25926 • Published 22 days ago • 8
OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data Paper • 2603.15594 • Published Mar 16 • 149
view article Article The Future of the Global Open-Source AI Ecosystem: From DeepSeek to AI+ Feb 3 • 52
view article Article LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family Jan 19 • 91
Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs Paper • 2412.21187 • Published Dec 30, 2024 • 40
NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models Paper • 2405.17428 • Published May 27, 2024 • 20
🪐 SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated May 5, 2025 • 247
AudioPaLM: A Large Language Model That Can Speak and Listen Paper • 2306.12925 • Published Jun 22, 2023 • 56