Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs? Paper • 2603.24472 • Published 21 days ago • 53
LeanK: Learnable K Cache Channel Pruning for Efficient Decoding Paper • 2508.02215 • Published Aug 4, 2025 • 12
Agent Lightning: Train ANY AI Agents with Reinforcement Learning Paper • 2508.03680 • Published Aug 5, 2025 • 140