ProactiveBench: Benchmarking Proactiveness in Multimodal Large Language Models Paper • 2603.19466 • Published 27 days ago • 41
TerraScope: Pixel-Grounded Visual Reasoning for Earth Observation Paper • 2603.19039 • Published 28 days ago • 50
Data Repetition Beats Data Scaling in Long-CoT Supervised Fine-Tuning Paper • 2602.11149 • Published Feb 11 • 17
view article Article Re-understanding KL Approximation from an RL-for-LLM Lens: Notes on “Approximating KL Divergence” Aug 11, 2025 • 10