Scaling Test-Time Robustness of Vision-Language Models via Self-Critical Inference Framework Paper • 2603.07659 • Published Mar 25 • 3
MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification Paper • 2603.15726 • Published Mar 16 • 186
MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling Paper • 2511.11793 • Published Nov 14, 2025 • 195
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification Paper • 2508.05629 • Published Aug 7, 2025 • 189
LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL Paper • 2503.07536 • Published Mar 10, 2025 • 88
Running on CPU Upgrade Agents 1.01k Open VLM Leaderboard 🌎 1.01k VLMEvalKit Evaluation Results Collection