Watch Before You Answer: Learning from Visually Grounded Post-Training Paper • 2604.05117 • Published 10 days ago • 35
SWE-Next: Scalable Real-World Software Engineering Tasks for Agents Paper • 2603.20691 • Published 26 days ago • 10
OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis Paper • 2603.20278 • Published 29 days ago • 94
VisPhyWorld: Probing Physical Reasoning via Code-Driven Video Reconstruction Paper • 2602.13294 • Published Feb 9 • 13
TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models Paper • 2512.02014 • Published Dec 1, 2025 • 74
VisCoder2: Building Multi-Language Visualization Coding Agents Paper • 2510.23642 • Published Oct 24, 2025 • 22