CocoaBench: Evaluating Unified Digital Agents in the Wild Paper • 2604.11201 • Published 2 days ago • 29
Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective Paper • 2506.14965 • Published Jun 17, 2025 • 50
Vision-G1: Towards General Vision Language Reasoning with Multi-Domain Data Curation Paper • 2508.12680 • Published Aug 18, 2025