LPO: Towards Accurate GUI Agent Interaction via Location Preference Optimization Paper • 2506.09373 • Published Jun 11, 2025 • 1
Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training Paper • 2603.12255 • Published Mar 12 • 91
LongVideoAgent: Multi-Agent Reasoning with Long Videos Paper • 2512.20618 • Published Dec 23, 2025 • 56
Robust-R1: Degradation-Aware Reasoning for Robust Visual Understanding Paper • 2512.17532 • Published Dec 19, 2025 • 68
Hawk: Learning to Understand Open-World Video Anomalies Paper • 2405.16886 • Published May 27, 2024 • 1
LongLive: Real-time Interactive Long Video Generation Paper • 2509.22622 • Published Sep 26, 2025 • 189