CocoaBench: Evaluating Unified Digital Agents in the Wild • Paper • 2604.11201 • Published 5 days ago • 33
AutoToM: Automated Bayesian Inverse Planning and Model Discovery for Open-ended Theory of Mind • Paper • 2502.15676 • Published Feb 21, 2025 • 3