The Blind Spot of Agent Safety: How Benign User Instructions Expose Critical Vulnerabilities in Computer-Use Agents
Paper • 2604.10577 • Published • 24
Natural Language Processing
The Blind Spot of Agent Safety: How Benign User Instructions Expose Critical Vulnerabilities in Computer-Use Agents
Video-Based Reward Modeling for Computer-Use Agents