AgentHazard: A Benchmark for Evaluating Harmful Behavior in Computer-Use Agents Paper • 2604.02947 • Published 12 days ago • 19
PixelSmile: Toward Fine-Grained Facial Expression Editing Paper • 2603.25728 • Published 19 days ago • 117