PixelSmile: Toward Fine-Grained Facial Expression Editing Paper • 2603.25728 • Published 22 days ago • 117
Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing Paper • 2603.12254 • Published Mar 12 • 22
PEARL: Personalized Streaming Video Understanding Model Paper • 2603.20422 • Published 28 days ago • 40
UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation Paper • 2603.23500 • Published 24 days ago • 35
Are We on the Right Way to Assessing LLM-as-a-Judge? Paper • 2512.16041 • Published Dec 17, 2025 • 34
WebRPG: Automatic Web Rendering Parameters Generation for Visual Presentation Paper • 2407.15502 • Published Jul 22, 2024 • 1
Wait, We Don't Need to "Wait"! Removing Thinking Tokens Improves Reasoning Efficiency Paper • 2506.08343 • Published Jun 10, 2025 • 54