Learning Cross-View Object Correspondence via Cycle-Consistent Mask Prediction Paper • 2602.18996 • Published Feb 22 • 16
Ex-Omni: Enabling 3D Facial Animation Generation for Omni-modal Large Language Models Paper • 2602.07106 • Published Feb 6 • 11
Aesthetics is Cheap, Show me the Text: An Empirical Evaluation of State-of-the-Art Generative Models for OCR Paper • 2507.15085 • Published Jul 20, 2025 • 7
C3: A Bilingual Benchmark for Spoken Dialogue Models Exploring Challenges in Complex Conversations Paper • 2507.22968 • Published Jul 30, 2025 • 25
A Survey on Image Quality Assessment: Insights, Analysis, and Future Outlook Paper • 2502.08540 • Published Feb 12, 2025 • 1