Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation • Paper 2504.17207 • Published Apr 24, 2025
Token Warping Helps MLLMs Look from Nearby Viewpoints • Paper 2604.02870 • Published 13 days ago