Q-Zoom: Query-Aware Adaptive Perception for Efficient Multimodal Large Language Models Paper • 2604.06912 • Published 8 days ago • 8
Catching the Details: Self-Distilled RoI Predictors for Fine-Grained MLLM Perception Paper • 2509.16944 • Published Sep 21, 2025 • 3
VSSD: Vision Mamba with Non-Casual State Space Duality Paper • 2407.18559 • Published Jul 26, 2024 • 20