VISion On Request: Enhanced VLLM efficiency with sparse, dynamically selected, vision-language interactions • Paper 2603.23495 • Published 24 days ago • 3
More Images, More Problems? A Controlled Analysis of VLM Failure Modes • Paper 2601.07812 • Published Jan 12 • 6
One missing piece in Vision and Language: A Survey on Comics Understanding • Paper 2409.09502 • Published Sep 14, 2024 • 24