Do Thought Streams Matter? Evaluating Reasoning in Gemini Vision-Language Models for Video Scene Understanding Paper • 2604.11177 • Published 3 days ago • 6
Do Thought Streams Matter? Evaluating Reasoning in Gemini Vision-Language Models for Video Scene Understanding Paper • 2604.11177 • Published 3 days ago • 6
Benchmarking Vision-Language Models on Optical Character Recognition in Dynamic Video Environments Paper • 2502.06445 • Published Feb 10, 2025