ViGoR-Bench: How Far Are Visual Generative Models From Zero-Shot Visual Reasoners? Paper • 2603.25823 • Published 18 days ago • 43
Temporal Memory Attention for Video Semantic Segmentation Paper • 2102.08643 • Published Feb 17, 2021
OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion Paper • 2407.07844 • Published Jul 10, 2024 • 1