SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning Paper • 2603.23483 • Published 22 days ago • 62
SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models Paper • 2603.16859 • Published 29 days ago • 248
SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models Paper • 2603.16859 • Published 29 days ago • 248
SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models Paper • 2603.16859 • Published 29 days ago • 248
laion/CLIP-ViT-L-14-laion2B-s32B-b82K Zero-Shot Image Classification • 0.4B • Updated Jan 16, 2024 • 358k • 63