StreamingVLM: Real-Time Understanding for Infinite Video Streams Paper • 2510.09608 • Published Oct 10, 2025
CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning Paper • 2503.19900 • Published Mar 25, 2025
FLAIR: VLM with Fine-grained Language-informed Image Representations Paper • 2412.03561 • Published Dec 4, 2024
Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM Paper • 2403.07816 • Published Mar 12, 2024