LongVU - a Vision-CAIR Collection

Vision-CAIR 's Collections

LongVU

updated 6 days ago

Vision-CAIR/LongVU_Qwen2_7B

Video-Text-to-Text • 8B • Updated Feb 28, 2025 • 199 • 76
Vision-CAIR/LongVU_Llama3_2_3B

Video-Text-to-Text • Updated Feb 28, 2025 • 23 • 8
Vision-CAIR/LongVU_Llama3_2_3B_img

Updated Feb 28, 2025 • 5 • 6
Vision-CAIR/LongVU_Qwen2_7B_img

Updated Feb 28, 2025 • 8 • 5
Vision-CAIR/LongVU_Llama3_2_1B

Video-Text-to-Text • Updated Feb 28, 2025 • 22 • 12
Vision-CAIR/LongVU_Llama3_2_1B_img

Updated Oct 24, 2024 • 2
Running on Zero

Agents

88

LongVU

🌖

88

Generate responses to video or image inputs
LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding

Paper • 2410.17434 • Published Oct 22, 2024 • 27