Look Where It Matters: High-Resolution Crops Retrieval for Efficient VLMs Paper • 2603.16932 • Published Mar 14 • 87
ID-LoRA: Identity-Driven Audio-Video Personalization with In-Context LoRA Paper • 2603.10256 • Published Mar 10 • 22
LiteAttention: A Temporal Sparse Attention for Diffusion Transformers Paper • 2511.11062 • Published Nov 14, 2025 • 33
LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content Paper • 2410.10783 • Published Oct 14, 2024 • 26
AutoSAM: Adapting SAM to Medical Images by Overloading the Prompt Encoder Paper • 2306.06370 • Published Jun 10, 2023 • 1