Anchor Forcing: Anchor Memory and Tri-Region RoPE for Interactive Streaming Video Diffusion Paper • 2603.13405 • Published Mar 12
SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing Paper • 2604.04911 • Published 10 days ago • 35
TriAttention: Efficient Long Reasoning with Trigonometric KV Compression Paper • 2604.04921 • Published 10 days ago • 107
Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing Paper • 2603.12254 • Published Mar 12 • 22
SANA-Video Collection SANA-Video: Efficient Video Generation with Block Linear Diffusion Transformer • 10 items • Updated about 1 month ago • 7
3D Aware Region Prompted Vision Language Model Paper • 2509.13317 • Published Sep 16, 2025 • 14
ArcMemo: Abstract Reasoning Composition with Lifelong LLM Memory Paper • 2509.04439 • Published Sep 4, 2025 • 1
KVCOMM: Online Cross-context KV-cache Communication for Efficient LLM-based Multi-agent Systems Paper • 2510.12872 • Published Oct 14, 2025 • 4
OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM Paper • 2510.15870 • Published Oct 17, 2025 • 92