WildDet3D: Scaling Promptable 3D Detection in the Wild Paper • 2604.08626 • Published 9 days ago • 237
A Systematic Study of Cross-Modal Typographic Attacks on Audio-Visual Reasoning Paper • 2604.03995 • Published 13 days ago • 4
When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models Paper • 2604.08546 • Published 9 days ago • 114
MegaStyle: Constructing Diverse and Scalable Style Dataset via Consistent Text-to-Image Style Mapping Paper • 2604.08364 • Published 9 days ago • 96
Paper Espresso: From Paper Overload to Research Insight Paper • 2604.04562 • Published 12 days ago • 12