WildDet3D: Scaling Promptable 3D Detection in the Wild Paper • 2604.08626 • Published 7 days ago • 229
MolmoWeb: Open Visual Web Agent and Open Data for the Open Web Paper • 2604.08516 • Published 7 days ago • 41
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published 14 days ago • 470
DreamLite: A Lightweight On-Device Unified Model for Image Generation and Editing Paper • 2603.28713 • Published 16 days ago • 19
STRIDE: When to Speak Meets Sequence Denoising for Streaming Video Understanding Paper • 2603.27593 • Published 17 days ago • 12
MMFace-DiT: A Dual-Stream Diffusion Transformer for High-Fidelity Multimodal Face Generation Paper • 2603.29029 • Published 16 days ago • 13
j05hr3d/Llama-3.2-3B-Instruct-C_M_T_CT_CE_CM-2EP-SEED999 Text Generation • 3B • Updated 15 days ago • 348 • 1
SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models Paper • 2603.16859 • Published 29 days ago • 248