view article Article Introducing Daggr: Chain apps programmatically, inspect visually +3 Jan 29 • 106
MAI-UI Technical Report: Real-World Centric Foundation GUI Agents Paper • 2512.22047 • Published Dec 26, 2025 • 30
InsertAnywhere: Bridging 4D Scene Geometry and Diffusion Models for Realistic Video Object Insertion Paper • 2512.17504 • Published Dec 19, 2025 • 99
Cosmos-Reason1 Collection ⚠️ The latest version of Cosmos Reason is now live! 👉 https://huggingface.co/collections/nvidia/cosmos-reason2 • 5 items • Updated 3 days ago • 41
view article Article SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data +7 Jun 3, 2025 • 344
view article Article LeRobot Community Datasets: The “ImageNet” of Robotics — When and How? +5 May 11, 2025 • 96
view article Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control +2 Feb 4, 2025 • 192
Android in the Wild: A Large-Scale Dataset for Android Device Control Paper • 2307.10088 • Published Jul 19, 2023 • 12
OpenMask3D: Open-Vocabulary 3D Instance Segmentation Paper • 2306.13631 • Published Jun 23, 2023 • 11