Collections
Discover the best community collections!
Collections including paper arxiv:2506.18088
-
From Intention to Execution: Probing the Generalization Boundaries of Vision-Language-Action Models
Paper • 2506.09930 • Published • 8 -
SAFE: Multitask Failure Detection for Vision-Language-Action Models
Paper • 2506.09937 • Published • 9 -
Hidden in plain sight: VLMs overlook their visual representations
Paper • 2506.08008 • Published • 7 -
RoboTwin 2.0: A Scalable Data Generator and Benchmark with Strong Domain Randomization for Robust Bimanual Robotic Manipulation
Paper • 2506.18088 • Published • 18
-
MOSAIC: Modeling Social AI for Content Dissemination and Regulation in Multi-Agent Simulations
Paper • 2504.07830 • Published • 18 -
WORLDMEM: Long-term Consistent World Simulation with Memory
Paper • 2504.12369 • Published • 35 -
Towards a Unified Copernicus Foundation Model for Earth Vision
Paper • 2503.11849 • Published • 5 -
VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory
Paper • 2506.18903 • Published • 22
-
Unified Vision-Language-Action Model
Paper • 2506.19850 • Published • 28 -
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics
Paper • 2506.01844 • Published • 158 -
3D-VLA: A 3D Vision-Language-Action Generative World Model
Paper • 2403.09631 • Published • 12 -
QUAR-VLA: Vision-Language-Action Model for Quadruped Robots
Paper • 2312.14457 • Published • 1
-
SwarmAgentic: Towards Fully Automated Agentic System Generation via Swarm Intelligence
Paper • 2506.15672 • Published • 15 -
SAFEFLOW: A Principled Protocol for Trustworthy and Transactional Autonomous Agent Systems
Paper • 2506.07564 • Published • 6 -
RoboTwin 2.0: A Scalable Data Generator and Benchmark with Strong Domain Randomization for Robust Bimanual Robotic Manipulation
Paper • 2506.18088 • Published • 18 -
Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving
Paper • 2507.06229 • Published • 77
-
Unified Vision-Language-Action Model
Paper • 2506.19850 • Published • 28 -
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics
Paper • 2506.01844 • Published • 158 -
3D-VLA: A 3D Vision-Language-Action Generative World Model
Paper • 2403.09631 • Published • 12 -
QUAR-VLA: Vision-Language-Action Model for Quadruped Robots
Paper • 2312.14457 • Published • 1
-
From Intention to Execution: Probing the Generalization Boundaries of Vision-Language-Action Models
Paper • 2506.09930 • Published • 8 -
SAFE: Multitask Failure Detection for Vision-Language-Action Models
Paper • 2506.09937 • Published • 9 -
Hidden in plain sight: VLMs overlook their visual representations
Paper • 2506.08008 • Published • 7 -
RoboTwin 2.0: A Scalable Data Generator and Benchmark with Strong Domain Randomization for Robust Bimanual Robotic Manipulation
Paper • 2506.18088 • Published • 18
-
SwarmAgentic: Towards Fully Automated Agentic System Generation via Swarm Intelligence
Paper • 2506.15672 • Published • 15 -
SAFEFLOW: A Principled Protocol for Trustworthy and Transactional Autonomous Agent Systems
Paper • 2506.07564 • Published • 6 -
RoboTwin 2.0: A Scalable Data Generator and Benchmark with Strong Domain Randomization for Robust Bimanual Robotic Manipulation
Paper • 2506.18088 • Published • 18 -
Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving
Paper • 2507.06229 • Published • 77
-
MOSAIC: Modeling Social AI for Content Dissemination and Regulation in Multi-Agent Simulations
Paper • 2504.07830 • Published • 18 -
WORLDMEM: Long-term Consistent World Simulation with Memory
Paper • 2504.12369 • Published • 35 -
Towards a Unified Copernicus Foundation Model for Earth Vision
Paper • 2503.11849 • Published • 5 -
VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory
Paper • 2506.18903 • Published • 22