ESearch-R1: Learning Cost-Aware MLLM Agents for Interactive Embodied Search via Reinforcement Learning Paper • 2512.18571 • Published Dec 21, 2025
lyl472324464/pixmo_points_dataset_ALL_which_points_less_than_100 Viewer • Updated Dec 26, 2024 • 2.24M • 4 • 1
PhysVLM-AVR: Active Visual Reasoning for Multimodal Large Language Models in Physical Environments Paper • 2510.21111 • Published Oct 24, 2025 • 3
PhysVLM-AVR: Active Visual Reasoning for Multimodal Large Language Models in Physical Environments Paper • 2510.21111 • Published Oct 24, 2025 • 3
PhysVLM-AVR: Active Visual Reasoning for Multimodal Large Language Models in Physical Environments Paper • 2510.21111 • Published Oct 24, 2025 • 3 • 1
LightPlanner: Unleashing the Reasoning Capabilities of Lightweight Large Language Models in Task Planning Paper • 2503.08508 • Published Mar 11, 2025
PhysVLM: Enabling Visual Language Models to Understand Robotic Physical Reachability Paper • 2503.08481 • Published Mar 11, 2025