ESearch-R1: Learning Cost-Aware MLLM Agents for Interactive Embodied Search via Reinforcement Learning Paper • 2512.18571 • Published Dec 21, 2025
PhysVLM-AVR: Active Visual Reasoning for Multimodal Large Language Models in Physical Environments Paper • 2510.21111 • Published Oct 24, 2025
LightPlanner: Unleashing the Reasoning Capabilities of Lightweight Large Language Models in Task Planning Paper • 2503.08508 • Published Mar 11, 2025
PhysVLM: Enabling Visual Language Models to Understand Robotic Physical Reachability Paper • 2503.08481 • Published Mar 11, 2025