Submitted by Wenqi Shi
Scaling Agentic Reinforcement Learning for Tool-Integrated Reasoning in VLMs
Eigen AI