Scaling Agentic Reinforcement Learning for Tool-Integrated Reasoning in VLMs (Eigen AI). Submitted by Wenqi Shi.