From Passive Observer to Active Critic: Reinforcement Learning Elicits Process Reasoning for Robotic Manipulation • Paper • 2603.15600 • Published 1 day ago