daqi

Sunshine8393

AI & ML interests

None yet

Recent Activity

authored a paper 28 days ago

From Passive Observer to Active Critic: Reinforcement Learning Elicits Process Reasoning for Robotic Manipulation

Organizations

None yet

upvoted a paper 28 days ago

From Passive Observer to Active Critic: Reinforcement Learning Elicits Process Reasoning for Robotic Manipulation

Paper • 2603.15600 • Published 29 days ago • 7

upvoted a collection 28 days ago

PRIMO R1

Official release of PRIMO R1, a 7B video MLLM for robotic process reasoning, featuring RL-optimized models, SFT/RL datasets, and a cross-domain benchmark • 6 items • Updated 6 days ago • 4