
daqi

Sunshine8393

AI & ML interests

None yet

Recent Activity

authored a paper 28 days ago
From Passive Observer to Active Critic: Reinforcement Learning Elicits Process Reasoning for Robotic Manipulation

Organizations

None yet

upvoted a paper 28 days ago

From Passive Observer to Active Critic: Reinforcement Learning Elicits Process Reasoning for Robotic Manipulation

Paper • 2603.15600 • Published 29 days ago • 7
upvoted a collection 28 days ago

PRIMO R1

Collection
Official release of PRIMO R1, a 7B video MLLM for robotic process reasoning featuring RL-optimized models, SFT/RL datasets, and cross-domain benchmark • 6 items • Updated 6 days ago • 4