Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
h zhao's picture
6 1

h zhao

n1cck
  • huaiyizhao

AI & ML interests

None yet

Recent Activity

upvoted an article 4 days ago
Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries
updated a dataset 29 days ago
n1cck/test
published a dataset 29 days ago
n1cck/test
View all activity

Organizations

None yet

commented a paper 2 months ago

UltraCUA: A Foundation Model for Computer Use Agents with Hybrid Action

Paper • 2510.17790 • Published Oct 20, 2025 • 6 •
3
New activity in HuggingFaceM4/FineVision 3 months ago

Which training framework can directly load this data without preprocessing?

#32 opened 3 months ago by
n1cck
New activity in WeiboAI/VibeThinker-1.5B 5 months ago

hello? 虽然是一个推理模型,但有的方面也太离谱了吧

6
#8 opened 5 months ago by
yu0226
commented a paper 7 months ago

Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

Paper • 2509.08721 • Published Sep 10, 2025 • 665 •
56
commented 2 papers 8 months ago

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

Paper • 2508.14029 • Published Aug 19, 2025 • 119 •
6

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

Paper • 2508.14029 • Published Aug 19, 2025 • 119 •
6
New activity in Time-MQA/TSQA 9 months ago

Open sourcing evaluation scripts?

#1 opened 9 months ago by
n1cck
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs