h zhao's picture

h zhao

n1cck

huaiyizhao

AI & ML interests

None yet

Recent Activity

upvoted an article 4 days ago

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

updated a dataset 29 days ago

published a dataset 29 days ago

View all activity

Organizations

None yet

commented a paper 2 months ago

UltraCUA: A Foundation Model for Computer Use Agents with Hybrid Action

Paper • 2510.17790 • Published Oct 20, 2025 • 6 •

New activity in HuggingFaceM4/FineVision 3 months ago

Which training framework can directly load this data without preprocessing?

#32 opened 3 months ago by

New activity in WeiboAI/VibeThinker-1.5B 5 months ago

hello? 虽然是一个推理模型，但有的方面也太离谱了吧

#8 opened 5 months ago by

commented a paper 7 months ago

Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

Paper • 2509.08721 • Published Sep 10, 2025 • 665 •

commented 2 papers 8 months ago

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

Paper • 2508.14029 • Published Aug 19, 2025 • 119 •

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

Paper • 2508.14029 • Published Aug 19, 2025 • 119 •

New activity in Time-MQA/TSQA 9 months ago

Open sourcing evaluation scripts?

#1 opened 9 months ago by