PSFT+RL models
SII-Wenhong
wh-zhu
AI & ML interests
None yet
Recent Activity
liked a dataset about 1 month ago: stepfun-ai/Step-3.5-Flash-SFT
updated a dataset about 2 months ago: wh-zhu/dapo
published a dataset about 2 months ago: wh-zhu/dapo