zhepeihong
peregrine123
AI & ML interests
Post-training, On-policy Distillation
Recent Activity
submitted a paper about 23 hours ago
Rubric-based On-policy Distillation authored a paper 1 day ago
Rubric-based On-policy Distillation upvoted a paper 1 day ago
Rubric-based On-policy DistillationOrganizations
None yet