Han Yang

yaanhaan

AI & ML interests

None yet

Recent Activity

liked a dataset 4 days ago

qiaojin/PubMedQA

liked a dataset 22 days ago

siyanzhao/Openthoughts_math_30k_opsd

upvoted a paper about 1 month ago

Self-Distilled Reasoner: On-Policy Self-Distillation for Large Language Models

Organizations

None yet

upvoted a paper about 1 month ago

Self-Distilled Reasoner: On-Policy Self-Distillation for Large Language Models

Paper • 2601.18734 • Published Jan 26 • 5

upvoted an article about 1 month ago

Article

How NuminaMath Won the First AIMO Progress Prize

Jul 11, 2024 • 1

upvoted a paper 5 months ago

Memory-R1: Enhancing Large Language Model Agents to Manage and Utilize Memories via Reinforcement Learning

Paper • 2508.19828 • Published Aug 27, 2025 • 8

upvoted 2 collections 6 months ago

Reinforcement learning

142 items • Updated about 16 hours ago • 9

Agent & RL

55 items • Updated Nov 27, 2025 • 21