- Hao0oWang/CurioSFT-Qwen2.5-Math-7B-RL
  8B • Updated
- Hao0oWang/CurioSFT-Qwen2.5-Math-7B-SFT
  8B • Updated • 2
- Hao0oWang/CurioSFT_Data
  Viewer • Updated • 63k • 23
- Learning While Staying Curious: Entropy-Preserving Supervised Fine-Tuning via Adaptive Self-Distillation for Large Reasoning Models
  Paper • 2602.02244 • Published • 1
Hao
Hao0oWang
AI & ML interests
None yet
Recent Activity
upvoted a paper 13 days ago
Gen-Searcher: Reinforcing Agentic Search for Image Generation
authored a paper about 2 months ago
Learning While Staying Curious: Entropy-Preserving Supervised Fine-Tuning via Adaptive Self-Distillation for Large Reasoning Models
updated a collection about 2 months ago
CurioSFT
Organizations
None yet