XYX's picture

XYX

xuyd16

AI & ML interests

None yet

Recent Activity

submitted a paper about 1 month ago

PACED: Distillation at the Frontier of Student Competence

authored a paper about 1 month ago

Overconfident Errors Need Stronger Correction: Asymmetric Confidence Penalties for Reinforcement Learning

authored a paper about 1 month ago

On-Policy Self-Distillation for Reasoning Compression

View all activity

Organizations

None yet

submitted a paper to Daily Papers about 1 month ago

PACED: Distillation at the Frontier of Student Competence

Paper • 2603.11178 • Published Mar 11 • 4

authored 4 papers about 1 month ago

Overconfident Errors Need Stronger Correction: Asymmetric Confidence Penalties for Reinforcement Learning

Paper • 2602.21420 • Published Feb 24 • 6

On-Policy Self-Distillation for Reasoning Compression

Paper • 2603.05433 • Published Mar 5 • 8

Not all tokens are needed(NAT): token efficient reinforcement learning

Paper • 2603.06619 • Published Feb 20 • 1

PACED: Distillation at the Frontier of Student Competence

Paper • 2603.11178 • Published Mar 11 • 4