Zhiyuan He's picture

Zhiyuan He

hzy46

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 21 days ago

Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

upvoted a paper 7 months ago

ΔL Normalization: Rethink Loss Aggregation in RLVR

commentedon a paper 7 months ago

$ΔL$ Normalization: Rethink Loss Aggregation in RLVR

View all activity

Organizations

None yet

upvoted a paper 21 days ago

Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

Paper • 2603.24472 • Published 21 days ago • 53

upvoted a paper 7 months ago

ΔL Normalization: Rethink Loss Aggregation in RLVR

Paper • 2509.07558 • Published Sep 9, 2025 • 7

upvoted 3 papers 8 months ago

Prompt Orchestration Markup Language

Paper • 2508.13948 • Published Aug 19, 2025 • 48

LeanK: Learnable K Cache Channel Pruning for Efficient Decoding

Paper • 2508.02215 • Published Aug 4, 2025 • 12

Agent Lightning: Train ANY AI Agents with Reinforcement Learning

Paper • 2508.03680 • Published Aug 5, 2025 • 140