gsy

gsy1519

AI & ML interests

None yet

Recent Activity

upvoted a paper about 7 hours ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published 1 day ago • 53

upvoted a paper about 1 month ago

How Far Can Unsupervised RLVR Scale LLM Training?

Paper • 2603.08660 • Published Mar 9 • 58

upvoted 2 papers 7 months ago

SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Paper • 2509.09674 • Published Sep 11, 2025 • 80

Towards a Unified View of Large Language Model Post-Training

Paper • 2509.04419 • Published Sep 4, 2025 • 76

upvoted a paper 8 months ago

SSRL: Self-Search Reinforcement Learning

Paper • 2508.10874 • Published Aug 14, 2025 • 97

upvoted a paper 11 months ago

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published May 28, 2025 • 132

upvoted a paper 12 months ago

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published Apr 22, 2025 • 122

upvoted a paper about 1 year ago

Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models

Paper • 2503.11224 • Published Mar 14, 2025 • 28