Chenzehao

beichenhang

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 12 days ago

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published 24 days ago • 330

published a dataset 12 days ago

beichenhang/EnsembleLLM-data

Preview • Updated 12 days ago • 26

updated a dataset 12 days ago

beichenhang/EnsembleLLM-data

Preview • Updated 12 days ago • 26

upvoted a paper 2 months ago

Weak-Driven Learning: How Weak Agents make Strong Agents Stronger

Paper • 2602.08222 • Published Feb 9 • 289

upvoted a paper 3 months ago

Your Group-Relative Advantage Is Biased

Paper • 2601.08521 • Published Jan 13 • 158

liked 2 datasets 5 months ago

qwedsacf/competition_math

Viewer • Updated Jan 28, 2023 • 12.5k • 10.7k • 119

open-r1/OpenR1-Math-220k

Viewer • Updated Feb 18, 2025 • 450k • 14k • 730

updated a model 6 months ago

beichenhang/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

Text Generation • 2B • Updated Oct 17, 2025 • 1

published 2 models 6 months ago

beichenhang/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

Text Generation • 2B • Updated Oct 17, 2025 • 1

beichenhang/OpenR1-Distill-7B

Updated Oct 15, 2025