Examples of tasks we designed in https://arxiv.org/abs/2504.15266
Chen Wu PRO
ChenWu98
AI & ML interests
Generative models
Recent Activity
updated a model 4 days ago
ChenWu98/grpo_generator_feedback_hard_Qwen-Qwen3-8B_lr1e-6_k1_init2800 published a model 4 days ago
ChenWu98/grpo_generator_feedback_hard_Qwen-Qwen3-8B_lr1e-6_k1_init2800 updated a model 4 days ago
ChenWu98/grpo_generator_feedback_hard_Qwen-Qwen3-8B_lr1e-6_k3_init2800Organizations
None yet