arxiv:2601.23143
Sangwoo Park
Sangsang
AI & ML interests
I do LLM post-training research (KAIST AI)
Recent Activity
updated a model about 10 hours ago
Sangsang/grpo_Qwen3-0.6B_bs16_g16_mb128_lr1e-6_b1e-3_clip0p2_temp0p7_ep30
published a model about 10 hours ago
Sangsang/grpo_Qwen3-0.6B_bs16_g16_mb128_lr1e-6_b1e-3_clip0p2_temp0p7_ep30
updated a model 1 day ago
Sangsang/feedback_asymmetric_fixed_ema_Llama-3.1-8B-Instruct_bw0p5_fw0p5_ema0p999_ep30_v2
Organizations
None yet