Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
7
28
30
Sangwoo Park
Sangsang
Follow
invincible-jha's profile picture
jiongdao's profile picture
Fishtiks's profile picture
15 followers
·
30 following
swgger
AI & ML interests
I do LLM post-training research (KAIST AI)
Recent Activity
updated
a model
2 days ago
Sangsang/grpo_Qwen3-0.6B_bs16_g16_mb128_lr1e-6_b1e-3_clip0p2_temp0p7_ep30
published
a model
2 days ago
Sangsang/grpo_Qwen3-0.6B_bs16_g16_mb128_lr1e-6_b1e-3_clip0p2_temp0p7_ep30
updated
a model
3 days ago
Sangsang/feedback_asymmetric_fixed_ema_Llama-3.1-8B-Instruct_bw0p5_fw0p5_ema0p999_ep30_v2
View all activity
Organizations
None yet
Sangsang
's models
212
Sort: Recently updated
Sangsang/DeepSeek-R1-Distill-Qwen-14B_pm_ep5
15B
•
Updated
Feb 18
Sangsang/Qwen2.5-7B-Instruct_pm_think_ep5
8B
•
Updated
Feb 18
Sangsang/DeepSeek-R1-Distill-Qwen-7B_pm_ep5
8B
•
Updated
Feb 18
Sangsang/thinksafe-r1-1.5B-ablation_R32_BZ64_Gen8
Text Generation
•
Updated
Jan 26
•
1
Sangsang/qwen3-8B-thinksafe-8B-n1-ablation-32-pm-e3
Text Generation
•
Updated
Jan 20
•
1
Sangsang/qwen3-8B-thinksafe-8B-n1-ablation-32-pm-e2
Text Generation
•
Updated
Jan 20
•
1
Sangsang/qwen3-8B-thinksafe-8B-n1-ablation-16-pm-e3
Text Generation
•
Updated
Jan 20
•
1
Sangsang/R1-8B-thinksafe-r1-8B-ablation-32-pm-e3
Text Generation
•
Updated
Jan 20
•
1
Sangsang/R1-8B-thinksafe-r1-8B-ablation-32-pm-e2
Text Generation
•
Updated
Jan 20
•
1
Sangsang/R1-8B-thinksafe-r1-8B-ablation-16-pm-e3
Text Generation
•
Updated
Jan 19
•
1
Sangsang/qwen3-8B-thinksafe-8B-n1-ablation-16-pm-e2
Text Generation
•
Updated
Jan 19
•
1
Sangsang/R1-8B-thinksafe-r1-8B-ablation-16-pm-e2
Text Generation
•
Updated
Jan 19
•
1
Sangsang/qwen3-8B-thinksafe-8B-n1-ablation-16-pm-e1
Text Generation
•
Updated
Jan 19
•
2
Sangsang/R1-8B-thinksafe-r1-8B-ablation-16-pm-e1
Text Generation
•
Updated
Jan 19
•
1
Sangsang/qwen3-8B-thinksafe-8B-n1-ablation-64-pm-e3
Text Generation
•
Updated
Jan 19
•
1
Sangsang/qwen3-8B-thinksafe-8B-n1-ablation-64-pm-e2
Text Generation
•
Updated
Jan 19
•
1
Sangsang/qwen3-8B-thinksafe-8B-n1-ablation-64-pm-e1
Text Generation
•
Updated
Jan 19
•
1
Sangsang/qwen3-4B-thinksafe-4B-n1-ablation-64-pm-e3
Text Generation
•
Updated
Jan 19
•
1
Sangsang/R1-8B-thinksafe-r1-8B-ablation-64-pm-e3
Text Generation
•
Updated
Jan 19
•
2
Sangsang/R1-8B-thinksafe-r1-8B-ablation-64-pm-e2
Text Generation
•
Updated
Jan 19
•
1
Sangsang/R1-8B-thinksafe-r1-8B-ablation-64-pm-e1
Text Generation
•
Updated
Jan 19
•
1
Sangsang/R1-7B-thinksafe-r1-7B-ablation-64-pm-e3
Text Generation
•
Updated
Jan 19
•
1
Sangsang/R1-7B-thinksafe-r1-7B-ablation-64-pm-e2
Text Generation
•
Updated
Jan 18
•
1
Sangsang/R1-7B-thinksafe-r1-7B-ablation-64-pm-e1
Text Generation
•
Updated
Jan 18
•
1
Sangsang/qwen3-4B-thinksafe-4B-n1-ablation-64-pm-e2
Text Generation
•
Updated
Jan 18
•
1
Sangsang/qwen3-4B-thinksafe-4B-n1-ablation-64-pm-e1
Text Generation
•
Updated
Jan 18
•
1
Sangsang/R1-7B-thinksafe-r1-7B-ablation-32-pm-e3
Text Generation
•
Updated
Jan 18
•
1
Sangsang/R1-7B-thinksafe-r1-7B-ablation-32-pm-e2
Text Generation
•
Updated
Jan 18
•
7
Sangsang/R1-7B-thinksafe-r1-7B-ablation-16-pm-e3
Text Generation
•
Updated
Jan 18
•
1
Sangsang/R1-7B-thinksafe-r1-7B-ablation-16-pm-e2
Text Generation
•
Updated
Jan 18
•
2
Previous
1
2
3
4
5
...
8
Next