Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
7
28
30
Sangwoo Park
Sangsang
Follow
DongkiKim's profile picture
Jackson0018's profile picture
jin1234's profile picture
15 followers
·
30 following
swgger
AI & ML interests
I do LLM post-training research (KAIST AI)
Recent Activity
updated
a model
2 days ago
Sangsang/grpo_Qwen3-0.6B_bs16_g16_mb128_lr1e-6_b1e-3_clip0p2_temp0p7_ep30
published
a model
2 days ago
Sangsang/grpo_Qwen3-0.6B_bs16_g16_mb128_lr1e-6_b1e-3_clip0p2_temp0p7_ep30
updated
a model
3 days ago
Sangsang/feedback_asymmetric_fixed_ema_Llama-3.1-8B-Instruct_bw0p5_fw0p5_ema0p999_ep30_v2
View all activity
Organizations
None yet
Sangsang
's models
212
Sort: Recently updated
Sangsang/R1-7B-thinksafe-r1-7B-ablation-16-pm-e1
Text Generation
•
Updated
Jan 18
•
3
Sangsang/R1-1.5B-thinksafe-r1-1.5B-ablation-64-pm-e3
Text Generation
•
Updated
Jan 18
•
3
Sangsang/R1-1.5B-thinksafe-r1-1.5B-ablation-64-pm-e2
Text Generation
•
Updated
Jan 18
•
1
Sangsang/R1-1.5B-thinksafe-r1-1.5B-ablation-64-pm-e1
Text Generation
•
Updated
Jan 18
•
1
Sangsang/qwen3-4B-thinksafe-4B-n1-ablation-32-pm-e3
Text Generation
•
Updated
Jan 18
•
1
Sangsang/qwen3-4B-thinksafe-4B-n1-ablation-32-pm-e2
Text Generation
•
Updated
Jan 18
•
1
Sangsang/qwen3-4B-thinksafe-4B-n1-ablation-16-pm-e3
Text Generation
•
Updated
Jan 18
•
1
Sangsang/qwen3-4B-thinksafe-4B-n1-ablation-16-pm-e2
Text Generation
•
Updated
Jan 18
•
1
Sangsang/qwen3-4B-thinksafe-4B-n1-ablation-16-pm-e1
Text Generation
•
Updated
Jan 18
•
1
Sangsang/qwen3-1.7B-thinksafe-1.7B-n1-ablation-64-pm-e3
Text Generation
•
Updated
Jan 18
•
5
Sangsang/qwen3-1.7B-thinksafe-1.7B-n1-ablation-64-pm-e2
Text Generation
•
Updated
Jan 17
•
1
Sangsang/R1-1.5B-thinksafe-r1-1.5B-ablation-32-pm-e3
Text Generation
•
Updated
Jan 17
•
1
Sangsang/qwen3-1.7B-thinksafe-1.7B-n1-ablation-64-pm-e1
Text Generation
•
Updated
Jan 17
•
2
Sangsang/qwen3-1.7B-thinksafe-1.7B-n1-ablation-32-pm-e3
Text Generation
•
Updated
Jan 17
•
1
Sangsang/qwen3-1.7B-thinksafe-1.7B-n1-ablation-32-pm-e2
Text Generation
•
Updated
Jan 17
•
4
Sangsang/R1-1.5B-thinksafe-r1-1.5B-ablation-32-pm-e2
Text Generation
•
Updated
Jan 17
•
1
Sangsang/R1-1.5B-thinksafe-r1-1.5B-ablation-16-pm-e3
Text Generation
•
Updated
Jan 17
•
1
Sangsang/R1-1.5B-thinksafe-r1-1.5B-ablation-16-pm-e2
Text Generation
•
Updated
Jan 17
•
1
Sangsang/R1-1.5B-thinksafe-r1-1.5B-ablation-16-pm-e1
Text Generation
•
Updated
Jan 17
•
1
Sangsang/qwen3-1.7B-thinksafe-1.7B-n1-ablation-16-pm-e3
Text Generation
•
Updated
Jan 17
•
1
Sangsang/qwen3-1.7B-thinksafe-1.7B-n1-ablation-16-pm-e2
Text Generation
•
Updated
Jan 17
•
3
Sangsang/qwen3-1.7B-thinksafe-1.7B-n1-ablation-16-pm-e1
Text Generation
•
Updated
Jan 17
•
1
Sangsang/qwen3-0.6B-thinksafe-0.6B-n1-ablation-64-pm-e3
Text Generation
•
Updated
Jan 17
•
3
Sangsang/qwen3-0.6B-thinksafe-0.6B-n1-ablation-64-pm-e2
Text Generation
•
Updated
Jan 17
•
4
Sangsang/qwen3-0.6B-thinksafe-0.6B-n1-ablation-64-pm-e1
Text Generation
•
Updated
Jan 17
•
1
Sangsang/qwen3-0.6B-thinksafe-0.6B-n1-ablation-32-pm-e3
Text Generation
•
Updated
Jan 17
•
1
Sangsang/qwen3-0.6B-thinksafe-0.6B-n1-ablation-32-pm-e2
Text Generation
•
Updated
Jan 17
•
1
Sangsang/qwen3-0.6B-thinksafe-0.6B-n1-ablation-16-pm-e3
Text Generation
•
Updated
Jan 17
•
1
Sangsang/qwen3-0.6B-thinksafe-0.6B-n1-ablation-16-pm-e2
Text Generation
•
Updated
Jan 17
•
1
Sangsang/qwen3-0.6B-thinksafe-0.6B-n1-ablation-16-pm-e1
Text Generation
•
Updated
Jan 17
•
1
Previous
1
2
3
4
5
6
...
8
Next