Jason Wei
JWei05
0 followers · 1 following
AI & ML interests
RL, LLMs, DL Theory
Recent Activity
updated a model about 1 hour ago: JWei05/gemma3-12b-it-off-policy-distilled-from-gemma4-31b
published a model about 2 hours ago: JWei05/gemma3-12b-it-off-policy-distilled-from-gemma4-31b
updated a model about 3 hours ago: JWei05/gemma3-4b-it-off-policy-distilled-from-gemma4-31b
models
16
JWei05/gemma3-12b-it-off-policy-distilled-from-gemma4-31b • Updated about 1 hour ago
JWei05/gemma3-4b-it-off-policy-distilled-from-gemma4-31b • Updated about 2 hours ago
JWei05/gemma3-12b-pt-off-policy-distilled-from-dapo27b • Updated about 4 hours ago
JWei05/gemma3-4b-pt-off-policy-distilled-from-dapo27b • Updated about 5 hours ago
JWei05/gemma3-12b-it-off-policy-distilled-from-dapo27b-correct • Updated about 17 hours ago
JWei05/gemma3-4b-it-off-policy-distilled-from-dapo27b-correct • Updated about 18 hours ago
JWei05/gemma3-12b-it-off-policy-distilled-from-dapo27b • 13B • Updated about 19 hours ago
JWei05/gemma3-4b-it-off-policy-distilled-from-dapo27b • 5B • Updated about 19 hours ago
JWei05/dapo-gemma3-27b-it • Updated 1 day ago
JWei05/Qwen2.5-3B-all-steps-563 • 3B • Updated Nov 21, 2025
datasets
37
JWei05/DAPO-Gemma4-31B-IT-SFT-Data • Updated about 3 hours ago
JWei05/DAPO-Gemma3-27B-IT-RL-SFT-Data-correct • Viewer • Updated about 18 hours ago • 41.8k
JWei05/DAPO-Gemma3-27B-IT-RL-SFT-Data • Viewer • Updated about 23 hours ago • 69.6k • 5
JWei05/swe_smith_py_qwen3.5_35b_trajs_1952 • Viewer • Updated 5 days ago • 2k • 26
JWei05/swe_smith_rs_qwen3.5_35b_trajs_2477 • Viewer • Updated 5 days ago • 5k • 26
JWei05/swe_smith_go_qwen3.5_35b_trajs_1448 • Viewer • Updated 5 days ago • 1.63k • 24
JWei05/swe_smith_js_qwen3.5_35b_trajs_4358 • Viewer • Updated 5 days ago • 5k • 28
JWei05/swe_smith_java_qwen3.5_35b_trajs_4369 • Viewer • Updated 5 days ago • 5k • 31
JWei05/swe_smith_js_5902_filtered • Viewer • Updated 15 days ago • 5.9k • 30
JWei05/swe_smith_java_6457_filtered • Viewer • Updated 15 days ago • 6.46k • 31