Hugging Face
Jason Wei
JWei05
0 followers · 1 following
AI & ML interests
RL, LLMs, DL Theory
JWei05's activity
updated a model about 1 hour ago: JWei05/dapo-gemma3-27b-it-warmup20 (Updated about 1 hour ago)
published a model about 1 hour ago: JWei05/dapo-gemma3-27b-it-warmup20 (Updated about 1 hour ago)
updated a model about 4 hours ago: JWei05/gemma3-12b-it-off-policy-distilled-from-gemma4-31b (Updated about 4 hours ago)
published a model about 5 hours ago: JWei05/gemma3-12b-it-off-policy-distilled-from-gemma4-31b (Updated about 4 hours ago)
updated a model about 6 hours ago: JWei05/gemma3-4b-it-off-policy-distilled-from-gemma4-31b (Updated about 5 hours ago)
published a model about 6 hours ago: JWei05/gemma3-4b-it-off-policy-distilled-from-gemma4-31b (Updated about 5 hours ago)
updated a dataset about 6 hours ago: JWei05/DAPO-Gemma4-31B-IT-SFT-Data (Updated about 6 hours ago)
published a dataset about 6 hours ago: JWei05/DAPO-Gemma4-31B-IT-SFT-Data (Updated about 6 hours ago)
updated a model about 8 hours ago: JWei05/gemma3-12b-pt-off-policy-distilled-from-dapo27b (Updated about 7 hours ago)
published a model about 8 hours ago: JWei05/gemma3-12b-pt-off-policy-distilled-from-dapo27b (Updated about 7 hours ago)
updated a model about 8 hours ago: JWei05/gemma3-4b-pt-off-policy-distilled-from-dapo27b (Updated about 8 hours ago)
published a model about 8 hours ago: JWei05/gemma3-4b-pt-off-policy-distilled-from-dapo27b (Updated about 8 hours ago)
updated a model about 21 hours ago: JWei05/gemma3-12b-it-off-policy-distilled-from-dapo27b-correct (Updated about 20 hours ago)
published a model about 21 hours ago: JWei05/gemma3-12b-it-off-policy-distilled-from-dapo27b-correct (Updated about 20 hours ago)
updated a model about 21 hours ago: JWei05/gemma3-4b-it-off-policy-distilled-from-dapo27b-correct (Updated about 20 hours ago)
published a model about 21 hours ago: JWei05/gemma3-4b-it-off-policy-distilled-from-dapo27b-correct (Updated about 20 hours ago)
updated a dataset about 21 hours ago: JWei05/DAPO-Gemma3-27B-IT-RL-SFT-Data-correct (Viewer • Updated about 21 hours ago • 41.8k)
published a dataset about 21 hours ago: JWei05/DAPO-Gemma3-27B-IT-RL-SFT-Data-correct (Viewer • Updated about 21 hours ago • 41.8k)
updated a model about 23 hours ago: JWei05/gemma3-12b-it-off-policy-distilled-from-dapo27b (13B • Updated about 22 hours ago)
published a model about 23 hours ago: JWei05/gemma3-12b-it-off-policy-distilled-from-dapo27b (13B • Updated about 22 hours ago)