John Schaefer
johnschaefer
AI & ML interests
None yet
Recent Activity
Updated a model about 21 hours ago: johnschaefer/GRPO-olmo3_7b_creativity_grpo_purerl
Published a model about 21 hours ago: johnschaefer/GRPO-olmo3_7b_creativity_grpo_purerl
Updated a model about 22 hours ago: johnschaefer/GRPO-olmo3_7b_physics_grpo_purerl
Organizations
None yet
models
10
johnschaefer/GRPO-olmo3_7b_creativity_grpo_purerl (updated about 21 hours ago)
johnschaefer/GRPO-olmo3_7b_physics_grpo_purerl (updated about 21 hours ago)
johnschaefer/GRPO-qwen3_8b_creativity_grpo_purerl (updated about 22 hours ago)
johnschaefer/GRPO-qwen3_8b_creativity_grpo_weighted_mul (updated about 22 hours ago)
johnschaefer/GRPO-qwen3_8b_physics_grpo_purerl (updated about 22 hours ago)
johnschaefer/DAPO-RLVR-with-full-tokens-Qwen3-8B (updated about 22 hours ago)
johnschaefer/GRPO-olmo3_7b_physics_grpo_weighted_mul (updated about 23 hours ago)
johnschaefer/GRPO-qwen3_8b_physics_grpo_weighted_mul (updated about 23 hours ago)
johnschaefer/DAPO-RLVR-with-only-high-entropy-tokens-Qwen3-8B (updated about 23 hours ago)
johnschaefer/GRPO-olmo3_7b_creativity_grpo_weighted_mul (updated about 23 hours ago)
datasets
0
None public yet