Spaces: srishtichugh/orgOS
f90e8de
orgOS/training
7 contributors
History: 7 commits
Latest commit: muskan singh, "adding", f5c84e7, 12 days ago
plots (directory), last commit "adding", 12 days ago
grpo_orgos.ipynb (Safe, 30.7 kB), last commit "fix: stable GRPO notebook — pin TRL<=0.24, multi-step reward, Drive checkpoints every 30 steps", 12 days ago
train.py (Safe, 20.7 kB), last commit "plots, results, readme updations", 12 days ago