Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
ycwhencpp
/
final-iteration
like
0
Paused
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
6279175
final-iteration
/
training
/
train_grpo.ipynb
vaibhav12332112312
training: smoke-mode + hardcoded peak hint + valid tool IDs
1f72457
16 days ago
raw
Copy download link
history
blame
Safe
62.3 kB
Rendering notebook...