Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
ALIASING
/
grpo-step500
like
0
Safetensors
Model card
Files
Files and versions
xet
Community
main
grpo-step500
15.2 GB
Ctrl+K
Ctrl+K
2 contributors
History:
2 commits
Karina-ww
First model version
4073e89
9 months ago
7B-GRPO-ori-500
First model version
9 months ago
.gitattributes
Safe
1.56 kB
First model version
9 months ago