Hugging Face
sravanthib/test
Text Generation
Transformers
Safetensors
AI-MO/NuminaMath-TIR
llama
Generated from Trainer
trl
grpo
conversational
text-generation-inference
arxiv:2402.03300
main
test
Commit History
Add files using upload-large-folder tool
80bdd0c
verified
sravanthib committed on May 22, 2025
End of training
0f989be
verified
sravanthib committed on Mar 23, 2025
Model save
05edf24
verified
sravanthib committed on Mar 23, 2025
Training in progress, step 14
cda0de4
verified
sravanthib committed on Mar 23, 2025
Training in progress, step 10
10b1269
verified
sravanthib committed on Mar 23, 2025
initial commit
28e70a5
verified
sravanthib committed on Mar 23, 2025