Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
sharon8811
/
model4
like
0
Safetensors
llama
unsloth
trl
grpo
License:
bsd
Model card
Files
Files and versions
xet
Community
Use this model
main
model4
/
README.md
sharon8811
Trained with Unsloth
143e1f1
verified
about 1 year ago
preview
code
|
raw
Copy download link
history
blame
contribute
delete
57 Bytes
metadata
license:
bsd
tags:
-
unsloth
-
trl
-
grpo