YoungMan233/mcqa-grpo350m-kl0.01-step400
Tags: Safetensors, lfm2
Branch: main
Total size: 1.69 GB
1 contributor
History: 2 commits
Latest commit: ccb235d (verified) by YoungMan233, 14 days ago: MCQA GRPO 350M KL=0.01 step 400
File                    Size       Badges  Last commit                      Updated
.gitattributes          1.52 kB    Safe    initial commit                   14 days ago
chat_template.jinja     2.55 kB    Safe    MCQA GRPO 350M KL=0.01 step 400  14 days ago
config.json             1.31 kB    -       MCQA GRPO 350M KL=0.01 step 400  14 days ago
generation_config.json  131 Bytes  Safe    MCQA GRPO 350M KL=0.01 step 400  14 days ago
model.safetensors       1.69 GB    xet     MCQA GRPO 350M KL=0.01 step 400  14 days ago
tokenizer.json          4.73 MB    Safe    MCQA GRPO 350M KL=0.01 step 400  14 days ago
tokenizer_config.json   489 Bytes  Safe    MCQA GRPO 350M KL=0.01 step 400  14 days ago