YoungMan233/mcqa-grpo350m-kl0.01-step400

Safetensors
lfm2
Repository size: 1.69 GB
  • 1 contributor
History: 2 commits
Latest commit: ccb235d (verified) by YoungMan233, "MCQA GRPO 350M KL=0.01 step 400", 14 days ago
  • .gitattributes (1.52 kB): initial commit, 14 days ago
  • chat_template.jinja (2.55 kB): MCQA GRPO 350M KL=0.01 step 400, 14 days ago
  • config.json (1.31 kB): MCQA GRPO 350M KL=0.01 step 400, 14 days ago
  • generation_config.json (131 Bytes): MCQA GRPO 350M KL=0.01 step 400, 14 days ago
  • model.safetensors (1.69 GB, xet): MCQA GRPO 350M KL=0.01 step 400, 14 days ago
  • tokenizer.json (4.73 MB): MCQA GRPO 350M KL=0.01 step 400, 14 days ago
  • tokenizer_config.json (489 Bytes): MCQA GRPO 350M KL=0.01 step 400, 14 days ago