DatPySci / RLVR-SGDM-Gap

RLVR-SGDM-Gap / Llama-3.2-3B-Instruct-polaris-GRPO--bsz512
  • 1 contributor
History: 7 commits
Latest commit: d29b9f4 (verified) by DatPySci, 2 months ago: "Add files using upload-large-folder tool"
  • global_step_128
    Add files using upload-large-folder tool 2 months ago
  • global_step_192
    Add files using upload-large-folder tool 2 months ago
  • global_step_256
    Add files using upload-large-folder tool 2 months ago
  • global_step_320
    Add files using upload-large-folder tool 2 months ago
  • global_step_384
    Add files using upload-large-folder tool 2 months ago
  • global_step_448
    Add files using upload-large-folder tool 2 months ago
  • global_step_512
    Add files using upload-large-folder tool 2 months ago
  • global_step_64
    Add files using upload-large-folder tool 2 months ago
  • latest_checkpointed_iteration.txt
    3 Bytes
    Add files using upload-large-folder tool 2 months ago
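
The listing above is ordered lexicographically, which is why global_step_64 appears after global_step_512. The following is a minimal sketch, assuming every checkpoint directory follows the `global_step_<N>` naming shown in the listing, that orders the names by step number instead. The helper names `step_number` and `sort_checkpoints` are hypothetical and not part of the repository.

```python
import re

# Directory names copied from the listing above, in the order shown there.
CHECKPOINT_DIRS = [
    "global_step_128",
    "global_step_192",
    "global_step_256",
    "global_step_320",
    "global_step_384",
    "global_step_448",
    "global_step_512",
    "global_step_64",
]

# Assumption: the step count is the integer suffix after "global_step_".
_STEP_RE = re.compile(r"^global_step_(\d+)$")


def step_number(name: str) -> int:
    """Return the integer step encoded in a checkpoint directory name."""
    match = _STEP_RE.match(name)
    if match is None:
        raise ValueError(f"not a checkpoint directory name: {name!r}")
    return int(match.group(1))


def sort_checkpoints(names):
    """Sort checkpoint directory names by step number, not alphabetically."""
    return sorted(names, key=step_number)


if __name__ == "__main__":
    for name in sort_checkpoints(CHECKPOINT_DIRS):
        print(name)
```

Sorting on the parsed integer avoids the string comparison that places "64" after "512".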