Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

aakritil
/
content

Transformers
Safetensors
Generated from Trainer
trl
dpo
Model card Files Files and versions
xet
Community
content / dpo_train_dataset
1.22 MB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 1 commit
aakritil's picture
aakritil
aakritil/llama2b_reg_dpo_trainer
2bb6f58 verified over 1 year ago
  • data-00000-of-00001.arrow
    1.22 MB
    xet
    aakritil/llama2b_reg_dpo_trainer over 1 year ago
  • dataset_info.json
    311 Bytes
    aakritil/llama2b_reg_dpo_trainer over 1 year ago
  • state.json
    247 Bytes
    aakritil/llama2b_reg_dpo_trainer over 1 year ago