model4 / README.md
Trained with Unsloth
143e1f1 verified
license: bsd
tags:
  - unsloth
  - trl
  - grpo