DeepSeek-R1-Distill-Llama-8B-GRPO / trainer_state.json
cfei621's picture
Model save
835faae verified
raw
history
765 kB
File too large to display, you can check the raw version instead.