DeepSeek-R1-Distill-Llama-8B-GRPO / trainer_state.json

Commit History

Model save
835faae
verified

cfei621 commited on

Model save
2a95588
verified

cfei621 commited on

Model save
fc231c1
verified

cfei621 commited on

Model save
2455459
verified

cfei621 commited on

Model save
ce0e22c
verified

cfei621 commited on

Model save
66f6bf5
verified

cfei621 commited on

Model save
3b3391d
verified

cfei621 commited on

Model save
57e084e
verified

cfei621 commited on