rhythm_env / plots

Commit History

Add SFT v3 + GRPO refine results to README + results.md
666b4ce

InosLihka commited on

Post-deadline: full eval results + bigger plots via Git LFS
d64efa6

InosLihka commited on

README: embed reward curve and belief-accuracy curve plots
4dd50e0

InosLihka commited on

Add plots/ folder: SFT v3 loss + GRPO iter2 reward curves
f2401bf

InosLihka commited on