rhythm_env / docs

Commit History

fix: reduce kl_coef to prevent training instability
0bdfeaa

InosLihka, Claude Sonnet 4.6 committed on

Rebuild as Life Simulator: 5 meters, 3 hidden profiles, GRPO training pipeline
cc6473a

InosLihka, Claude Sonnet 4.6 committed on

Reorganize docs: segregate Round 1 and Round 2
9bfe470

InosLihka, Claude Sonnet 4.6 committed on

Add SPEC.md and hackathon reference docs
69310d6

InosLihka committed on