rhythm_env / inference.py

Commit History

iter4: fix the 'constant belief = free reward' bug + 6 other deep issues
bb2a9c7

InosLihka Claude Opus 4.7 (1M context) commited on

iter3: align reward with grader + belief-first format + exploration shaping
64d24b3

InosLihka Claude Opus 4.7 (1M context) commited on

env: meta-RL refactor (continuous profiles, action+belief, adaptation grader)
ecbe0d8

InosLihka Claude Opus 4.7 (1M context) commited on

Rebuild as Life Simulator: 5 meters, 3 hidden profiles, GRPO training pipeline
cc6473a

InosLihka Claude Sonnet 4.6 commited on

Fix bugs, add tests, and improve code quality
c07f15e

Akhil Soni commited on

Add custom task input support and update URLs to HF Space
e74ff96

Akhil Soni commited on

Initial commit: RhythmEnv daily planning RL environment
025774a

Akhil Soni commited on