Spaces:
Sleeping
Sleeping
Commit History
iter3: align reward with grader + belief-first format + exploration shaping 64d24b3
env: meta-RL refactor (continuous profiles, action+belief, adaptation grader) ecbe0d8
Rebuild as Life Simulator: 5 meters, 3 hidden profiles, GRPO training pipeline cc6473a
Fix bugs, add tests, and improve code quality c07f15e
Akhil Soni commited on
Add custom task input support and update URLs to HF Space e74ff96
Akhil Soni commited on
Initial commit: RhythmEnv daily planning RL environment 025774a
Akhil Soni commited on