rhythm_env / server /rhythm_environment.py

Commit History

Refactor grader to use openenv.core.rubrics.WeightedSum + Rubric subclasses
f0ca22d

InosLihka commited on

Acknowledge OpenEnv Rubric system conformance gap
dc5658d

InosLihka commited on

Algorithm Distillation: grader v2 with belief_accuracy + SFT pipeline
ece0bbe

InosLihka commited on

iter4: fix the 'constant belief = free reward' bug + 6 other deep issues
bb2a9c7

InosLihka Claude Opus 4.7 (1M context) commited on

iter3: align reward with grader + belief-first format + exploration shaping
64d24b3

InosLihka Claude Opus 4.7 (1M context) commited on

iter2: fix mode collapse + 3 deeper bugs from code review
e21a960

InosLihka Claude Opus 4.7 (1M context) commited on

env: meta-RL refactor (continuous profiles, action+belief, adaptation grader)
ecbe0d8

InosLihka Claude Opus 4.7 (1M context) commited on

env: enrich observation with history, anomalies, and discovery bonus
0a15ab5

InosLihka Claude Sonnet 4.6 commited on

Rebuild as Life Simulator: 5 meters, 3 hidden profiles, GRPO training pipeline
cc6473a

InosLihka Claude Sonnet 4.6 commited on

Fix bugs, add tests, and improve code quality
c07f15e

Akhil Soni commited on

Add custom task input support and update URLs to HF Space
e74ff96

Akhil Soni commited on

Initial commit: RhythmEnv daily planning RL environment
025774a

Akhil Soni commited on