Spaces:
Sleeping
Sleeping
Commit History
fix: notebook plot cell syntax error (newline in string literal) 7340206
notebook: add belief-accuracy + reward-components plots b5ac530
merge hf/main: meta-RL refactor supersedes prior commits 786249b
env: meta-RL refactor (continuous profiles, action+belief, adaptation grader) ecbe0d8
env: enrich observation with history, anomalies, and discovery bonus 9ed122d
env: enrich observation with history, anomalies, and discovery bonus 0a15ab5
Add Run 3 training results: README update + training log (no plots) 52e33e8
Add Run 3 training results: plots, training log, README update c67f463
docs: fix README accuracy + add training results structure 92808b9
docs: add sim-to-real deployment architecture reference 24adee5
fix: correct GRPO training hyperparameters to prevent KL explosion fb112e4
restore: validate-submission.sh to scripts/ 8a56903
docs: reorganize — 25 files → 4 focused docs 1a25a1a
refactor: rewrite blog around product vision; fix UI for Gradio 6 5fbafee
fix: rename kl_coef to beta (correct param name in TRL GRPOConfig) 2c6ee11
docs: expand blog with purpose, sim-to-real framing, lightweight model goal 26b1e6a
fix: reduce kl_coef to prevent training instability 0bdfeaa
Rebuild as Life Simulator: 5 meters, 3 hidden profiles, GRPO training pipeline cc6473a
Reorganize docs: segregate Round 1 and Round 2 9bfe470
Add SPEC.md and hackathon reference docs 69310d6
Fix bugs, add tests, and improve code quality c07f15e
Akhil Soni commited on
Rewrite README for hackathon human review f36d90a
Akhil Soni commited on
Add custom task input support and update URLs to HF Space e74ff96
Akhil Soni commited on
Initial commit: RhythmEnv daily planning RL environment 025774a
Akhil Soni commited on