Commit History

feat: HF Jobs training script + plot generator
73c7ea0

InosLihka commited on

fix: notebook plot cell syntax error (newline in string literal)
7340206

InosLihka commited on

notebook: add belief-accuracy + reward-components plots
b5ac530

InosLihka Claude Opus 4.7 (1M context) commited on

merge hf/main: meta-RL refactor supersedes prior commits
786249b

InosLihka commited on

env: meta-RL refactor (continuous profiles, action+belief, adaptation grader)
ecbe0d8

InosLihka Claude Opus 4.7 (1M context) commited on

env: enrich observation with history, anomalies, and discovery bonus
9ed122d

InosLihka Claude Sonnet 4.6 commited on

env: enrich observation with history, anomalies, and discovery bonus
0a15ab5

InosLihka Claude Sonnet 4.6 commited on

Add Run 3 training results: README update + training log (no plots)
52e33e8

InosLihka Claude Sonnet 4.6 commited on

Add Run 3 training results: plots, training log, README update
c67f463

InosLihka Claude Sonnet 4.6 commited on

docs: fix README accuracy + add training results structure
92808b9

InosLihka Claude Sonnet 4.6 commited on

docs: add sim-to-real deployment architecture reference
24adee5

InosLihka Claude Sonnet 4.6 commited on

fix: correct GRPO training hyperparameters to prevent KL explosion
fb112e4

InosLihka Claude Sonnet 4.6 commited on

restore: validate-submission.sh to scripts/
8a56903

InosLihka Claude Sonnet 4.6 commited on

docs: reorganize — 25 files → 4 focused docs
1a25a1a

InosLihka Claude Sonnet 4.6 commited on

refactor: rewrite blog around product vision; fix UI for Gradio 6
5fbafee

InosLihka Claude Sonnet 4.6 commited on

fix: rename kl_coef to beta (correct param name in TRL GRPOConfig)
2c6ee11

InosLihka Claude Sonnet 4.6 commited on

docs: expand blog with purpose, sim-to-real framing, lightweight model goal
26b1e6a

InosLihka Claude Sonnet 4.6 commited on

fix: reduce kl_coef to prevent training instability
0bdfeaa

InosLihka Claude Sonnet 4.6 commited on

Rebuild as Life Simulator: 5 meters, 3 hidden profiles, GRPO training pipeline
cc6473a

InosLihka Claude Sonnet 4.6 commited on

Reorganize docs: segregate Round 1 and Round 2
9bfe470

InosLihka Claude Sonnet 4.6 commited on

Add SPEC.md and hackathon reference docs
69310d6

InosLihka commited on

Fix bugs, add tests, and improve code quality
c07f15e

Akhil Soni commited on

Rewrite README for hackathon human review
f36d90a

Akhil Soni commited on

Add custom task input support and update URLs to HF Space
e74ff96

Akhil Soni commited on

Initial commit: RhythmEnv daily planning RL environment
025774a

Akhil Soni commited on