Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
InosLihka
/
rhythm_env
like
0
Sleeping
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
rhythm_env
1.88 MB
Ctrl+K
Ctrl+K
3 contributors
History:
57 commits
InosLihka
Clarify documentation: anomaly signal explainer, GRPO scope notes
361aed7
6 days ago
docs
Clarify documentation: anomaly signal explainer, GRPO scope notes
6 days ago
plots
Add SFT v3 + GRPO refine results to README + results.md
9 days ago
scripts
Add SFT v3 + GRPO refine results to README + results.md
9 days ago
server
Refactor grader to use openenv.core.rubrics.WeightedSum + Rubric subclasses
11 days ago
tests
Refactor grader to use openenv.core.rubrics.WeightedSum + Rubric subclasses
11 days ago
training
Clarify documentation: anomaly signal explainer, GRPO scope notes
6 days ago
ui
refactor: rewrite blog around product vision; fix UI for Gradio 6
13 days ago
.dockerignore
Safe
92 Bytes
Initial commit: RhythmEnv daily planning RL environment
29 days ago
.env.example
Safe
441 Bytes
Algorithm Distillation: grader v2 with belief_accuracy + SFT pipeline
12 days ago
.gitattributes
Safe
218 Bytes
Post-deadline: full eval results + bigger plots via Git LFS
11 days ago
.gitignore
Safe
211 Bytes
Clarify documentation: anomaly signal explainer, GRPO scope notes
6 days ago
BLOG.md
Safe
9.48 kB
Move blog to root as BLOG.md (per Meta mentor guidance)
11 days ago
Dockerfile
Safe
1.49 kB
Fix HF Space README rendering + Dockerfile encoding
6 days ago
README.md
Safe
23.2 kB
Clarify documentation: anomaly signal explainer, GRPO scope notes
6 days ago
__init__.py
Safe
724 Bytes
env: enrich observation with history, anomalies, and discovery bonus
13 days ago
client.py
Safe
5.04 kB
client: surface ALL observation fields (was dropping deltas, anomalies, last_action, step_history)
12 days ago
eval_baselines_v2.json
Safe
284 Bytes
Algorithm Distillation: grader v2 with belief_accuracy + SFT pipeline
12 days ago
inference.py
Safe
13.4 kB
iter4: fix the 'constant belief = free reward' bug + 6 other deep issues
12 days ago
models.py
Safe
4.17 kB
Algorithm Distillation: grader v2 with belief_accuracy + SFT pipeline
12 days ago
openenv.yaml
Safe
93 Bytes
Initial commit: RhythmEnv daily planning RL environment
29 days ago
pyproject.toml
Safe
909 Bytes
Rebuild as Life Simulator: 5 meters, 3 hidden profiles, GRPO training pipeline
13 days ago
uv.lock
Safe
576 kB
Initial commit: RhythmEnv daily planning RL environment
29 days ago