Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
InosLihka
/
rhythm_env
like
0
Sleeping
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
rhythm_env
/
docs
Ctrl+K
Ctrl+K
3 contributors
History:
27 commits
InosLihka
Clarify documentation: anomaly signal explainer, GRPO scope notes
361aed7
6 days ago
references
Tighten README: resolve GRPO contradiction, drop duplicate baseline table, remove internal mentor docs
8 days ago
architecture.md
Safe
40.4 kB
Algorithm Distillation: grader v2 with belief_accuracy + SFT pipeline
12 days ago
entity_definitions.md
Safe
9.46 kB
Algorithm Distillation: grader v2 with belief_accuracy + SFT pipeline
12 days ago
environment_design.md
Safe
6 kB
docs: reorganize β 25 files β 4 focused docs
13 days ago
iterations.md
Safe
20 kB
Refactor grader to use openenv.core.rubrics.WeightedSum + Rubric subclasses
11 days ago
results.md
Safe
8.81 kB
Add SFT v3 + GRPO refine results to README + results.md
9 days ago
training.md
Safe
5.96 kB
Clarify documentation: anomaly signal explainer, GRPO scope notes
6 days ago