Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Spaces:
InosLihka
/
rhythm_env
Sleeping

App Files Files Community
Fetching metadata from the HF Docker repository...
rhythm_env / docs
Ctrl+K
Ctrl+K
  • 3 contributors
History: 27 commits
InosLihka's picture
InosLihka
Clarify documentation: anomaly signal explainer, GRPO scope notes
361aed7 6 days ago
  • references
    Tighten README: resolve GRPO contradiction, drop duplicate baseline table, remove internal mentor docs 8 days ago
  • architecture.md
    40.4 kB
    Algorithm Distillation: grader v2 with belief_accuracy + SFT pipeline 12 days ago
  • entity_definitions.md
    9.46 kB
    Algorithm Distillation: grader v2 with belief_accuracy + SFT pipeline 12 days ago
  • environment_design.md
    6 kB
    docs: reorganize β€” 25 files β†’ 4 focused docs 13 days ago
  • iterations.md
    20 kB
    Refactor grader to use openenv.core.rubrics.WeightedSum + Rubric subclasses 11 days ago
  • results.md
    8.81 kB
    Add SFT v3 + GRPO refine results to README + results.md 9 days ago
  • training.md
    5.96 kB
    Clarify documentation: anomaly signal explainer, GRPO scope notes 6 days ago