Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Spaces:
InosLihka
/
rhythm_env
Sleeping

App Files Files Community
Fetching metadata from the HF Docker repository...
rhythm_env / training
Ctrl+K
Ctrl+K
  • 3 contributors
History: 18 commits
InosLihka's picture
InosLihka
Clarify documentation: anomaly signal explainer, GRPO scope notes
361aed7 6 days ago
  • RhythmEnv_GRPO_Training.ipynb
    23.7 kB
    Clarify documentation: anomaly signal explainer, GRPO scope notes 6 days ago
  • dataset.py
    11.6 kB
    Algorithm Distillation: grader v2 with belief_accuracy + SFT pipeline 12 days ago
  • inference_eval.py
    11.5 kB
    Fix prompt truncation in inference_eval.py: max_seq_length 768 -> 2048 12 days ago
  • reward_functions.py
    14 kB
    Algorithm Distillation: grader v2 with belief_accuracy + SFT pipeline 12 days ago
  • sft_prime.py
    8.51 kB
    Algorithm Distillation: grader v2 with belief_accuracy + SFT pipeline 12 days ago
  • train.py
    9.03 kB
    Algorithm Distillation: grader v2 with belief_accuracy + SFT pipeline 12 days ago