Spaces: InosLihka/rhythm_env
Branch: main
Path: rhythm_env/training
3 contributors
History: 18 commits
Latest commit: InosLihka, "Clarify documentation: anomaly signal explainer, GRPO scope notes" (361aed7), 6 days ago
File | Scan | Size | Last commit message | Updated
RhythmEnv_GRPO_Training.ipynb | Safe | 23.7 kB | Clarify documentation: anomaly signal explainer, GRPO scope notes | 6 days ago
dataset.py | Safe | 11.6 kB | Algorithm Distillation: grader v2 with belief_accuracy + SFT pipeline | 12 days ago
inference_eval.py | Safe | 11.5 kB | Fix prompt truncation in inference_eval.py: max_seq_length 768 -> 2048 | 12 days ago
reward_functions.py | Safe | 14 kB | Algorithm Distillation: grader v2 with belief_accuracy + SFT pipeline | 12 days ago
sft_prime.py | Safe | 8.51 kB | Algorithm Distillation: grader v2 with belief_accuracy + SFT pipeline | 12 days ago
train.py | Safe | 9.03 kB | Algorithm Distillation: grader v2 with belief_accuracy + SFT pipeline | 12 days ago