Spaces: InosLihka/rhythm_env
Branch: main
Path: rhythm_env/plots (905 kB)
3 contributors
History: 4 commits
Latest commit: InosLihka, "Add SFT v3 + GRPO refine results to README + results.md" (666b4ce), 9 days ago
File | Scan | Size | Last commit | Updated
README.md | Safe | 1.21 kB | Add plots/ folder: SFT v3 loss + GRPO iter2 reward curves | 12 days ago
grpo_iter2_baseline_vs_trained.png | Safe | 66.8 kB | Add plots/ folder: SFT v3 loss + GRPO iter2 reward curves | 12 days ago
grpo_iter2_belief_accuracy.png | Safe | 189 kB (xet) | Post-deadline: full eval results + bigger plots via Git LFS | 11 days ago
grpo_iter2_reward_components.png | Safe | 263 kB (xet) | Post-deadline: full eval results + bigger plots via Git LFS | 11 days ago
grpo_iter2_reward_curve.png | Safe | 179 kB (xet) | Post-deadline: full eval results + bigger plots via Git LFS | 11 days ago
grpo_iter2_training_loss.png | Safe | 92.5 kB | Add plots/ folder: SFT v3 loss + GRPO iter2 reward curves | 12 days ago
sft_grpo_comparison.png | Safe | 34.9 kB | Add SFT v3 + GRPO refine results to README + results.md | 9 days ago
sft_v3_baseline_vs_trained.png | Safe | 39.8 kB | Post-deadline: full eval results + bigger plots via Git LFS | 11 days ago
sft_v3_training_loss.png | Safe | 38 kB | Add plots/ folder: SFT v3 loss + GRPO iter2 reward curves | 12 days ago