Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Spaces:
InosLihka
/
rhythm_env
Sleeping

App Files Files Community
Fetching metadata from the HF Docker repository...
rhythm_env / plots
905 kB
Ctrl+K
Ctrl+K
  • 3 contributors
History: 4 commits
InosLihka's picture
InosLihka
Add SFT v3 + GRPO refine results to README + results.md
666b4ce 9 days ago
  • README.md
    1.21 kB
    Add plots/ folder: SFT v3 loss + GRPO iter2 reward curves 12 days ago
  • grpo_iter2_baseline_vs_trained.png
    66.8 kB
    Add plots/ folder: SFT v3 loss + GRPO iter2 reward curves 12 days ago
  • grpo_iter2_belief_accuracy.png
    189 kB
    xet
    Post-deadline: full eval results + bigger plots via Git LFS 11 days ago
  • grpo_iter2_reward_components.png
    263 kB
    xet
    Post-deadline: full eval results + bigger plots via Git LFS 11 days ago
  • grpo_iter2_reward_curve.png
    179 kB
    xet
    Post-deadline: full eval results + bigger plots via Git LFS 11 days ago
  • grpo_iter2_training_loss.png
    92.5 kB
    Add plots/ folder: SFT v3 loss + GRPO iter2 reward curves 12 days ago
  • sft_grpo_comparison.png
    34.9 kB
    Add SFT v3 + GRPO refine results to README + results.md 9 days ago
  • sft_v3_baseline_vs_trained.png
    39.8 kB
    Post-deadline: full eval results + bigger plots via Git LFS 11 days ago
  • sft_v3_training_loss.png
    38 kB
    Add plots/ folder: SFT v3 loss + GRPO iter2 reward curves 12 days ago