SantiagoC/palindrome-grpo-v2

ml-intern
96.3 kB
  • 1 contributor
History: 20 commits
Latest commit: Upload train_grpo_v6.py (SantiagoC, 329712a, verified, 19 days ago)
  • .gitattributes
    1.52 kB
    initial commit 19 days ago
  • README.md
    740 Bytes
    Update ML Intern artifact metadata 19 days ago
  • test_model.py
    3.83 kB
    Upload test_model.py 19 days ago
  • test_sft_model.py
    2.15 kB
    Add SFT model evaluation script 19 days ago
  • train_grpo_curriculum.py
    14.8 kB
    Upload train_grpo_curriculum.py 19 days ago
  • train_grpo_curriculum_v2.py
    10.7 kB
    Upload train_grpo_curriculum_v2.py 19 days ago
  • train_grpo_from_sft.py
    9.71 kB
    Upload train_grpo_from_sft.py 19 days ago
  • train_grpo_v2.py
    10.2 kB
    Upload train_grpo_v2.py 19 days ago
  • train_grpo_v3.py
    11 kB
    Upload train_grpo_v3.py 19 days ago
  • train_grpo_v4.py
    8.24 kB
    Upload train_grpo_v4.py 19 days ago
  • train_grpo_v5.py
    7.11 kB
    Upload train_grpo_v5.py 19 days ago
  • train_grpo_v6.py
    11.4 kB
    Upload train_grpo_v6.py 19 days ago
  • train_sft.py
    3.5 kB
    Fix OOM: reduce batch, disable packing, add flash-attn2 kernel 19 days ago
  • train_sft_qwen3.py
    1.37 kB
    Upload train_sft_qwen3.py 19 days ago