Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
SantiagoC
/
palindrome-grpo-v2
like
0
ml-intern
Model card
Files
Files and versions
xet
Community
Copy to bucket
new
main
palindrome-grpo-v2
96.3 kB
Ctrl+K
Ctrl+K
1 contributor
History:
20 commits
SantiagoC
Upload train_grpo_v6.py
329712a
verified
19 days ago
.gitattributes
Safe
1.52 kB
initial commit
19 days ago
README.md
Safe
740 Bytes
Update ML Intern artifact metadata
19 days ago
test_model.py
Safe
3.83 kB
Upload test_model.py
19 days ago
test_sft_model.py
Safe
2.15 kB
Add SFT model evaluation script
19 days ago
train_grpo_curriculum.py
Safe
14.8 kB
Upload train_grpo_curriculum.py
19 days ago
train_grpo_curriculum_v2.py
Safe
10.7 kB
Upload train_grpo_curriculum_v2.py
19 days ago
train_grpo_from_sft.py
Safe
9.71 kB
Upload train_grpo_from_sft.py
19 days ago
train_grpo_v2.py
Safe
10.2 kB
Upload train_grpo_v2.py
19 days ago
train_grpo_v3.py
Safe
11 kB
Upload train_grpo_v3.py
19 days ago
train_grpo_v4.py
Safe
8.24 kB
Upload train_grpo_v4.py
19 days ago
train_grpo_v5.py
Safe
7.11 kB
Upload train_grpo_v5.py
19 days ago
train_grpo_v6.py
11.4 kB
Upload train_grpo_v6.py
19 days ago
train_sft.py
Safe
3.5 kB
Fix OOM: reduce batch, disable packing, add flash-attn2 kernel
19 days ago
train_sft_qwen3.py
Safe
1.37 kB
Upload train_sft_qwen3.py
19 days ago