Commit History

Optimize for Kaggle P100: float16, batch=1, grad_accum=8, num_gen=4, max_completion=256, lora_r=8
73f957d

shank commited on

Fix GRPOConfig: rename max_new_tokens to max_completion_length for trl==0.14.0
8b16369

shank commited on

Align gradio version with Hugging Face Space builder2
633a3b7

shank commited on

Stabilize Space runtime: pin ML deps and disable runtime package drift
663b8db

shank commited on

Pin torch to cu121 build + use model.device instead of hardcoded cuda string
8f291e0

shank commited on

Replace unsloth with bitsandbytes+peft: fixes CUDA driver incompatibility on HF A100
c325ad7

shank commited on

Reduce training to 500 steps with tightened curriculum for A10G budget
ba8df98

shank commited on

Fix eval device selection with CUDA-safe fallback
dc8001b

shank commited on

Optimize for A100 80GB: 8 generations, batch 4, lr 2e-5, dense logging
2b1fbf3

shank commited on

Restore full 1000-step training with original curriculum
1128de1

shank commited on

Reduce training to 500 steps with tightened curriculum for A10G budget
3152fa9

shank commited on

Add Gradio training monitor and fix subprocess python path
b92ad01

shank commited on

Update: Started making changes for the hackathon
a55c81d

shank commited on