Commit History

fix: smaller per-device batch + grad accum + bigger completion budget for L40S
649912e
verified

anugrahteesdollar committed on

fix: pass --per-device-train-batch-size 4 to GRPO so effective batch divides num_generations
45efa2c
verified

anugrahteesdollar committed on

add: pre/post eval + summarize + bumped GRPO config (space/training/app.py)
1a90e9c
verified

anugrahteesdollar committed on

fix: multi-GPU SFT shape mismatch (space/training/app.py)
8f997ce
verified

anugrahteesdollar committed on

initial: drugenv trainer control panel
e681925
verified

anugrahteesdollar committed on