drugenv-trainer / training

Commit History

add: pre/post eval + summarize + bumped GRPO config (training/summarize.py)
18adb35
verified

anugrahteesdollar commited on

add: pre/post eval + summarize + bumped GRPO config (training/evaluate.py)
95fd656
verified

anugrahteesdollar commited on

fix: multi-GPU SFT shape mismatch (training/sft_warmstart.py)
9867323
verified

anugrahteesdollar commited on

fix: refresh env._latent after step loop in oracle collector
bc59187
verified

anugrahteesdollar commited on

fix: accept --difficulty in sft_warmstart to match space/training/app.py
3c7d404
verified

anugrahteesdollar commited on

fix: include requirements-train.txt + tests (glob bug)
ad12dda
verified

anugrahteesdollar commited on