final-iteration/training/train_grpo.ipynb
train_grpo: rename monthly_* tasks to weekly_* (with env alias)
d8bb03f