subtext-arena / notebooks
aamrinder's picture
sync Colab notebook with current train_grpo.py
9bd1f77 verified