anuragredbus
add train_grpo_smoke notebook; quote pip versions in train_grpo
b55c1ff