tucano2-commerce / notebooks /v4_1_instruct_grpo.ipynb
rtferraz's picture
notebooks: add V4.1 GRPO notebook (parser fix, 600 steps, LR 5e-6, constant_with_warmup)
d7a090d verified
Open in Colab
Rendering notebook...