tucano2-commerce / notebooks /v4_1_instruct_grpo.ipynb

Commit History

notebooks: add V4.1 GRPO notebook (parser fix, 600 steps, LR 5e-6, constant_with_warmup)
d7a090d
verified

rtferraz commited on