fix: restore parse_model_output exception parity with original bare except aeedd8d anuragredbus committed on Apr 25
chore: align train_grpo.ipynb with smoke/syntax patterns for Colab 0587f05 anuragredbus committed on Apr 25
add train_grpo_smoke notebook; quote pip versions in train_grpo b55c1ff anuragredbus committed on Apr 25
fix: notebook loads Qwen without bitsandbytes on Mac; optional training deps eb1d764 anuragredbus committed on Apr 25
fix: robust notebook setup (no magic shell) + local CWD auto-detect 8d09986 anuragredbus committed on Apr 25
fix: rewrite training notebook for real LoRA fine-tuning on Colab 4a29e22 anuragredbus committed on Apr 25