fix(notebook): py3.11 f-string backslash error in format_obs 56f70b1 vaibhav12332112312 commited on 13 days ago
Merge branch 'main' of https://huggingface.co/spaces/vaibhavkhandare/train-bhai-train 383294c vaibhav12332112312 commited on 13 days ago
fix(notebook): pin typing_extensions>=4.13.0 to fix pydantic Sentinel ImportError b1bd9cc vaibhav12332112312 commited on 13 days ago
fix: restore parse_model_output exception parity with original bare except aeedd8d anuragredbus commited on 13 days ago
chore: align train_grpo.ipynb with smoke/syntax patterns for Colab 0587f05 anuragredbus commited on 13 days ago
add training/syntax_only.ipynb — kernel + Python syntax only (no project logic) 0e50d91 anuragredbus commited on 13 days ago
add train_grpo_smoke notebook; quote pip versions in train_grpo b55c1ff anuragredbus commited on 13 days ago
fix: notebook loads Qwen without bitsandbytes on Mac; optional training deps eb1d764 anuragredbus commited on 13 days ago
fix: robust notebook setup (no magic shell) + local CWD auto-detect 8d09986 anuragredbus commited on 13 days ago
Merge branch 'hack1' of github.com:VaibhavKhandare/viral-posts-env into hack1 6c01076 vaibhav12332112312 commited on 13 days ago
fix: rewrite training notebook for real LoRA fine-tuning on Colab 4a29e22 anuragredbus commited on 13 days ago