Commit History

fix(notebook): pin typing_extensions>=4.13.0 to fix pydantic Sentinel ImportError
b1bd9cc

vaibhav12332112312 committed on

fix: restore parse_model_output exception parity with original bare except
aeedd8d

anuragredbus committed on

chore: align train_grpo.ipynb with smoke/syntax patterns for Colab
0587f05

anuragredbus committed on

add training/syntax_only.ipynb — kernel + Python syntax only (no project logic)
0e50d91

anuragredbus committed on

add train_grpo_smoke notebook; quote pip versions in train_grpo
b55c1ff

anuragredbus committed on

fix: notebook loads Qwen without bitsandbytes on Mac; optional training deps
eb1d764

anuragredbus committed on

fix: robust notebook setup (no magic shell) + local CWD auto-detect
8d09986

anuragredbus committed on

Merge branch 'hack1' of github.com:VaibhavKhandare/viral-posts-env into hack1
6c01076

vaibhav12332112312 committed on

fix: rewrite training notebook for real LoRA fine-tuning on Colab
4a29e22

anuragredbus committed on

reduced steps to fit our free tier
fcfbc38

anuragredbus committed on

reduced steps to fit our free tier
571f8a4

anuragredbus committed on

Viraltest OpenEnv: deploy to HF Space
28dd5a4

anuragredbus committed on