ronitraj committed on
Commit b11c099 · verified · 1 Parent(s): 68d2b8a

Upload BLOG.md with huggingface_hub

Files changed (1)
  1. BLOG.md +3 -3
BLOG.md CHANGED
@@ -17,7 +17,7 @@ We trained Qwen2.5-3B-Instruct with SFT followed by GRPO. Inference happens behi
 
 - 🧪 **Live environment**: <https://huggingface.co/spaces/ronitraj/QuantumScribe>
 - 🏋️ **Trained adapter**: <https://huggingface.co/ronitraj/quantumscribe>
-- 📒 **Colab notebook**: [`notebooks/colab_train.ipynb`](notebooks/colab_train.ipynb)
+- 📒 **Colab notebook (actual training run)**: [`notebooks/meta_final.ipynb`](notebooks/meta_final.ipynb)
 - 📈 **W&B project**: <https://wandb.ai/ronitraj/QuantumScribe-GRPO>
 
 ---
@@ -239,7 +239,7 @@ python -m scripts.eval --adapter ronitraj/quantumscribe --level L2_target --epis
 ```
 
 To re-run training (T4 colab):
-- Open `notebooks/colab_train.ipynb`
+- Open `notebooks/meta_final.ipynb`
 - Runtime → GPU → T4
 - Run all cells
 
@@ -251,7 +251,7 @@ To re-run training (T4 colab):
 |---|---|
 | 🧪 Live HF Space | <https://huggingface.co/spaces/ronitraj/QuantumScribe> |
 | 🏋️ Trained LoRA adapter | <https://huggingface.co/ronitraj/quantumscribe> |
-| 📒 Colab training notebook | [`notebooks/colab_train.ipynb`](notebooks/colab_train.ipynb) |
+| 📒 Colab training notebook (actual run) | [`notebooks/meta_final.ipynb`](notebooks/meta_final.ipynb) |
 | 📈 W&B project | <https://wandb.ai/ronitraj/QuantumScribe-GRPO> |
 | 🛠 OpenEnv manifest | [`openenv.yaml`](openenv.yaml) |
 | 📐 Architecture deep-dive | [`docs/architecture.md`](docs/architecture.md) |