Spaces:
Sleeping
Sleeping
Upload BLOG.md with huggingface_hub
Browse files
BLOG.md
CHANGED
|
@@ -17,7 +17,7 @@ We trained Qwen2.5-3B-Instruct with SFT followed by GRPO. Inference happens behi
|
|
| 17 |
|
| 18 |
- π§ͺ **Live environment**: <https://huggingface.co/spaces/ronitraj/QuantumScribe>
|
| 19 |
- ποΈ **Trained adapter**: <https://huggingface.co/ronitraj/quantumscribe>
|
| 20 |
-
- π **Colab notebook**: [`notebooks/
|
| 21 |
- π **W&B project**: <https://wandb.ai/ronitraj/QuantumScribe-GRPO>
|
| 22 |
|
| 23 |
---
|
|
@@ -239,7 +239,7 @@ python -m scripts.eval --adapter ronitraj/quantumscribe --level L2_target --epis
|
|
| 239 |
```
|
| 240 |
|
| 241 |
To re-run training (T4 colab):
|
| 242 |
-
- Open `notebooks/
|
| 243 |
- Runtime β GPU β T4
|
| 244 |
- Run all cells
|
| 245 |
|
|
@@ -251,7 +251,7 @@ To re-run training (T4 colab):
|
|
| 251 |
|---|---|
|
| 252 |
| π§ͺ Live HF Space | <https://huggingface.co/spaces/ronitraj/QuantumScribe> |
|
| 253 |
| ποΈ Trained LoRA adapter | <https://huggingface.co/ronitraj/quantumscribe> |
|
| 254 |
-
| π Colab training notebook | [`notebooks/
|
| 255 |
| π W&B project | <https://wandb.ai/ronitraj/QuantumScribe-GRPO> |
|
| 256 |
| π OpenEnv manifest | [`openenv.yaml`](openenv.yaml) |
|
| 257 |
| π Architecture deep-dive | [`docs/architecture.md`](docs/architecture.md) |
|
|
|
|
| 17 |
|
| 18 |
- π§ͺ **Live environment**: <https://huggingface.co/spaces/ronitraj/QuantumScribe>
|
| 19 |
- ποΈ **Trained adapter**: <https://huggingface.co/ronitraj/quantumscribe>
|
| 20 |
+
- π **Colab notebook (actual training run)**: [`notebooks/meta_final.ipynb`](notebooks/meta_final.ipynb)
|
| 21 |
- π **W&B project**: <https://wandb.ai/ronitraj/QuantumScribe-GRPO>
|
| 22 |
|
| 23 |
---
|
|
|
|
| 239 |
```
|
| 240 |
|
| 241 |
To re-run training (T4 colab):
|
| 242 |
+
- Open `notebooks/meta_final.ipynb`
|
| 243 |
- Runtime β GPU β T4
|
| 244 |
- Run all cells
|
| 245 |
|
|
|
|
| 251 |
|---|---|
|
| 252 |
| π§ͺ Live HF Space | <https://huggingface.co/spaces/ronitraj/QuantumScribe> |
|
| 253 |
| ποΈ Trained LoRA adapter | <https://huggingface.co/ronitraj/quantumscribe> |
|
| 254 |
+
| π Colab training notebook (actual run) | [`notebooks/meta_final.ipynb`](notebooks/meta_final.ipynb) |
|
| 255 |
| π W&B project | <https://wandb.ai/ronitraj/QuantumScribe-GRPO> |
|
| 256 |
| π OpenEnv manifest | [`openenv.yaml`](openenv.yaml) |
|
| 257 |
| π Architecture deep-dive | [`docs/architecture.md`](docs/architecture.md) |
|