Link real GitHub training scripts gist
CADFORGE_BLOG.md
CHANGED
|
@@ -7,7 +7,7 @@ Detailed judge-facing blog with screenshots, diagrams, references, and all run i
|
|
| 7 |
## Judge Reproducibility Links
|
| 8 |
|
| 9 |
- **GitHub repo:** [sanjuhs/open-env-meta-final-hackathon](https://github.com/sanjuhs/open-env-meta-final-hackathon)
|
| 10 |
-
- **
|
| 11 |
- **Google Colab smoke notebook:** [cadforge_openenv_training_colab.ipynb](https://colab.research.google.com/github/sanjuhs/open-env-meta-final-hackathon/blob/main/training/cadforge_openenv_training_colab.ipynb)
|
| 12 |
- **Raw training logs and evidence:** [sanjuhs/cadforge-training-evidence](https://huggingface.co/datasets/sanjuhs/cadforge-training-evidence)
|
| 13 |
|
|
|
|
| 7 |
## Judge Reproducibility Links
|
| 8 |
|
| 9 |
- **GitHub repo:** [sanjuhs/open-env-meta-final-hackathon](https://github.com/sanjuhs/open-env-meta-final-hackathon)
|
| 10 |
+
- **GitHub Gist: training scripts:** [CADForge OpenEnv SFT/GRPO scripts](https://gist.github.com/sanjuhs/10596f688e8b4560910a3b1b137bfeeb)
|
| 11 |
- **Google Colab smoke notebook:** [cadforge_openenv_training_colab.ipynb](https://colab.research.google.com/github/sanjuhs/open-env-meta-final-hackathon/blob/main/training/cadforge_openenv_training_colab.ipynb)
|
| 12 |
- **Raw training logs and evidence:** [sanjuhs/cadforge-training-evidence](https://huggingface.co/datasets/sanjuhs/cadforge-training-evidence)
|
| 13 |
|
README.md
CHANGED
|
@@ -22,7 +22,7 @@ The agent receives a design request, writes a complete CadQuery Python file, and
|
|
| 22 |
## Judge-Facing Links
|
| 23 |
|
| 24 |
- **GitHub repo:** [sanjuhs/open-env-meta-final-hackathon](https://github.com/sanjuhs/open-env-meta-final-hackathon)
|
| 25 |
-
- **
|
| 26 |
- **Raw training logs and evidence:** [sanjuhs/cadforge-training-evidence](https://huggingface.co/datasets/sanjuhs/cadforge-training-evidence)
|
| 27 |
- Training notebook on this HF Space: [training/cadforge_openenv_training_colab.ipynb](training/cadforge_openenv_training_colab.ipynb)
|
| 28 |
- Open the same notebook in Google Colab: [Colab training notebook](https://colab.research.google.com/github/sanjuhs/open-env-meta-final-hackathon/blob/main/training/cadforge_openenv_training_colab.ipynb)
|
|
|
|
| 22 |
## Judge-Facing Links
|
| 23 |
|
| 24 |
- **GitHub repo:** [sanjuhs/open-env-meta-final-hackathon](https://github.com/sanjuhs/open-env-meta-final-hackathon)
|
| 25 |
+
- **GitHub Gist: training scripts:** [CADForge OpenEnv SFT/GRPO scripts](https://gist.github.com/sanjuhs/10596f688e8b4560910a3b1b137bfeeb)
|
| 26 |
- **Raw training logs and evidence:** [sanjuhs/cadforge-training-evidence](https://huggingface.co/datasets/sanjuhs/cadforge-training-evidence)
|
| 27 |
- Training notebook on this HF Space: [training/cadforge_openenv_training_colab.ipynb](training/cadforge_openenv_training_colab.ipynb)
|
| 28 |
- Open the same notebook in Google Colab: [Colab training notebook](https://colab.research.google.com/github/sanjuhs/open-env-meta-final-hackathon/blob/main/training/cadforge_openenv_training_colab.ipynb)
|
docs/detailed-blog/cadforge-detailed-blog.md
CHANGED
|
@@ -11,7 +11,7 @@ CADForge is an OpenEnv reinforcement-learning environment for code-CAD. The agen
|
|
| 11 |
| GitHub repo | [sanjuhs/open-env-meta-final-hackathon](https://github.com/sanjuhs/open-env-meta-final-hackathon) |
|
| 12 |
| Google Colab smoke notebook | [cadforge_openenv_training_colab.ipynb](https://colab.research.google.com/github/sanjuhs/open-env-meta-final-hackathon/blob/main/training/cadforge_openenv_training_colab.ipynb) |
|
| 13 |
| HF Space | [sanjuhs/cadforge-cadquery-openenv](https://huggingface.co/spaces/sanjuhs/cadforge-cadquery-openenv) |
|
| 14 |
-
|
|
| 15 |
| Raw logs and evidence archive | [sanjuhs/cadforge-training-evidence](https://huggingface.co/datasets/sanjuhs/cadforge-training-evidence) |
|
| 16 |
|
| 17 |
The full SFT and GRPO runs were executed on a RunPod H200 through distinct production scripts. The Colab notebook is the public judge-runnable smoke path: it validates OpenEnv, loads the public dataset, runs the CadQuery reward backend, and launches tiny SFT/GRPO checks using the same source files.
|
|
|
|
| 11 |
| GitHub repo | [sanjuhs/open-env-meta-final-hackathon](https://github.com/sanjuhs/open-env-meta-final-hackathon) |
|
| 12 |
| Google Colab smoke notebook | [cadforge_openenv_training_colab.ipynb](https://colab.research.google.com/github/sanjuhs/open-env-meta-final-hackathon/blob/main/training/cadforge_openenv_training_colab.ipynb) |
|
| 13 |
| HF Space | [sanjuhs/cadforge-cadquery-openenv](https://huggingface.co/spaces/sanjuhs/cadforge-cadquery-openenv) |
|
| 14 |
+
| GitHub Gist: training scripts | [CADForge OpenEnv SFT/GRPO scripts](https://gist.github.com/sanjuhs/10596f688e8b4560910a3b1b137bfeeb) |
|
| 15 |
| Raw logs and evidence archive | [sanjuhs/cadforge-training-evidence](https://huggingface.co/datasets/sanjuhs/cadforge-training-evidence) |
|
| 16 |
|
| 17 |
The full SFT and GRPO runs were executed on a RunPod H200 through distinct production scripts. The Colab notebook is the public judge-runnable smoke path: it validates OpenEnv, loads the public dataset, runs the CadQuery reward backend, and launches tiny SFT/GRPO checks using the same source files.
|
server/app.py
CHANGED
|
@@ -353,7 +353,7 @@ SPACE_HTML = r'''
|
|
| 353 |
<a class="button" href="https://huggingface.co/spaces/sanjuhs/cadforge-cadquery-openenv/blob/main/CADFORGE_BLOG.md" target="_blank">Read mini-blog</a>
|
| 354 |
<a class="button" href="https://huggingface.co/spaces/sanjuhs/cadforge-cadquery-openenv/blob/main/docs/detailed-blog/cadforge-detailed-blog.md" target="_blank">Detailed blog</a>
|
| 355 |
<a class="button" href="https://github.com/sanjuhs/open-env-meta-final-hackathon" target="_blank">Training code</a>
|
| 356 |
-
<a class="button" href="https://github.com/sanjuhs/
|
| 357 |
<a class="button" href="https://huggingface.co/datasets/sanjuhs/cadforge-training-evidence" target="_blank">Training logs</a>
|
| 358 |
</div>
|
| 359 |
</div>
|
|
@@ -396,7 +396,7 @@ SPACE_HTML = r'''
|
|
| 396 |
<div class="repro-links">
|
| 397 |
<a class="repro-link" href="https://github.com/sanjuhs/open-env-meta-final-hackathon" target="_blank">GitHub repo</a>
|
| 398 |
<a class="repro-link" href="https://colab.research.google.com/github/sanjuhs/open-env-meta-final-hackathon/blob/main/training/cadforge_openenv_training_colab.ipynb" target="_blank">Open Colab notebook</a>
|
| 399 |
-
<a class="repro-link" href="https://github.com/sanjuhs/
|
| 400 |
<a class="repro-link" href="https://huggingface.co/datasets/sanjuhs/cadforge-training-evidence" target="_blank">HF logs + evidence</a>
|
| 401 |
</div>
|
| 402 |
</section>
|
|
|
|
| 353 |
<a class="button" href="https://huggingface.co/spaces/sanjuhs/cadforge-cadquery-openenv/blob/main/CADFORGE_BLOG.md" target="_blank">Read mini-blog</a>
|
| 354 |
<a class="button" href="https://huggingface.co/spaces/sanjuhs/cadforge-cadquery-openenv/blob/main/docs/detailed-blog/cadforge-detailed-blog.md" target="_blank">Detailed blog</a>
|
| 355 |
<a class="button" href="https://github.com/sanjuhs/open-env-meta-final-hackathon" target="_blank">Training code</a>
|
| 356 |
+
<a class="button" href="https://gist.github.com/sanjuhs/10596f688e8b4560910a3b1b137bfeeb" target="_blank">Training scripts Gist</a>
|
| 357 |
<a class="button" href="https://huggingface.co/datasets/sanjuhs/cadforge-training-evidence" target="_blank">Training logs</a>
|
| 358 |
</div>
|
| 359 |
</div>
|
|
|
|
| 396 |
<div class="repro-links">
|
| 397 |
<a class="repro-link" href="https://github.com/sanjuhs/open-env-meta-final-hackathon" target="_blank">GitHub repo</a>
|
| 398 |
<a class="repro-link" href="https://colab.research.google.com/github/sanjuhs/open-env-meta-final-hackathon/blob/main/training/cadforge_openenv_training_colab.ipynb" target="_blank">Open Colab notebook</a>
|
| 399 |
+
<a class="repro-link" href="https://gist.github.com/sanjuhs/10596f688e8b4560910a3b1b137bfeeb" target="_blank">Training scripts Gist</a>
|
| 400 |
<a class="repro-link" href="https://huggingface.co/datasets/sanjuhs/cadforge-training-evidence" target="_blank">HF logs + evidence</a>
|
| 401 |
</div>
|
| 402 |
</section>
|
training/GITHUB_GIST_TRAINING_SCRIPTS.md
CHANGED
|
@@ -1,6 +1,8 @@
|
|
| 1 |
# CADForge Training Scripts Bundle
|
| 2 |
|
| 3 |
-
This is the
|
|
|
|
|
|
|
| 4 |
|
| 5 |
**Main GitHub repo:** https://github.com/sanjuhs/open-env-meta-final-hackathon
|
| 6 |
**HF Space:** https://huggingface.co/spaces/sanjuhs/cadforge-cadquery-openenv
|
|
@@ -42,9 +44,9 @@ The production run used shell wrappers so each stage could run in the right Pyth
|
|
| 42 |
- Strict GRPO eval report: https://huggingface.co/spaces/sanjuhs/cadforge-cadquery-openenv/blob/main/training/eval/qwen35-9b-cadforge-grpo-strict-build-20260426-strict-build/eval_report.md
|
| 43 |
- Detailed technical blog: https://huggingface.co/spaces/sanjuhs/cadforge-cadquery-openenv/blob/main/docs/detailed-blog/cadforge-detailed-blog.md
|
| 44 |
|
| 45 |
-
##
|
| 46 |
|
| 47 |
-
After authenticating GitHub CLI with gist permission:
|
| 48 |
|
| 49 |
```bash
|
| 50 |
gh auth login --scopes gist,repo
|
|
|
|
| 1 |
# CADForge Training Scripts Bundle
|
| 2 |
|
| 3 |
+
This is the source bundle mirrored into the public CADForge OpenEnv training-script Gist:
|
| 4 |
+
|
| 5 |
+
https://gist.github.com/sanjuhs/10596f688e8b4560910a3b1b137bfeeb
|
| 6 |
|
| 7 |
**Main GitHub repo:** https://github.com/sanjuhs/open-env-meta-final-hackathon
|
| 8 |
**HF Space:** https://huggingface.co/spaces/sanjuhs/cadforge-cadquery-openenv
|
|
|
|
| 44 |
- Strict GRPO eval report: https://huggingface.co/spaces/sanjuhs/cadforge-cadquery-openenv/blob/main/training/eval/qwen35-9b-cadforge-grpo-strict-build-20260426-strict-build/eval_report.md
|
| 45 |
- Detailed technical blog: https://huggingface.co/spaces/sanjuhs/cadforge-cadquery-openenv/blob/main/docs/detailed-blog/cadforge-detailed-blog.md
|
| 46 |
|
| 47 |
+
## Recreate or Update the GitHub Gist
|
| 48 |
|
| 49 |
+
After authenticating GitHub CLI with gist permission, this is the command used to create the public Gist:
|
| 50 |
|
| 51 |
```bash
|
| 52 |
gh auth login --scopes gist,repo
|
training/cadforge_openenv_training_colab.ipynb
CHANGED
|
@@ -11,7 +11,7 @@
|
|
| 11 |
"**GitHub repo:** [https://github.com/sanjuhs/open-env-meta-final-hackathon](https://github.com/sanjuhs/open-env-meta-final-hackathon) \n",
|
| 12 |
"**Run this notebook in Google Colab:** [https://colab.research.google.com/github/sanjuhs/open-env-meta-final-hackathon/blob/main/training/cadforge_openenv_training_colab.ipynb](https://colab.research.google.com/github/sanjuhs/open-env-meta-final-hackathon/blob/main/training/cadforge_openenv_training_colab.ipynb) \n",
|
| 13 |
"**HF Space:** [https://huggingface.co/spaces/sanjuhs/cadforge-cadquery-openenv](https://huggingface.co/spaces/sanjuhs/cadforge-cadquery-openenv) \n",
|
| 14 |
-
"**
|
| 15 |
"**Raw training logs + evidence archive:** [https://huggingface.co/datasets/sanjuhs/cadforge-training-evidence](https://huggingface.co/datasets/sanjuhs/cadforge-training-evidence)\n",
|
| 16 |
"\n",
|
| 17 |
"## What actually ran where\n",
|
|
@@ -59,7 +59,7 @@
|
|
| 59 |
"- Hugging Face Space repo: https://huggingface.co/spaces/sanjuhs/cadforge-cadquery-openenv\n",
|
| 60 |
"\n",
|
| 61 |
"- GitHub repo, exact source: https://github.com/sanjuhs/open-env-meta-final-hackathon\n",
|
| 62 |
-
"-
|
| 63 |
"- Raw training logs and evidence bundle: https://huggingface.co/datasets/sanjuhs/cadforge-training-evidence\n",
|
| 64 |
"- RunPod/H200 clarification: full 2B/9B SFT and GRPO runs were executed as distinct scripts on RunPod; this Colab is the public smoke rerun path.\n",
|
| 65 |
"- Training dataset: https://huggingface.co/datasets/sanjuhs/cadforge-cadquery-agentic-traces\n",
|
|
@@ -524,7 +524,7 @@
|
|
| 524 |
"\n",
|
| 525 |
"## RunPod H200 production scripts\n",
|
| 526 |
"\n",
|
| 527 |
-
"The real training was launched on RunPod H200 through shell wrappers and Python scripts, then backed up to Hugging Face. Judges can inspect the
|
| 528 |
"\n",
|
| 529 |
"Raw logs and charts are here: https://huggingface.co/datasets/sanjuhs/cadforge-training-evidence\n"
|
| 530 |
]
|
|
|
|
| 11 |
"**GitHub repo:** [https://github.com/sanjuhs/open-env-meta-final-hackathon](https://github.com/sanjuhs/open-env-meta-final-hackathon) \n",
|
| 12 |
"**Run this notebook in Google Colab:** [https://colab.research.google.com/github/sanjuhs/open-env-meta-final-hackathon/blob/main/training/cadforge_openenv_training_colab.ipynb](https://colab.research.google.com/github/sanjuhs/open-env-meta-final-hackathon/blob/main/training/cadforge_openenv_training_colab.ipynb) \n",
|
| 13 |
"**HF Space:** [https://huggingface.co/spaces/sanjuhs/cadforge-cadquery-openenv](https://huggingface.co/spaces/sanjuhs/cadforge-cadquery-openenv) \n",
|
| 14 |
+
"**GitHub Gist: training scripts:** [https://gist.github.com/sanjuhs/10596f688e8b4560910a3b1b137bfeeb](https://gist.github.com/sanjuhs/10596f688e8b4560910a3b1b137bfeeb) \n",
|
| 15 |
"**Raw training logs + evidence archive:** [https://huggingface.co/datasets/sanjuhs/cadforge-training-evidence](https://huggingface.co/datasets/sanjuhs/cadforge-training-evidence)\n",
|
| 16 |
"\n",
|
| 17 |
"## What actually ran where\n",
|
|
|
|
| 59 |
"- Hugging Face Space repo: https://huggingface.co/spaces/sanjuhs/cadforge-cadquery-openenv\n",
|
| 60 |
"\n",
|
| 61 |
"- GitHub repo, exact source: https://github.com/sanjuhs/open-env-meta-final-hackathon\n",
|
| 62 |
+
"- GitHub Gist: training scripts: https://gist.github.com/sanjuhs/10596f688e8b4560910a3b1b137bfeeb\n",
|
| 63 |
"- Raw training logs and evidence bundle: https://huggingface.co/datasets/sanjuhs/cadforge-training-evidence\n",
|
| 64 |
"- RunPod/H200 clarification: full 2B/9B SFT and GRPO runs were executed as distinct scripts on RunPod; this Colab is the public smoke rerun path.\n",
|
| 65 |
"- Training dataset: https://huggingface.co/datasets/sanjuhs/cadforge-cadquery-agentic-traces\n",
|
|
|
|
| 524 |
"\n",
|
| 525 |
"## RunPod H200 production scripts\n",
|
| 526 |
"\n",
|
| 527 |
+
"The real training was launched on RunPod H200 through shell wrappers and Python scripts, then backed up to Hugging Face. Judges can inspect the public training-script Gist here: https://gist.github.com/sanjuhs/10596f688e8b4560910a3b1b137bfeeb\n",
|
| 528 |
"\n",
|
| 529 |
"Raw logs and charts are here: https://huggingface.co/datasets/sanjuhs/cadforge-training-evidence\n"
|
| 530 |
]
|
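This commit adds the same Gist URL to six files, so a small consistency check can confirm that none of them was missed. The following is a minimal sketch, not part of the commit: the helper names `files_missing_link` and `read_files` are hypothetical, the file list is copied from the diff headers above, and the script assumes it is run from the repository root.

```python
from pathlib import Path

# URL copied from the added lines in the diff above.
GIST_URL = "https://gist.github.com/sanjuhs/10596f688e8b4560910a3b1b137bfeeb"

# Files touched by this commit, as listed in the diff headers.
TOUCHED_FILES = [
    "CADFORGE_BLOG.md",
    "README.md",
    "docs/detailed-blog/cadforge-detailed-blog.md",
    "server/app.py",
    "training/GITHUB_GIST_TRAINING_SCRIPTS.md",
    "training/cadforge_openenv_training_colab.ipynb",
]


def files_missing_link(contents, url=GIST_URL):
    """Return the sorted names of files whose text does not contain `url`.

    `contents` maps a file name to that file's full text.
    """
    return sorted(name for name, text in contents.items() if url not in text)


def read_files(root, paths=TOUCHED_FILES):
    """Read each path under `root`; a path that is not a file maps to ''."""
    contents = {}
    for rel in paths:
        path = Path(root) / rel
        contents[rel] = path.read_text(encoding="utf-8") if path.is_file() else ""
    return contents


if __name__ == "__main__":
    # Assumption: the current directory is the repository root.
    missing = files_missing_link(read_files("."))
    print("Files missing the Gist link:", missing or "none")
```

A path that does not exist is reported as missing rather than raising an error, so the same check can run against a partial checkout.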