sanjuhs commited on
Commit
1551e7d
·
verified ·
1 Parent(s): a22ffaf

Link real GitHub training scripts gist

Browse files
CADFORGE_BLOG.md CHANGED
@@ -7,7 +7,7 @@ Detailed judge-facing blog with screenshots, diagrams, references, and all run i
7
  ## Judge Reproducibility Links
8
 
9
  - **GitHub repo:** [sanjuhs/open-env-meta-final-hackathon](https://github.com/sanjuhs/open-env-meta-final-hackathon)
10
- - **Training script bundle / gist-ready source:** [training/GITHUB_GIST_TRAINING_SCRIPTS.md](https://github.com/sanjuhs/open-env-meta-final-hackathon/blob/main/training/GITHUB_GIST_TRAINING_SCRIPTS.md)
11
  - **Google Colab smoke notebook:** [cadforge_openenv_training_colab.ipynb](https://colab.research.google.com/github/sanjuhs/open-env-meta-final-hackathon/blob/main/training/cadforge_openenv_training_colab.ipynb)
12
  - **Raw training logs and evidence:** [sanjuhs/cadforge-training-evidence](https://huggingface.co/datasets/sanjuhs/cadforge-training-evidence)
13
 
 
7
  ## Judge Reproducibility Links
8
 
9
  - **GitHub repo:** [sanjuhs/open-env-meta-final-hackathon](https://github.com/sanjuhs/open-env-meta-final-hackathon)
10
+ - **GitHub Gist: training scripts:** [CADForge OpenEnv SFT/GRPO scripts](https://gist.github.com/sanjuhs/10596f688e8b4560910a3b1b137bfeeb)
11
  - **Google Colab smoke notebook:** [cadforge_openenv_training_colab.ipynb](https://colab.research.google.com/github/sanjuhs/open-env-meta-final-hackathon/blob/main/training/cadforge_openenv_training_colab.ipynb)
12
  - **Raw training logs and evidence:** [sanjuhs/cadforge-training-evidence](https://huggingface.co/datasets/sanjuhs/cadforge-training-evidence)
13
 
README.md CHANGED
@@ -22,7 +22,7 @@ The agent receives a design request, writes a complete CadQuery Python file, and
22
  ## Judge-Facing Links
23
 
24
  - **GitHub repo:** [sanjuhs/open-env-meta-final-hackathon](https://github.com/sanjuhs/open-env-meta-final-hackathon)
25
- - **Training script bundle / gist-ready source:** [training/GITHUB_GIST_TRAINING_SCRIPTS.md](https://github.com/sanjuhs/open-env-meta-final-hackathon/blob/main/training/GITHUB_GIST_TRAINING_SCRIPTS.md)
26
  - **Raw training logs and evidence:** [sanjuhs/cadforge-training-evidence](https://huggingface.co/datasets/sanjuhs/cadforge-training-evidence)
27
  - Training notebook on this HF Space: [training/cadforge_openenv_training_colab.ipynb](training/cadforge_openenv_training_colab.ipynb)
28
  - Open the same notebook in Google Colab: [Colab training notebook](https://colab.research.google.com/github/sanjuhs/open-env-meta-final-hackathon/blob/main/training/cadforge_openenv_training_colab.ipynb)
 
22
  ## Judge-Facing Links
23
 
24
  - **GitHub repo:** [sanjuhs/open-env-meta-final-hackathon](https://github.com/sanjuhs/open-env-meta-final-hackathon)
25
+ - **GitHub Gist: training scripts:** [CADForge OpenEnv SFT/GRPO scripts](https://gist.github.com/sanjuhs/10596f688e8b4560910a3b1b137bfeeb)
26
  - **Raw training logs and evidence:** [sanjuhs/cadforge-training-evidence](https://huggingface.co/datasets/sanjuhs/cadforge-training-evidence)
27
  - Training notebook on this HF Space: [training/cadforge_openenv_training_colab.ipynb](training/cadforge_openenv_training_colab.ipynb)
28
  - Open the same notebook in Google Colab: [Colab training notebook](https://colab.research.google.com/github/sanjuhs/open-env-meta-final-hackathon/blob/main/training/cadforge_openenv_training_colab.ipynb)
docs/detailed-blog/cadforge-detailed-blog.md CHANGED
@@ -11,7 +11,7 @@ CADForge is an OpenEnv reinforcement-learning environment for code-CAD. The agen
11
  | GitHub repo | [sanjuhs/open-env-meta-final-hackathon](https://github.com/sanjuhs/open-env-meta-final-hackathon) |
12
  | Google Colab smoke notebook | [cadforge_openenv_training_colab.ipynb](https://colab.research.google.com/github/sanjuhs/open-env-meta-final-hackathon/blob/main/training/cadforge_openenv_training_colab.ipynb) |
13
  | HF Space | [sanjuhs/cadforge-cadquery-openenv](https://huggingface.co/spaces/sanjuhs/cadforge-cadquery-openenv) |
14
- | Training script bundle / gist-ready source | [training/GITHUB_GIST_TRAINING_SCRIPTS.md](https://github.com/sanjuhs/open-env-meta-final-hackathon/blob/main/training/GITHUB_GIST_TRAINING_SCRIPTS.md) |
15
  | Raw logs and evidence archive | [sanjuhs/cadforge-training-evidence](https://huggingface.co/datasets/sanjuhs/cadforge-training-evidence) |
16
 
17
  The full SFT and GRPO runs were executed on a RunPod H200 through distinct production scripts. The Colab notebook is the public judge-runnable smoke path: it validates OpenEnv, loads the public dataset, runs the CadQuery reward backend, and launches tiny SFT/GRPO checks using the same source files.
 
11
  | GitHub repo | [sanjuhs/open-env-meta-final-hackathon](https://github.com/sanjuhs/open-env-meta-final-hackathon) |
12
  | Google Colab smoke notebook | [cadforge_openenv_training_colab.ipynb](https://colab.research.google.com/github/sanjuhs/open-env-meta-final-hackathon/blob/main/training/cadforge_openenv_training_colab.ipynb) |
13
  | HF Space | [sanjuhs/cadforge-cadquery-openenv](https://huggingface.co/spaces/sanjuhs/cadforge-cadquery-openenv) |
14
+ | GitHub Gist: training scripts | [CADForge OpenEnv SFT/GRPO scripts](https://gist.github.com/sanjuhs/10596f688e8b4560910a3b1b137bfeeb) |
15
  | Raw logs and evidence archive | [sanjuhs/cadforge-training-evidence](https://huggingface.co/datasets/sanjuhs/cadforge-training-evidence) |
16
 
17
  The full SFT and GRPO runs were executed on a RunPod H200 through distinct production scripts. The Colab notebook is the public judge-runnable smoke path: it validates OpenEnv, loads the public dataset, runs the CadQuery reward backend, and launches tiny SFT/GRPO checks using the same source files.
server/app.py CHANGED
@@ -353,7 +353,7 @@ SPACE_HTML = r'''
353
  <a class="button" href="https://huggingface.co/spaces/sanjuhs/cadforge-cadquery-openenv/blob/main/CADFORGE_BLOG.md" target="_blank">Read mini-blog</a>
354
  <a class="button" href="https://huggingface.co/spaces/sanjuhs/cadforge-cadquery-openenv/blob/main/docs/detailed-blog/cadforge-detailed-blog.md" target="_blank">Detailed blog</a>
355
  <a class="button" href="https://github.com/sanjuhs/open-env-meta-final-hackathon" target="_blank">Training code</a>
356
- <a class="button" href="https://github.com/sanjuhs/open-env-meta-final-hackathon/blob/main/training/GITHUB_GIST_TRAINING_SCRIPTS.md" target="_blank">Script gist bundle</a>
357
  <a class="button" href="https://huggingface.co/datasets/sanjuhs/cadforge-training-evidence" target="_blank">Training logs</a>
358
  </div>
359
  </div>
@@ -396,7 +396,7 @@ SPACE_HTML = r'''
396
  <div class="repro-links">
397
  <a class="repro-link" href="https://github.com/sanjuhs/open-env-meta-final-hackathon" target="_blank">GitHub repo</a>
398
  <a class="repro-link" href="https://colab.research.google.com/github/sanjuhs/open-env-meta-final-hackathon/blob/main/training/cadforge_openenv_training_colab.ipynb" target="_blank">Open Colab notebook</a>
399
- <a class="repro-link" href="https://github.com/sanjuhs/open-env-meta-final-hackathon/blob/main/training/GITHUB_GIST_TRAINING_SCRIPTS.md" target="_blank">Training script bundle</a>
400
  <a class="repro-link" href="https://huggingface.co/datasets/sanjuhs/cadforge-training-evidence" target="_blank">HF logs + evidence</a>
401
  </div>
402
  </section>
 
353
  <a class="button" href="https://huggingface.co/spaces/sanjuhs/cadforge-cadquery-openenv/blob/main/CADFORGE_BLOG.md" target="_blank">Read mini-blog</a>
354
  <a class="button" href="https://huggingface.co/spaces/sanjuhs/cadforge-cadquery-openenv/blob/main/docs/detailed-blog/cadforge-detailed-blog.md" target="_blank">Detailed blog</a>
355
  <a class="button" href="https://github.com/sanjuhs/open-env-meta-final-hackathon" target="_blank">Training code</a>
356
+ <a class="button" href="https://gist.github.com/sanjuhs/10596f688e8b4560910a3b1b137bfeeb" target="_blank">Training scripts Gist</a>
357
  <a class="button" href="https://huggingface.co/datasets/sanjuhs/cadforge-training-evidence" target="_blank">Training logs</a>
358
  </div>
359
  </div>
 
396
  <div class="repro-links">
397
  <a class="repro-link" href="https://github.com/sanjuhs/open-env-meta-final-hackathon" target="_blank">GitHub repo</a>
398
  <a class="repro-link" href="https://colab.research.google.com/github/sanjuhs/open-env-meta-final-hackathon/blob/main/training/cadforge_openenv_training_colab.ipynb" target="_blank">Open Colab notebook</a>
399
+ <a class="repro-link" href="https://gist.github.com/sanjuhs/10596f688e8b4560910a3b1b137bfeeb" target="_blank">Training scripts Gist</a>
400
  <a class="repro-link" href="https://huggingface.co/datasets/sanjuhs/cadforge-training-evidence" target="_blank">HF logs + evidence</a>
401
  </div>
402
  </section>
training/GITHUB_GIST_TRAINING_SCRIPTS.md CHANGED
@@ -1,6 +1,8 @@
1
  # CADForge Training Scripts Bundle
2
 
3
- This is the gist-ready source bundle for the CADForge OpenEnv submission.
 
 
4
 
5
  **Main GitHub repo:** https://github.com/sanjuhs/open-env-meta-final-hackathon
6
  **HF Space:** https://huggingface.co/spaces/sanjuhs/cadforge-cadquery-openenv
@@ -42,9 +44,9 @@ The production run used shell wrappers so each stage could run in the right Pyth
42
  - Strict GRPO eval report: https://huggingface.co/spaces/sanjuhs/cadforge-cadquery-openenv/blob/main/training/eval/qwen35-9b-cadforge-grpo-strict-build-20260426-strict-build/eval_report.md
43
  - Detailed technical blog: https://huggingface.co/spaces/sanjuhs/cadforge-cadquery-openenv/blob/main/docs/detailed-blog/cadforge-detailed-blog.md
44
 
45
- ## Optional: Create the GitHub Gist
46
 
47
- After authenticating GitHub CLI with gist permission:
48
 
49
  ```bash
50
  gh auth login --scopes gist,repo
 
1
  # CADForge Training Scripts Bundle
2
 
3
+ This is the source bundle mirrored into the public CADForge OpenEnv training-script Gist:
4
+
5
+ https://gist.github.com/sanjuhs/10596f688e8b4560910a3b1b137bfeeb
6
 
7
  **Main GitHub repo:** https://github.com/sanjuhs/open-env-meta-final-hackathon
8
  **HF Space:** https://huggingface.co/spaces/sanjuhs/cadforge-cadquery-openenv
 
44
  - Strict GRPO eval report: https://huggingface.co/spaces/sanjuhs/cadforge-cadquery-openenv/blob/main/training/eval/qwen35-9b-cadforge-grpo-strict-build-20260426-strict-build/eval_report.md
45
  - Detailed technical blog: https://huggingface.co/spaces/sanjuhs/cadforge-cadquery-openenv/blob/main/docs/detailed-blog/cadforge-detailed-blog.md
46
 
47
+ ## Recreate or Update the GitHub Gist
48
 
49
+ After authenticating GitHub CLI with gist permission, this is the command used to create the public Gist:
50
 
51
  ```bash
52
  gh auth login --scopes gist,repo
training/cadforge_openenv_training_colab.ipynb CHANGED
@@ -11,7 +11,7 @@
11
  "**GitHub repo:** [https://github.com/sanjuhs/open-env-meta-final-hackathon](https://github.com/sanjuhs/open-env-meta-final-hackathon) \n",
12
  "**Run this notebook in Google Colab:** [https://colab.research.google.com/github/sanjuhs/open-env-meta-final-hackathon/blob/main/training/cadforge_openenv_training_colab.ipynb](https://colab.research.google.com/github/sanjuhs/open-env-meta-final-hackathon/blob/main/training/cadforge_openenv_training_colab.ipynb) \n",
13
  "**HF Space:** [https://huggingface.co/spaces/sanjuhs/cadforge-cadquery-openenv](https://huggingface.co/spaces/sanjuhs/cadforge-cadquery-openenv) \n",
14
- "**Training script bundle / gist-ready source:** [https://github.com/sanjuhs/open-env-meta-final-hackathon/blob/main/training/GITHUB_GIST_TRAINING_SCRIPTS.md](https://github.com/sanjuhs/open-env-meta-final-hackathon/blob/main/training/GITHUB_GIST_TRAINING_SCRIPTS.md) \n",
15
  "**Raw training logs + evidence archive:** [https://huggingface.co/datasets/sanjuhs/cadforge-training-evidence](https://huggingface.co/datasets/sanjuhs/cadforge-training-evidence)\n",
16
  "\n",
17
  "## What actually ran where\n",
@@ -59,7 +59,7 @@
59
  "- Hugging Face Space repo: https://huggingface.co/spaces/sanjuhs/cadforge-cadquery-openenv\n",
60
  "\n",
61
  "- GitHub repo, exact source: https://github.com/sanjuhs/open-env-meta-final-hackathon\n",
62
- "- Training script bundle / gist-ready source: https://github.com/sanjuhs/open-env-meta-final-hackathon/blob/main/training/GITHUB_GIST_TRAINING_SCRIPTS.md\n",
63
  "- Raw training logs and evidence bundle: https://huggingface.co/datasets/sanjuhs/cadforge-training-evidence\n",
64
  "- RunPod/H200 clarification: full 2B/9B SFT and GRPO runs were executed as distinct scripts on RunPod; this Colab is the public smoke rerun path.\n",
65
  "- Training dataset: https://huggingface.co/datasets/sanjuhs/cadforge-cadquery-agentic-traces\n",
@@ -524,7 +524,7 @@
524
  "\n",
525
  "## RunPod H200 production scripts\n",
526
  "\n",
527
- "The real training was launched on RunPod H200 through shell wrappers and Python scripts, then backed up to Hugging Face. Judges can inspect the source bundle here: https://github.com/sanjuhs/open-env-meta-final-hackathon/blob/main/training/GITHUB_GIST_TRAINING_SCRIPTS.md\n",
528
  "\n",
529
  "Raw logs and charts are here: https://huggingface.co/datasets/sanjuhs/cadforge-training-evidence\n"
530
  ]
 
11
  "**GitHub repo:** [https://github.com/sanjuhs/open-env-meta-final-hackathon](https://github.com/sanjuhs/open-env-meta-final-hackathon) \n",
12
  "**Run this notebook in Google Colab:** [https://colab.research.google.com/github/sanjuhs/open-env-meta-final-hackathon/blob/main/training/cadforge_openenv_training_colab.ipynb](https://colab.research.google.com/github/sanjuhs/open-env-meta-final-hackathon/blob/main/training/cadforge_openenv_training_colab.ipynb) \n",
13
  "**HF Space:** [https://huggingface.co/spaces/sanjuhs/cadforge-cadquery-openenv](https://huggingface.co/spaces/sanjuhs/cadforge-cadquery-openenv) \n",
14
+ "**GitHub Gist: training scripts:** [https://gist.github.com/sanjuhs/10596f688e8b4560910a3b1b137bfeeb](https://gist.github.com/sanjuhs/10596f688e8b4560910a3b1b137bfeeb) \n",
15
  "**Raw training logs + evidence archive:** [https://huggingface.co/datasets/sanjuhs/cadforge-training-evidence](https://huggingface.co/datasets/sanjuhs/cadforge-training-evidence)\n",
16
  "\n",
17
  "## What actually ran where\n",
 
59
  "- Hugging Face Space repo: https://huggingface.co/spaces/sanjuhs/cadforge-cadquery-openenv\n",
60
  "\n",
61
  "- GitHub repo, exact source: https://github.com/sanjuhs/open-env-meta-final-hackathon\n",
62
+ "- GitHub Gist: training scripts: https://gist.github.com/sanjuhs/10596f688e8b4560910a3b1b137bfeeb\n",
63
  "- Raw training logs and evidence bundle: https://huggingface.co/datasets/sanjuhs/cadforge-training-evidence\n",
64
  "- RunPod/H200 clarification: full 2B/9B SFT and GRPO runs were executed as distinct scripts on RunPod; this Colab is the public smoke rerun path.\n",
65
  "- Training dataset: https://huggingface.co/datasets/sanjuhs/cadforge-cadquery-agentic-traces\n",
 
524
  "\n",
525
  "## RunPod H200 production scripts\n",
526
  "\n",
527
+ "The real training was launched on RunPod H200 through shell wrappers and Python scripts, then backed up to Hugging Face. Judges can inspect the public training-script Gist here: https://gist.github.com/sanjuhs/10596f688e8b4560910a3b1b137bfeeb\n",
528
  "\n",
529
  "Raw logs and charts are here: https://huggingface.co/datasets/sanjuhs/cadforge-training-evidence\n"
530
  ]