Finalize CADForge mini blog evidence
CADFORGE_BLOG.md +1 -1
@@ -109,7 +109,7 @@ The real run used Unsloth for LoRA SFT and TRL GRPO for environment reward train
 The raw logs are public so the training claims are inspectable, not just summarized:
 
 - Training evidence dataset: [sanjuhs/cadforge-training-evidence](https://huggingface.co/datasets/sanjuhs/cadforge-training-evidence)
--
+- Compressed archive on that dataset: `archives/cadforge-training-evidence-20260426.tar.gz`
 - Per-sample reward traces: `training/logs/*completions.jsonl`
 - Generated plots and parsed metrics: `training/reports/*`
 
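Since the blog points readers at `training/logs/*completions.jsonl` to inspect the reward traces, a small reader script might make that claim easier to check. The sketch below is only an illustration: it assumes each JSONL line is a JSON object and that the reward sits under a key named `reward`, which is a hypothetical field name not confirmed by this commit. The function name `summarize_rewards` is likewise made up for this example.

```python
import json
from pathlib import Path
from statistics import mean


def summarize_rewards(log_dir, reward_key="reward"):
    """Return {file name: (sample count, mean reward)} for *completions.jsonl files.

    reward_key is a hypothetical field name; adjust it to the real log schema.
    """
    summary = {}
    for path in sorted(Path(log_dir).glob("*completions.jsonl")):
        rewards = []
        with path.open(encoding="utf-8") as handle:
            for line in handle:
                line = line.strip()
                if not line:
                    continue  # skip blank lines between records
                record = json.loads(line)
                if reward_key in record:
                    rewards.append(float(record[reward_key]))
        if rewards:  # files without any matching records are left out
            summary[path.name] = (len(rewards), mean(rewards))
    return summary
```

After downloading and unpacking the evidence, calling `summarize_rewards("training/logs")` would give one `(count, mean)` pair per trace file, assuming the key name matches the actual logs.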