adithya9903's picture
Upload PolyGuard training artifacts: docs/results
b5cc210 verified
# Result Artifacts
These tracked files mirror the latest local smoke/evaluation artifacts so the README can show stable evidence even though `outputs/` and `checkpoints/` are intentionally git-ignored.
Current status:
- OpenEnv structure/runtime validation passes locally.
- Test suite passes locally.
- Frontend production build passes locally.
- SFT and GRPO artifacts in this folder are non-fallback TRL Transformers evidence from a tiny local compliance run.
- `postsave_inference.json` loads the merged artifact rather than the fallback policy.
- `improvement_report.json` shows positive average-reward improvement against the no-change baseline.
- `hf_space_verification.json` records a live Hugging Face Space validation pass.
- `active_model_manifest.json` records the currently activated local product model. As of April 26, 2026 this points at the local Qwen 0.5B smoke artifact while the full remote Qwen sweep continues.
For a stronger final pitch, replace these artifacts after a larger Colab/HF GPU run:
- `sft_trl_run.json`
- `grpo_trl_run.json`
- `postsave_inference.json`
- `improvement_report.json`
- all plot PNGs
- `hf_space_verification.json`
- `active_model_manifest.json`