TheJackBright

Deploy PolyGuard OpenEnv Space

877add7 verified 12 days ago

Architecture

POLYGUARD-RL uses an OpenEnv-first monorepo architecture with six layers:

1. Data ingestion and retrieval index.
2. Predictive safety, graph, tabular risk, and dosing models.
3. Multi-agent orchestration graph.
4. Hierarchical RL training stack.
5. Safety governance and anti-cheat controls.
6. FastAPI, OpenEnv, and React deployment surfaces.

Data Flow

raw/local knowledge -> processed tables -> scenarios -> SFT/GRPO corpora
                                |
                                v
                       PolyGuardEnv reset/step/state
                                |
                                v
           agent stack -> verifier reward -> training/evaluation reports
                                |
                                v
                    docs/results + README + HF Space
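
The reset/step/state loop in the middle of the diagram can be sketched roughly as follows. This is a minimal illustration only: `ToyEnv`, its reward rule, and `run_episode` are hypothetical stand-ins invented here, not the real `PolyGuardEnv` or the OpenEnv API.

```python
from __future__ import annotations


class ToyEnv:
    """Hypothetical stand-in for PolyGuardEnv (not the project's real class)."""

    def __init__(self, max_steps: int = 3) -> None:
        self.max_steps = max_steps
        self._step_count = 0
        self._trace: list[tuple[str, float]] = []

    def reset(self) -> dict:
        # Start a fresh episode and return the initial observation.
        self._step_count = 0
        self._trace = []
        return self.state()

    def state(self) -> dict:
        # Read-only snapshot of the current episode.
        return {"step": self._step_count, "trace": list(self._trace)}

    def step(self, action: str) -> tuple[dict, float, bool]:
        # Placeholder verifier reward (assumption): 1.0 for "safe", else 0.0.
        reward = 1.0 if action == "safe" else 0.0
        self._step_count += 1
        self._trace.append((action, reward))
        done = self._step_count >= self.max_steps
        return self.state(), reward, done


def run_episode(env: ToyEnv, policy) -> tuple[float, dict]:
    """Agent stack -> verifier reward -> summary usable in a report."""
    obs = env.reset()
    total, done = 0.0, False
    while not done:
        action = policy(obs)
        obs, reward, done = env.step(action)
        total += reward
    return total, obs
```

Calling `run_episode(ToyEnv(), lambda obs: "safe")` runs one full episode and returns the accumulated reward together with the final state, which is the kind of record a training or evaluation report could be built from.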

Runtime Boundaries

- Environment code owns state transition, legality, rewards, anti-cheat, and traces.
- Agent code owns candidate interpretation, routing, planning, critique, and explanation.
- Training code owns SFT, GRPO, reward logging, adapters, and registry metadata.
- Evaluation code owns baselines, perturbations, reports, and plots.
- Deployment code owns OpenEnv validation and HF Space push.
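
One way to picture the environment/agent boundary is the sketch below, in which the agent only proposes an action while the environment alone decides legality, assigns the reward, and records the trace. All names here (`Agent`, `BoundaryEnv`, `EchoAgent`, `evaluate`, and the action strings) are hypothetical and chosen for illustration; they are not taken from the repository.

```python
from typing import Protocol


class Agent(Protocol):
    """Agent side of the boundary: it may only propose an action."""

    def propose(self, observation: dict) -> str: ...


class BoundaryEnv:
    """Hypothetical environment: owns legality checks, rewards, and traces."""

    # Made-up action names, for illustration only.
    LEGAL_ACTIONS = frozenset({"keep", "adjust", "stop"})

    def __init__(self) -> None:
        self.trace: list = []

    def is_legal(self, action: str) -> bool:
        return action in self.LEGAL_ACTIONS

    def score(self, action: str) -> float:
        # The reward is computed here, never supplied by the agent.
        legal = self.is_legal(action)
        reward = 1.0 if legal else -1.0
        self.trace.append({"action": action, "legal": legal, "reward": reward})
        return reward


class EchoAgent:
    """Trivial agent that always proposes the same action."""

    def __init__(self, action: str) -> None:
        self._action = action

    def propose(self, observation: dict) -> str:
        return self._action


def evaluate(env: BoundaryEnv, agent: Agent, observation: dict) -> float:
    # The agent proposes; the environment judges and logs.
    return env.score(agent.propose(observation))
```

Because the agent never touches the reward or the trace, an attempt to emit an unsupported action such as `"grant_reward"` is simply scored as illegal and logged by the environment.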