polyguard-openenv / docs /hierarchical_rl.md
TheJackBright's picture
Deploy PolyGuard OpenEnv Space
877add7 verified
# Hierarchical RL
Supervisor selects macro mode (`REGIMEN_OPT`, `DOSE_OPT`, `REVIEW`), planner selects constrained candidate action, and dosing policy specializes dose-sensitive transitions.