feat: implement dataset loader, environment, and GRPO training pipeline for undertrial bail prediction bf8f1ff Draken1606 commited on 14 days ago
Fix A3 (OOM eval), B9 (NDPS eligibility), B3 (direction-gated computation bonus), A8-pt2 (episode_id case lookup) 4855450 Draken1606 commited on 14 days ago
Fix 8 compliance gaps: repeat-action dedup+cache, min-steps hard block, criminal history tool (12th action), efficiency removed from training formula, circular import cleaned, yaml formula synced 898bc18 Draken1606 commited on 14 days ago
Reward overhaul: add compute_reasoning_quality (anchoring+arithmetic+specificity+consistency), parity-grounds penalty, reduce outcome 40%->30%, add 10% reasoning quality signal ca62faa Draken1606 commited on 14 days ago
Fix ACTION_MAP gaps: add 4 new tools to REST + WebSocket handlers; remove StepResult import collision a1b1513 Draken1606 commited on 15 days ago
Add 4 missing actions: read_submissions, assess_flight_risk, check_case_factors, apply_proportionality (fixes 4.3d/e/g/h/i) ce6728e Draken1606 commited on 15 days ago
feat: implement core UndertriAI OpenEnv training environment with tool dispatch and reward logic a1a7fd3 Draken1606 commited on 15 days ago
Fix all audit gaps: custody neutral, parity-first bias, skip penalty 0.40, statutory process reward, /observation endpoint, reset() timeout, drift determinism 2bc545f Draken1606 commited on 15 days ago
Fix 5 audit gaps: conditional bail, action history, efficiency reward, train/val split, env API routing 6218d9a Draken1606 commited on 15 days ago
Fix 6 vulnerabilities: /state crash, reward clamp, condition reward, XML exploit, tool-skip bypass, timeout enforcement d76d092 Draken1606 commited on 15 days ago
Add seed param to /reset: demo pins to seed=0 per stage for consistent known episodes 9932c2e Draken1606 commited on 15 days ago
Fix crash: call super().__init__() so self.rubric is set before _reset_rubric() 33279ea Draken1606 commited on 15 days ago
OpenEnv compliance: proper base class, SUPPORTS_CONCURRENT_SESSIONS, state @property, updated openenv.yaml b00feb0 Draken1606 commited on 15 days ago