undertrial-ai/server/undertrial_environment.py

Commit History

feat: implement dataset loader, environment, and GRPO training pipeline for undertrial bail prediction
bf8f1ff

Draken1606 committed on

modified
a085ad1

Shabista Sehar committed on

implemented
d8f8a45

Shabista Sehar committed on

Fix A3 (OOM eval), B9 (NDPS eligibility), B3 (direction-gated computation bonus), A8-pt2 (episode_id case lookup)
4855450

Draken1606 committed on

Fix 8 compliance gaps: repeat-action dedup+cache, min-steps hard block, criminal history tool (12th action), efficiency removed from training formula, circular import cleaned, yaml formula synced
898bc18

Draken1606 committed on

Reward overhaul: add compute_reasoning_quality (anchoring+arithmetic+specificity+consistency), parity-grounds penalty, reduce outcome 40%->30%, add 10% reasoning quality signal
ca62faa

Draken1606 committed on

Fix ACTION_MAP gaps: add 4 new tools to REST + WebSocket handlers; remove StepResult import collision
a1b1513

Draken1606 committed on

Add 4 missing actions: read_submissions, assess_flight_risk, check_case_factors, apply_proportionality (fixes 4.3d/e/g/h/i)
ce6728e

Draken1606 committed on

feat: implement core UndertriAI OpenEnv training environment with tool dispatch and reward logic
a1a7fd3

Draken1606 committed on

Fix all audit gaps: custody neutral, parity-first bias, skip penalty 0.40, statutory process reward, /observation endpoint, reset() timeout, drift determinism
2bc545f

Draken1606 committed on

Fix 5 audit gaps: conditional bail, action history, efficiency reward, train/val split, env API routing
6218d9a

Draken1606 committed on

Fix 6 vulnerabilities: /state crash, reward clamp, condition reward, XML exploit, tool-skip bypass, timeout enforcement
d76d092

Draken1606 committed on

Add seed param to /reset: demo pins to seed=0 per stage for consistent known episodes
9932c2e

Draken1606 committed on

Fix crash: call super().__init__() so self.rubric is set before _reset_rubric()
33279ea

Draken1606 committed on

OpenEnv compliance: proper base class, SUPPORTS_CONCURRENT_SESSIONS, state @property, updated openenv.yaml
b00feb0

Draken1606 committed on

first commit
4052d84

Draken1606 committed on