undertrial-ai / openenv.yaml

Commit History

3-level curriculum + 7B + reward fixes
9868dfb

Draken1606 commited on

changed model
d745c55

Draken1606 commited on

----
aa1acaa

Shabista Sehar commited on

modified
a085ad1

Shabista Sehar commited on

implemented
d8f8a45

Shabista Sehar commited on

Fix 8 compliance gaps: repeat-action dedup+cache, min-steps hard block, criminal history tool (12th action), efficiency removed from training formula, circular import cleaned, yaml formula synced
898bc18

Draken1606 commited on

Fix B1-B4: add 4 actions to openenv.yaml, export from __init__/client, fix reward range, remove global random.seed
03a48f9

Draken1606 commited on

Fix model mismatch: openenv.yaml now correctly declares Qwen2.5-7B-Instruct
60db9ec

Draken1606 commited on

OpenEnv compliance: proper base class, SUPPORTS_CONCURRENT_SESSIONS, state @property, updated openenv.yaml
b00feb0

Draken1606 commited on

first commit
4052d84

Draken1606 commited on