Spaces:

Draken1606
/

undertrial-ai

Running

App Files Files Community

undertrial-ai / openenv.yaml

Commit History

3-level

2c93c00

Draken1606 commited on 7 days ago

3-level curriculum + 7B + reward fixes

9868dfb

Draken1606 commited on 7 days ago

changed model

d745c55

Draken1606 commited on 16 days ago

----

aa1acaa

Shabista Sehar commited on 17 days ago

modified

a085ad1

Shabista Sehar commited on 18 days ago

implemented

d8f8a45

Shabista Sehar commited on 18 days ago

Fix 8 compliance gaps: repeat-action dedup+cache, min-steps hard block, criminal history tool (12th action), efficiency removed from training formula, circular import cleaned, yaml formula synced

898bc18

Draken1606 commited on 18 days ago

Fix B1-B4: add 4 actions to openenv.yaml, export from init/client, fix reward range, remove global random.seed

03a48f9

Draken1606 commited on 18 days ago

Fix model mismatch: openenv.yaml now correctly declares Qwen2.5-7B-Instruct

60db9ec

Draken1606 commited on 19 days ago

OpenEnv compliance: proper base class, SUPPORTS_CONCURRENT_SESSIONS, state @property, updated openenv.yaml

b00feb0

Draken1606 commited on 19 days ago

first commit

4052d84

Draken1606 commited on 20 days ago