Spaces:

Draken1606
/

undertrial-ai

Sleeping

App Files Files Community

undertrial-ai / README.md

Commit History

3-level

2c93c00

Draken1606 commited on 4 days ago

3-level curriculum + 7B + reward fixes

9868dfb

Draken1606 commited on 4 days ago

Add training evidence: curriculum results, plots (LFS), parse_job_log helper

805b735

Draken1606 commited on 13 days ago

changed model

d745c55

Draken1606 commited on 13 days ago

training script

1272145

Shabista Sehar commited on 13 days ago

----

aa1acaa

Shabista Sehar commited on 14 days ago

feat: implement GRPO training pipeline for bail assessment model and update README credits

472a28c

Draken1606 commited on 14 days ago

feat: implement dataset loader, environment, and GRPO training pipeline for undertrial bail prediction

bf8f1ff

Draken1606 commited on 15 days ago

implemented

d8f8a45

Shabista Sehar commited on 15 days ago

Rewrite README as full hackathon submission doc with env design, reward formula, curriculum, training arch

eccca7f

Draken1606 commited on 16 days ago

Implement client.py HTTP methods + add HF Spaces README metadata

6d324e1

Draken1606 commited on 17 days ago

Fix Dockerfile for HF Spaces non-root user + update README URLs

944a2a1

Draken1606 commited on 17 days ago

first commit

4052d84

Draken1606 commited on 17 days ago

Commit History

3-level 2c93c00

3-level curriculum + 7B + reward fixes 9868dfb

Add training evidence: curriculum results, plots (LFS), parse_job_log helper 805b735

changed model d745c55

training script 1272145

---- aa1acaa

feat: implement GRPO training pipeline for bail assessment model and update README credits 472a28c

feat: implement dataset loader, environment, and GRPO training pipeline for undertrial bail prediction bf8f1ff

implemented d8f8a45

Rewrite README as full hackathon submission doc with env design, reward formula, curriculum, training arch eccca7f

Implement client.py HTTP methods + add HF Spaces README metadata 6d324e1

Fix Dockerfile for HF Spaces non-root user + update README URLs 944a2a1

first commit 4052d84

3-level

2c93c00

3-level curriculum + 7B + reward fixes

9868dfb

Add training evidence: curriculum results, plots (LFS), parse_job_log helper

805b735

changed model

d745c55

training script

1272145

----

aa1acaa

feat: implement GRPO training pipeline for bail assessment model and update README credits

472a28c

feat: implement dataset loader, environment, and GRPO training pipeline for undertrial bail prediction

bf8f1ff

implemented

d8f8a45

Rewrite README as full hackathon submission doc with env design, reward formula, curriculum, training arch

eccca7f

Implement client.py HTTP methods + add HF Spaces README metadata

6d324e1

Fix Dockerfile for HF Spaces non-root user + update README URLs

944a2a1

first commit

4052d84