undertrial-ai / README.md

Commit History

3-level curriculum + 7B + reward fixes
9868dfb

Draken1606 committed on

Add training evidence: curriculum results, plots (LFS), parse_job_log helper
805b735

Draken1606 committed on

changed model
d745c55

Draken1606 committed on

training script
1272145

Shabista Sehar committed on

----
aa1acaa

Shabista Sehar committed on

feat: implement GRPO training pipeline for bail assessment model and update README credits
472a28c

Draken1606 committed on

feat: implement dataset loader, environment, and GRPO training pipeline for undertrial bail prediction
bf8f1ff

Draken1606 committed on

implemented
d8f8a45

Shabista Sehar committed on

Rewrite README as full hackathon submission doc with env design, reward formula, curriculum, training arch
eccca7f

Draken1606 committed on

Implement client.py HTTP methods + add HF Spaces README metadata
6d324e1

Draken1606 committed on

Fix Dockerfile for HF Spaces non-root user + update README URLs
944a2a1

Draken1606 committed on

first commit
4052d84

Draken1606 committed on