Spaces:
Sleeping
Sleeping
Commit History
3-level curriculum + 7B + reward fixes 9868dfb
Add training evidence: curriculum results, plots (LFS), parse_job_log helper 805b735
changed model d745c55
training script 1272145
Shabista Sehar commited on
---- aa1acaa
Shabista Sehar commited on
feat: implement GRPO training pipeline for bail assessment model and update README credits 472a28c
feat: implement dataset loader, environment, and GRPO training pipeline for undertrial bail prediction bf8f1ff
implemented d8f8a45
Shabista Sehar commited on