OSINT / src /osint_env /training /self_play.py

Commit History

Add evaluation, minor updates to HF space
8ad6382

ritishshrirao commited on

Add per-batch generation liveness logging
55c5f82

siddeshwar-kagatikar commited on

Upload intermediate GRPO checkpoints to HF Hub on every save
b0b586a

siddeshwar-kagatikar commited on

Make self-play training resilient to HF Space restarts
2e14f6d

siddeshwar-kagatikar commited on

feat(training): improve self-play progress visibility and reward diagnostics
4aca4f5

siddeshwar-kagatikar commited on

Minor config changes
3e893cd

ritishshrirao commited on

Update training config, add checkpointing on HF
e44cdee

ritishshrirao commited on

Sync current main to Hugging Face Space
fe1f842

siddeshwar-kagatikar commited on

fix(rewards): never crash GRPO on malformed completions
d814291

siddeshwar-kagatikar commited on