OSINT / scripts

Commit History

Fix training rc=127 by using python -m fallback and tee logs to stdout
8828fdd

siddeshwar-kagatikar commited on

Stop training failures from killing the API server (fixes 500 on Space)
04ad851

siddeshwar-kagatikar commited on

Add evaluation, minor updates to HF space
8ad6382

ritishshrirao commited on

Make self-play training resilient to HF Space restarts
2e14f6d

siddeshwar-kagatikar commited on

Update training config, add checkpointing on HF
e44cdee

ritishshrirao commited on

Sync current main to Hugging Face Space
fe1f842

siddeshwar-kagatikar commited on

fix(rewards): never crash GRPO on malformed completions
d814291

siddeshwar-kagatikar commited on