Fix training rc=127 by using python -m fallback and tee logs to stdout 8828fdd siddeshwar-kagatikar committed 12 days ago
Document all Space variables and secrets in README 274f638 siddeshwar-kagatikar committed 12 days ago
Fix LaTeX math rendering in blog (use \$\$ and \$ delimiters) 9e3e5ff siddeshwar-kagatikar committed 12 days ago
Add Trace Net blog post with final checkpoint link 72cba6b siddeshwar-kagatikar committed 12 days ago
Remove stale COMPARE_FINETUNED_DASHBOARD references (fix 500 on Space) 55d75e0 siddeshwar-kagatikar committed 12 days ago
Stop training failures from killing the API server (fixes 500 on Space) 04ad851 siddeshwar-kagatikar committed 12 days ago
Fix dashboard 404 by shipping artifacts/ into the Space image 7e4ee5e siddeshwar-kagatikar committed 12 days ago
Add missing post_training_benchmark_dashboard.html to HF Space 07286e6 siddeshwar-kagatikar committed 12 days ago
Always render Pre/Post-Training dashboard buttons; drop snapshot+difficulty cards 243dfa6 siddeshwar-kagatikar committed 12 days ago
Add pre/post benchmark dashboard options in Space UI 9f98669 siddeshwar-kagatikar committed 12 days ago
Upload intermediate GRPO checkpoints to HF Hub on every save b0b586a siddeshwar-kagatikar committed 12 days ago
Make self-play training resilient to HF Space restarts 2e14f6d siddeshwar-kagatikar committed 12 days ago
feat(training): improve self-play progress visibility and reward diagnostics 4aca4f5 siddeshwar-kagatikar committed 12 days ago
fix(rewards): never crash GRPO on malformed completions d814291 siddeshwar-kagatikar committed 13 days ago