Commit History

Add W&B training run link to blog
0738dc1

siddeshwar-kagatikar committed on

Fix training rc=127 by using python -m fallback and tee logs to stdout
8828fdd

siddeshwar-kagatikar committed on

Document all Space variables and secrets in README
274f638

siddeshwar-kagatikar committed on

Fix LaTeX math rendering in blog (use \$\$ and \$ delimiters)
9e3e5ff

siddeshwar-kagatikar committed on

Add Trace Net blog post with final checkpoint link
72cba6b

siddeshwar-kagatikar committed on

Add TraceNet slide deck link to README
0a16175

siddeshwar-kagatikar committed on

Remove stale COMPARE_FINETUNED_DASHBOARD references (fix 500 on Space)
55d75e0

siddeshwar-kagatikar committed on

Stop training failures from killing the API server (fixes 500 on Space)
04ad851

siddeshwar-kagatikar committed on

Fix dashboard 404 by shipping artifacts/ into the Space image
7e4ee5e

siddeshwar-kagatikar committed on

Add missing post_training_benchmark_dashboard.html to HF Space
07286e6

siddeshwar-kagatikar committed on

Always render Pre/Post-Training dashboard buttons; drop snapshot+difficulty cards
243dfa6

siddeshwar-kagatikar committed on

Add pre/post benchmark dashboard options in Space UI
9f98669

siddeshwar-kagatikar committed on

added images (hf-space text-only subset)
957b4b2

siddeshwar-kagatikar committed on

Add evaluation, minor updates to HF space
8ad6382

ritishshrirao committed on

Add per-batch generation liveness logging
55c5f82

siddeshwar-kagatikar committed on

Upload intermediate GRPO checkpoints to HF Hub on every save
b0b586a

siddeshwar-kagatikar committed on

Make self-play training resilient to HF Space restarts
2e14f6d

siddeshwar-kagatikar committed on

feat(training): improve self-play progress visibility and reward diagnostics
4aca4f5

siddeshwar-kagatikar committed on

Minor config changes
3e893cd

ritishshrirao committed on

Update training config, add checkpointing on HF
e44cdee

ritishshrirao committed on

test hf space commit
d822755

ritishshrirao committed on

Sync current main to Hugging Face Space
fe1f842

siddeshwar-kagatikar committed on

fix(rewards): never crash GRPO on malformed completions
d814291

siddeshwar-kagatikar committed on