blog: restore plot images + fix unclosed code span in reward-hacking table 2f81a49 verified Pratyush-01 commited on 12 days ago
blog: add reward-hacking section (3 patched exploits + 3 by-construction invariants) 370f89d verified Pratyush-01 commited on 12 days ago
docs: drop per-component reward breakdown chart and bullets 6cbd278 verified Pratyush-01 commited on 12 days ago
blog: show all 6 systems honestly; GRPO=3, SFT=6, no held-out claims f453a88 verified Pratyush-01 commited on 12 days ago
cleanup: remove writeup.md (superseded by blog.md) 24673b2 verified Pratyush-01 commited on 12 days ago
blog: expand system tiers table, add extensibility section c1a4761 verified Pratyush-01 commited on 12 days ago
Add blog.md: accurate systems table, SFT+GRPO plots 7886d89 verified Pratyush-01 commited on 12 days ago
ui: status banner + endpoint guide for physix-infer cold-start a507dcb verified Pratyush-01 commited on 12 days ago
docs: explain cold small model context for SFT rationale 12e2f97 verified Pratyush-01 commited on 12 days ago
docs: expand reward section, remove held-out mention 53f677a verified Pratyush-01 commited on 12 days ago