train: tighten Colab notebook (drop redundant cell, simplify section 5) 2712bf6 verified Pratyush-01 committed 12 days ago
train: simplify Colab notebook (drop redundant cells) 0134795 verified Pratyush-01 committed 12 days ago
ui: clear stale session/transcript when user switches model preset c2a38db verified Pratyush-01 committed 12 days ago
cleanup: trim verbose comments, drop dead code, fix stale tests, proper Dockerfile + .gitignore 7f40db3 verified Pratyush-01 committed 12 days ago
frontend: sync clean source — drop ComparePane + Compare LLMs tab + Which-endpoint copy; ship 3-preset picker 8225d8a verified Pratyush-01 committed 12 days ago
blog: restore plot images + fix unclosed code span in reward-hacking table 2f81a49 verified Pratyush-01 committed 12 days ago
blog: add reward-hacking section (3 patched exploits + 3 by-construction invariants) 370f89d verified Pratyush-01 committed 12 days ago
Dockerfile: bump SPA cache-bust to 10 — force fresh build (drops Compare LLMs tab + stale endpoint copy) dd80b32 verified Pratyush-01 committed 12 days ago
docs: README only lists 3 trained systems; drop tier-3 held-out claims and stale example d6cb922 verified Pratyush-01 committed 12 days ago
docs: drop per-component reward breakdown chart and bullets 1f50272 verified Pratyush-01 committed 12 days ago
docs: drop per-component reward breakdown chart and bullets 6cbd278 verified Pratyush-01 committed 12 days ago
W&B disclaimer: point users to their own account, drop hardcoded entity URL fe6567c verified Pratyush-01 committed 12 days ago
W&B disclaimer: point users to their own account, drop hardcoded entity URL 018639f verified Pratyush-01 committed 12 days ago
W&B disclaimer: point users to their own account, drop hardcoded entity URL af226f1 verified Pratyush-01 committed 12 days ago
notebook: W&B disclaimer points users to their own account, not the author's c303f34 verified Pratyush-01 committed 12 days ago
notebook: add W&B disclaimer banner before SFT and GRPO subprocess launches 5f628de verified Pratyush-01 committed 12 days ago
notebook: visible pip install + sys.path refresh + import verify 16cf7b6 verified Pratyush-01 committed 12 days ago
notebook: pull from Space repo (not stale dataset), default profile=3b 8342da7 verified Pratyush-01 committed 12 days ago
registry: SUPPORTED_SYSTEMS = only 3 trained systems 5eb18f4 verified Pratyush-01 committed 12 days ago
registry: remove tier3 from SUPPORTED_SYSTEMS; mark as TODO e318f30 verified Pratyush-01 committed 12 days ago
blog: show all 6 systems honestly; GRPO=3, SFT=6, no held-out claims f453a88 verified Pratyush-01 committed 12 days ago
cleanup: remove writeup.md (superseded by blog.md) 24673b2 verified Pratyush-01 committed 12 days ago
blog: expand system tiers table, add extensibility section c1a4761 verified Pratyush-01 committed 12 days ago
cleanup: strip verbose comments from train/job_train.py 7c990b2 verified Pratyush-01 committed 12 days ago
cleanup: strip verbose comments from physix/models.py 605f6fe verified Pratyush-01 committed 12 days ago
cleanup: strip verbose comments from physix/server/app.py 0641193 verified Pratyush-01 committed 12 days ago
cleanup: strip verbose comments from physix/server/providers.py a88dae7 verified Pratyush-01 committed 12 days ago
cleanup: strip verbose comments from physix/training/dataset.py b1c6aa6 verified Pratyush-01 committed 12 days ago
cleanup: strip verbose comments from physix/training/sft.py 0b8f87b verified Pratyush-01 committed 12 days ago
cleanup: strip verbose comments from physix/training/prompt.py c59b8f5 verified Pratyush-01 committed 12 days ago
cleanup: strip verbose comments from physix/training/reward_fns.py 0128624 verified Pratyush-01 committed 12 days ago
cleanup: strip verbose comments from physix/training/loop.py b4bd6d8 verified Pratyush-01 committed 12 days ago
Add blog.md: accurate systems table, SFT+GRPO plots 7886d89 verified Pratyush-01 committed 12 days ago
Fix links table: absolute URLs for blog, notebook, checkpoint ecdfaaf verified Pratyush-01 committed 12 days ago
Update plots, fix README table fmt, update training result observations 6315db4 verified Pratyush-01 committed 12 days ago