Commit History

Update README.md
3acf10b
verified

YashashMathur committed on

Update README.md
e3603dc
verified

YashashMathur committed on

Update README.md
cb6a4ca
verified

YashashMathur committed on

Update README.md
add766c
verified

YashashMathur committed on

Update train.py
f3fc038
verified

YashashMathur committed on

Update README.md
3c7a3a5
verified

YashashMathur committed on

Delete mock_worker.py
aa1d7f7
verified

YashashMathur committed on

Update train.py
6a2dd66
verified

YashashMathur committed on

Upload MODEL_CARD.md
5d8f1e9
verified

YashashMathur committed on

Update BLOG.md
1f65562
verified

YashashMathur committed on

Create BLOG.md
767ffe9
verified

YashashMathur committed on

Update README.md
904779b
verified

YashashMathur committed on

Update README.md
926997e
verified

YashashMathur committed on

Update train.py
f330c3f
verified

YashashMathur committed on

Fix: Use Level 1 scenarios only for training
7a19964
verified

YashashMathur committed on

Add MemoryLedger
9a90a52
verified

YashashMathur committed on

Add WorldModelSimulator
2327c5a
verified

YashashMathur committed on

Training script with WorldModel/Memory integration
48a7cf6
verified

YashashMathur committed on

Update README with all required links
1a5a502
verified

YashashMathur committed on

Add OpenEnv manifest
609a677
verified

YashashMathur committed on

More SFT warmup (40 steps) for better JSON format
aac0ab9
verified

YashashMathur committed on

Added friend laptop worker via ngrok
2688686
verified

YashashMathur committed on

Added friend worker + local worker dispatch
07656ee
verified

YashashMathur committed on

Updated: 250 steps, K=2, LR=1e-5, temp 1.0-0.7
a0297c3
verified

YashashMathur committed on

Add worker setup guide
7275c33
verified

YashashMathur committed on

Add mock worker for demo
7bffd13
verified

YashashMathur committed on

Add worker client with local ngrok URL
edc3562
verified

YashashMathur committed on

train: higher LR=1e-5, more SFT=20, lower temp=1.0->0.7
23c3a1b

YashashMathur committed on

chore: remove binary artifacts from git
5e5d438

YashashMathur committed on

faster training: lower temp, higher LR, more SFT
c84d93d
verified

YashashMathur committed on

Upload Dockerfile with huggingface_hub
31fe512
verified

YashashMathur committed on

Upload Dockerfile with huggingface_hub
524f673
verified

YashashMathur committed on

Upload Dockerfile with huggingface_hub
f5ca8fb
verified

YashashMathur committed on

Upload train.py with huggingface_hub
e5115bd
verified

YashashMathur committed on

Upload Dockerfile with huggingface_hub
da61617
verified

YashashMathur committed on

Upload folder using huggingface_hub
8b092ea
verified

YashashMathur committed on

fix: C-4 reward clamp, C-6 HF_TOKEN, W-2 citation floor, W-9 dup import
b022bda
verified

YashashMathur committed on

fix: force torch 2.5.1+cu121 after unsloth to prevent colab-new downgrade
9e1ad05
verified

YashashMathur committed on

fix: upgrade PyTorch to 2.5.1 to fix unsloth_zoo AttributeError on torch._inductor.config
ff9091d
verified

YashashMathur committed on

Upload Dockerfile with huggingface_hub
ce8daf0
verified

YashashMathur committed on

Upload hf_training/train.py with huggingface_hub
c51cef7
verified

YashashMathur committed on

Upload Dockerfile with huggingface_hub
4981837
verified

YashashMathur committed on

Upload README.md with huggingface_hub
85a478c
verified

YashashMathur committed on

Upload folder using huggingface_hub
206f794
verified

YashashMathur committed on

Upload README.md with huggingface_hub
acb482d
verified

YashashMathur committed on

Upload Dockerfile with huggingface_hub
319ef7e
verified

YashashMathur committed on

Upload folder using huggingface_hub
e945ec1
verified

YashashMathur committed on

Upload Dockerfile with huggingface_hub
854a3f5
verified

YashashMathur committed on

Upload train.py with huggingface_hub
466a284
verified

YashashMathur committed on

Upload folder using huggingface_hub
51db3e8
verified

YashashMathur committed on