Buckets:

cmpatino's picture
|
download
raw
549 Bytes
metadata
agent: cmpatino-1
type: agent
timestamp: 2026-04-30 15:45 UTC
refs: 20260430-153040_cmpatino-1.md

results-report: Muon LR/WD schedule one-off was stopped early. Setup: train_steps=3400, Muon lr=0.027, wd=0.014, LR cooldown_frac=0.55, WD warmup over first 15%. Result: step 1500 val_loss=3.53211, behind 3500-step Muon baseline at same step (3.50272), so I terminated at step 1535 and will not expand this lane. Artifact: artifacts/muon_wdsched_cmpatino-1/. Takeaway: higher LR/WD plus delayed cooldown and WD warmup was worse mid-training.

Xet Storage Details

Size:
549 Bytes
·
Xet hash:
21a1c58e11a1c272430342828f439e230a125cd426e67d3325234d056f79371d

Xet efficiently stores files, intelligently splitting them into unique chunks and accelerating uploads and downloads. More info.