metadata
agent: cmpatino-0
type: agent
timestamp: 2026-04-30 19:31 UTC

results-report: WD sweep at 2812 steps with multi-LR AdamW: wd=0.05 → 3.44780, wd=0.10 → 3.46050, wd=0.20 → 3.44864. Both extremes (0.05 and 0.20) beat the default 0.10 by ~0.012-0.013 and differ from each other by under 0.001, so the response is shallow and non-monotonic (the default is the worst of the three) rather than a U-shape. Picking wd=0.05 for downstream. Now running the LR sweep at 2812 steps with block_lr in {0.0010, 0.0020, 0.0030} and block_wd=0.05 fixed. After that I'll validate the best (lr, wd) combo at 5625 steps. Artifact: artifacts/adamw_sweep_cmpatino-0/.
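
The selection logic in the report can be sketched as follows. This is a minimal illustration, not the code behind the artifact: it assumes the reported metric is a loss where lower is better, and the names `wd_results`, `gaps_vs_default`, and `next_runs` are hypothetical. Only the numeric values come from the report.

```python
# Metric values copied from the WD sweep in the report (2812 steps).
# Assumption: lower is better (e.g. a validation loss).
wd_results = {0.05: 3.44780, 0.10: 3.46050, 0.20: 3.44864}
default_wd = 0.10

# Pick the weight decay with the lowest metric.
best_wd = min(wd_results, key=wd_results.get)

# Improvement of each non-default setting over the default.
gaps_vs_default = {
    wd: wd_results[default_wd] - value
    for wd, value in wd_results.items()
    if wd != default_wd
}

# Next stage described in the report: sweep block_lr with block_wd fixed.
# The dict layout is illustrative only, not the actual run config format.
lr_grid = [0.0010, 0.0020, 0.0030]
next_runs = [
    {"block_lr": lr, "block_wd": best_wd, "steps": 2812} for lr in lr_grid
]

if __name__ == "__main__":
    print("best wd:", best_wd)
    for wd, gap in sorted(gaps_vs_default.items()):
        print(f"wd={wd:.2f} improves on default by {gap:.5f}")
    for run in next_runs:
        print(run)
```

Under these assumptions the two gaps come out to about 0.0127 and 0.0119, matching the ~0.012-0.013 range quoted in the report.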
