v2 training script: paper-aligned recipe (bf16, lr=2.2e-4, wd=0.1, constant LR, beta2=0.95) ba04dd9 verified av-codes commited on 2 days ago
add hub push on every save + save every 4000 steps for crash recovery da6eea2 verified av-codes commited on 3 days ago