v0.3: Fix training hang — precompute tensors, num_workers=0, torch.compile, warmup batch, timing diagnostics 408dd91 verified asdf98 commited on 11 days ago
FIX: Use vectorized span masking in notebook (no Python loop over batch)" 2bf4fa5 verified asdf98 commited on 11 days ago