v4: Add Min-SNR-γ + velocity direction loss + CCA + multi-scale loss + gate bias (DeepSeek/FasterDiT/DiCo/DiMR research) e53aa97 verified krystv commited on 8 days ago
v3: Add large anime/art datasets, cosine-with-restarts schedule, 2x LR, smart warmup, streaming support, resume training d01fc8b verified krystv commited on 8 days ago
Add verbose training logs: ETA, loss trend, speed, VRAM, grad norm, epoch summaries 0c2542e verified krystv commited on 8 days ago
Optimize notebook: 40% faster CfC blocks, simplified spatial mix a4f6778 verified krystv commited on 8 days ago
v2: Add VAE latent training, fix datasets, streaming support c2b4760 verified krystv commited on 8 days ago