LaTtE-Flow: Layerwise Timestep-Expert Flow-based Transformer Paper • 2506.06952 • Published Jun 8, 2025 • 9
umd-vt-nyu/JH_dc-vae-f32c32-sana-1.0-768_patch-1_epoch-64_group-7_fusion_residual_attn Updated May 24, 2025
Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis Paper • 2505.10046 • Published May 15, 2025 • 9
BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset Paper • 2505.09568 • Published May 14, 2025 • 99
PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop Paper • 2503.09595 • Published Mar 12, 2025
BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset Paper • 2505.09568 • Published May 14, 2025 • 99
umd-vt-nyu/JH_multinode_dc-ae-f32c32-in-1.0-diffusers-768_patch-1_group-7_5e4_residual_attn Updated May 14, 2025
umd-vt-nyu/JH_dc-vae-f32c32-sana-1.0-768_patch-1_epoch-64_group-14_fusion_residual_attn Updated May 13, 2025
umd-vt-nyu/JH_dc-vae-f32c32-sana-1.0-768_patch-1_epoch-64_group-4_fusion_residual_attn Updated May 10, 2025