umd-vt-nyu (MLLMs)

jiuhai

updated a model 9 months ago

BLIP3o/BLIP3o-NEXT-SFT-3B

5B • Updated Aug 20, 2025 • 49

jiuhai

published a model 9 months ago

BLIP3o/BLIP3o-NEXT-SFT-3B

5B • Updated Aug 20, 2025 • 49

jiuhai

updated a model 9 months ago

umd-vt-nyu/blip3o-next

5B • Updated Aug 1, 2025 • 2

jiuhai

published a model 9 months ago

umd-vt-nyu/blip3o-next

5B • Updated Aug 1, 2025 • 2

jiuhai

updated a dataset 10 months ago

umd-vt-nyu/code

Updated Jun 22, 2025 • 2

jiuhai

published a dataset 10 months ago

umd-vt-nyu/code

Updated Jun 22, 2025 • 2

jiuhai

published a model 10 months ago

umd-vt-nyu/soda

Updated Jun 22, 2025

zhiyang1

authored a paper 10 months ago

LaTtE-Flow: Layerwise Timestep-Expert Flow-based Transformer

Paper • 2506.06952 • Published Jun 8, 2025 • 9

jiuhai

updated 3 models 11 months ago

umd-vt-nyu/JH_dc-vae-f32c32-sana-1.0-768_patch-1_epoch-64_group-7_fusion_residual_attn

Updated May 24, 2025

umd-vt-nyu/flow_siglip2_512_sana_512_1e4_64token_2ndlast_sstk_16

Updated May 22, 2025

umd-vt-nyu/JH_dc-vae-f32c32-sana-1.0-768_patch-1_baseline_fusion

Updated May 21, 2025

xcpan

authored a paper 11 months ago

Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis

Paper • 2505.10046 • Published May 15, 2025 • 9

jiuhai

authored a paper 11 months ago

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Paper • 2505.09568 • Published May 14, 2025 • 99

xcpan

authored 3 papers 11 months ago

PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop

Paper • 2503.09595 • Published Mar 12, 2025

Transfer between Modalities with MetaQueries

Paper • 2504.06256 • Published Apr 8, 2025 • 2

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Paper • 2505.09568 • Published May 14, 2025 • 99

jiuhai

updated 3 models 11 months ago

umd-vt-nyu/JH_multinode_dc-ae-f32c32-in-1.0-diffusers-768_patch-1_group-7_5e4_residual_attn

Updated May 14, 2025

umd-vt-nyu/JH_dc-vae-f32c32-sana-1.0-768_patch-1_epoch-64_group-7_fusion

Updated May 14, 2025

umd-vt-nyu/JH_dc-vae-f32c32-sana-1.0-768_patch-1_epoch-64_group-14_fusion_residual_attn

Updated May 13, 2025

jiuhai

updated a model 12 months ago

umd-vt-nyu/JH_dc-vae-f32c32-sana-1.0-768_patch-1_epoch-64_group-4_fusion_residual_attn

Updated May 10, 2025

AI & ML interests

Team members 3

umd-vt-nyu's activity