Thomas's picture

1

Thomas

XThomasBU

·

AI & ML interests

None yet

Recent Activity

authored a paper 6 minutes ago

Progressive Prompt Detailing for Improved Alignment in Text-to-Image Generative Models

authored a paper 7 minutes ago

Some Modalities are More Equal Than Others: Decoding and Architecting Multimodal Integration in MLLMs

authored a paper 7 minutes ago

Generative Action Tell-Tales: Assessing Human Motion in Synthesized Videos

View all activity

Organizations

authored a paper 6 minutes ago

Progressive Prompt Detailing for Improved Alignment in Text-to-Image Generative Models

Paper • 2503.17794 • Published Mar 22, 2025

authored 2 papers 7 minutes ago

Some Modalities are More Equal Than Others: Decoding and Architecting Multimodal Integration in MLLMs

Paper • 2511.22826 • Published Nov 28, 2025 • 8

Generative Action Tell-Tales: Assessing Human Motion in Synthesized Videos

Paper • 2512.01803 • Published Dec 1, 2025 • 5

authored a paper 11 minutes ago

Semantic Richness or Geometric Reasoning? The Fragility of VLM's Visual Invariance

Paper • 2604.01848 • Published 10 days ago

authored a paper about 1 year ago

What's in a Latent? Leveraging Diffusion Latent Space for Domain Generalization

Paper • 2503.06698 • Published Mar 9, 2025 • 4

authored a paper over 1 year ago

$\textit{Revelio}$: Interpreting and leveraging semantic information in diffusion models

Paper • 2411.16725 • Published Nov 23, 2024 • 1