Text-to-images
Training-Free Consistent Text-to-Image Generation
Paper
• 2402.03286
• Published • 67
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation
Paper
• 2402.04324
• Published • 26
λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space
Paper
• 2402.05195
• Published • 19
FiT: Flexible Vision Transformer for Diffusion Model
Paper
• 2402.12376
• Published • 48
DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation
Paper
• 2402.11929
• Published • 11
Paper
• 2402.13144
• Published • 100
OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
Paper
• 2403.01779
• Published • 30
StableDrag: Stable Dragging for Point-based Image Editing
Paper
• 2403.04437
• Published • 27
FlashFace: Human Image Personalization with High-fidelity Identity Preservation
Paper
• 2403.17008
• Published • 22
Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis
Paper
• 2404.13686
• Published • 29
Align Your Steps: Optimizing Sampling Schedules in Diffusion Models
Paper
• 2404.14507
• Published • 23
Editable Image Elements for Controllable Synthesis
Paper
• 2404.16029
• Published • 12
StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation
Paper
• 2405.01434
• Published • 56
Customizing Text-to-Image Models with a Single Image Pair
Paper
• 2405.01536
• Published • 22
Stylus: Automatic Adapter Selection for Diffusion Models
Paper
• 2404.18928
• Published • 15
DressCode: Autoregressively Sewing and Generating Garments from Text Guidance
Paper
• 2401.16465
• Published • 12
InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation
Paper
• 2404.19427
• Published • 74
MotionLCM: Real-time Controllable Motion Generation via Latent Consistency Model
Paper
• 2404.19759
• Published • 27
Paint by Inpaint: Learning to Add Image Objects by Removing Them First
Paper
• 2404.18212
• Published • 30
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Paper
• 2405.08748
• Published • 23
Compositional Text-to-Image Generation with Dense Blob Representations
Paper
• 2405.08246
• Published • 17
CAT3D: Create Anything in 3D with Multi-View Diffusion Models
Paper
• 2405.10314
• Published • 47
BitsFusion: 1.99 bits Weight Quantization of Diffusion Model
Paper
• 2406.04333
• Published • 38
Step-aware Preference Optimization: Aligning Preference with Denoising Performance at Each Step
Paper
• 2406.04314
• Published • 30
Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation
Paper
• 2406.06525
• Published • 71