-
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer
Paper • 2511.22699 • Published • 245 -
A Survey on Diffusion Language Models
Paper • 2508.10875 • Published • 34 -
Scalable Diffusion Models with Transformers
Paper • 2212.09748 • Published • 17 -
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis
Paper • 2403.03206 • Published • 71
Collections including paper arxiv:2302.05543
-
coqui/XTTS-v2
Text-to-Speech • Updated • 6.73M • 3.49k -
deepseek-ai/DeepSeek-V3-0324
Text Generation • 685B • Updated • 612k • 3.1k -
openai/whisper-large-v3
Automatic Speech Recognition • 2B • Updated • 4.86M • 5.6k -
Distilling an End-to-End Voice Assistant Without Instruction Training Data
Paper • 2410.02678 • Published • 24
-
Adding Conditional Control to Text-to-Image Diffusion Models
Paper • 2302.05543 • Published • 58 -
IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models
Paper • 2308.06721 • Published • 37 -
High-Resolution Image Synthesis with Latent Diffusion Models
Paper • 2112.10752 • Published • 17
-
High-Resolution Image Synthesis with Latent Diffusion Models
Paper • 2112.10752 • Published • 17 -
Adding Conditional Control to Text-to-Image Diffusion Models
Paper • 2302.05543 • Published • 58 -
Proximal Policy Optimization Algorithms
Paper • 1707.06347 • Published • 11 -
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Paper • 2305.18290 • Published • 64