-
Seedance 1.0: Exploring the Boundaries of Video Generation Models
Paper • 2506.09113 • Published • 108 -
Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion
Paper • 2506.08009 • Published • 30 -
Seeing Voices: Generating A-Roll Video from Audio with Mirage
Paper • 2506.08279 • Published • 27 -
PolyVivid: Vivid Multi-Subject Video Generation with Cross-Modal Interaction and Enhancement
Paper • 2506.07848 • Published • 4
Collections
Discover the best community collections!
Collections including paper arxiv:2506.08279
-
GaussianSpeech: Audio-Driven Gaussian Avatars
Paper • 2411.18675 • Published -
Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis
Paper • 2502.04128 • Published • 27 -
MOSPA: Human Motion Generation Driven by Spatial Audio
Paper • 2507.11949 • Published • 25 -
FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers
Paper • 2507.12956 • Published • 25
-
VividFace: A Diffusion-Based Hybrid Framework for High-Fidelity Video Face Swapping
Paper • 2412.11279 • Published • 13 -
MagicFace: High-Fidelity Facial Expression Editing with Action-Unit Control
Paper • 2501.02260 • Published • 5 -
GaussianAvatar-Editor: Photorealistic Animatable Gaussian Head Avatar Editor
Paper • 2501.09978 • Published • 6 -
FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation
Paper • 2502.13995 • Published • 9
-
Light-A-Video: Training-free Video Relighting via Progressive Light Fusion
Paper • 2502.08590 • Published • 42 -
Distillation Scaling Laws
Paper • 2502.08606 • Published • 47 -
Soundwave: Less is More for Speech-Text Alignment in LLMs
Paper • 2502.12900 • Published • 86 -
Alias-Free Latent Diffusion Models:Improving Fractional Shift Equivariance of Diffusion Latent Space
Paper • 2503.09419 • Published • 6
-
StyleMaster: Stylize Your Video with Artistic Generation and Translation
Paper • 2412.07744 • Published • 20 -
Video Motion Transfer with Diffusion Transformers
Paper • 2412.07776 • Published • 17 -
ObjCtrl-2.5D: Training-free Object Control with Camera Poses
Paper • 2412.07721 • Published • 9 -
MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance
Paper • 2412.05355 • Published • 8
-
DREAM-Talk: Diffusion-based Realistic Emotional Audio-driven Method for Single Image Talking Face Generation
Paper • 2312.13578 • Published • 29 -
Splatter Image: Ultra-Fast Single-View 3D Reconstruction
Paper • 2312.13150 • Published • 15 -
Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians
Paper • 2312.03029 • Published • 27 -
Relightable Gaussian Codec Avatars
Paper • 2312.03704 • Published • 32
-
Seedance 1.0: Exploring the Boundaries of Video Generation Models
Paper • 2506.09113 • Published • 108 -
Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion
Paper • 2506.08009 • Published • 30 -
Seeing Voices: Generating A-Roll Video from Audio with Mirage
Paper • 2506.08279 • Published • 27 -
PolyVivid: Vivid Multi-Subject Video Generation with Cross-Modal Interaction and Enhancement
Paper • 2506.07848 • Published • 4
-
Light-A-Video: Training-free Video Relighting via Progressive Light Fusion
Paper • 2502.08590 • Published • 42 -
Distillation Scaling Laws
Paper • 2502.08606 • Published • 47 -
Soundwave: Less is More for Speech-Text Alignment in LLMs
Paper • 2502.12900 • Published • 86 -
Alias-Free Latent Diffusion Models:Improving Fractional Shift Equivariance of Diffusion Latent Space
Paper • 2503.09419 • Published • 6
-
GaussianSpeech: Audio-Driven Gaussian Avatars
Paper • 2411.18675 • Published -
Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis
Paper • 2502.04128 • Published • 27 -
MOSPA: Human Motion Generation Driven by Spatial Audio
Paper • 2507.11949 • Published • 25 -
FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers
Paper • 2507.12956 • Published • 25
-
StyleMaster: Stylize Your Video with Artistic Generation and Translation
Paper • 2412.07744 • Published • 20 -
Video Motion Transfer with Diffusion Transformers
Paper • 2412.07776 • Published • 17 -
ObjCtrl-2.5D: Training-free Object Control with Camera Poses
Paper • 2412.07721 • Published • 9 -
MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance
Paper • 2412.05355 • Published • 8
-
VividFace: A Diffusion-Based Hybrid Framework for High-Fidelity Video Face Swapping
Paper • 2412.11279 • Published • 13 -
MagicFace: High-Fidelity Facial Expression Editing with Action-Unit Control
Paper • 2501.02260 • Published • 5 -
GaussianAvatar-Editor: Photorealistic Animatable Gaussian Head Avatar Editor
Paper • 2501.09978 • Published • 6 -
FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation
Paper • 2502.13995 • Published • 9
-
DREAM-Talk: Diffusion-based Realistic Emotional Audio-driven Method for Single Image Talking Face Generation
Paper • 2312.13578 • Published • 29 -
Splatter Image: Ultra-Fast Single-View 3D Reconstruction
Paper • 2312.13150 • Published • 15 -
Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians
Paper • 2312.03029 • Published • 27 -
Relightable Gaussian Codec Avatars
Paper • 2312.03704 • Published • 32