Co-Speech Gesture Video Generation (ICLR 2025 Oral)
Audio-Driven Portrait Animations
Generate video from image