Co-Speech Gesture Video Generation (ICLR 2025 Oral)
Create images of a given character in different poses