JoyVASA: Portrait and Animal Image Animation with Diffusion-Based Audio-Driven Facial Dynamics and Head Motion Generation
Paper • 2411.09209 • Published Nov 14, 2024
F5-TTS
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Space • Running on Zero