Uni-ViGU: Towards Unified Video Generation and Understanding via A Diffusion-Based Video Generator
Paper • 2604.08121 • Published • 42
SoulX-Singer: Towards High-Quality Zero-Shot Singing Voice Synthesis
SAC: Neural Speech Codec with Semantic-Acoustic Dual-Stream Quantization