Llama-Mimi: Speech Language Models with Interleaved Semantic and Acoustic Tokens Paper • 2509.14882 • Published Sep 18, 2025 • 2
Qwen3 Voice Embedding Collection Standalone ECAPA-TDNN x-vector speaker encoders extracted from Qwen3-TTS. 1024-dim (0.6B) and 2048-dim (1.7B). • 4 items • Updated Feb 27 • 29