--- title: XTTS Voice Studio emoji: 🎙️ colorFrom: yellow colorTo: yellow sdk: docker app_port: 7860 pinned: false --- # XTTS v2 Voice Studio A multilingual text-to-speech studio powered by **Coqui XTTS v2**, served via **FastAPI** (no Gradio). ## Features - 16 supported languages (Arabic, English, French, …) - Voice cloning from uploaded audio samples - Persistent voice library - Generation history with playback & download - Fully custom React UI ## Usage 1. Upload a reference audio clip (WAV / MP3 / FLAC, ≥ 6 s recommended). 2. Type your text and pick a language. 3. Adjust advanced parameters if needed. 4. Click **⚡ توليد الصوت** and wait — CPU inference takes ~30–90 s per request. ## Notes - Running on **CPU**; generation is slower than GPU but fully functional. - The XTTS v2 model (~1.8 GB) is downloaded on first startup and cached. - `COQUI_TOS_AGREED=1` is set automatically — by using this Space you agree to the [Coqui TTS terms](https://coqui.ai/cpml).