l / README.md
Arabi32's picture
Rename README (9).md to README.md
d25b7ef verified
metadata
title: XTTS Voice Studio
emoji: 🎙️
colorFrom: yellow
colorTo: yellow
sdk: docker
app_port: 7860
pinned: false

XTTS v2 Voice Studio

A multilingual text-to-speech studio powered by Coqui XTTS v2, served via FastAPI (no Gradio).

Features

  • 16 supported languages (Arabic, English, French, …)
  • Voice cloning from uploaded audio samples
  • Persistent voice library
  • Generation history with playback & download
  • Fully custom React UI

Usage

  1. Upload a reference audio clip (WAV / MP3 / FLAC, ≥ 6 s recommended).
  2. Type your text and pick a language.
  3. Adjust advanced parameters if needed.
  4. Click ⚡ توليد الصوت and wait — CPU inference takes ~30–90 s per request.

Notes

  • Running on CPU; generation is slower than GPU but fully functional.
  • The XTTS v2 model (~1.8 GB) is downloaded on first startup and cached.
  • COQUI_TOS_AGREED=1 is set automatically — by using this Space you agree to the Coqui TTS terms.