Supertone/supertonic-3

speech-synthesis


juheon2 committed 4 days ago

Commit feb60dc · 1 Parent(s): 724fb5a

Update Voice Builder links

Files changed (1)

README.md +4 -0

@@ -85,6 +85,10 @@ print(f"Generated {duration:.2f}s of audio")
 - **Higher speaker similarity**: improved similarity across the shared-language set compared with Supertonic 2.
 - **Expression tags**: supports simple tags such as `<laugh>`, `<breath>`, and `<sigh>`.
+## Custom Voices and Audio Samples
+The open-weight package includes fixed preset voice styles for immediate local inference. If you want to hear how Supertonic 3 performs with zero-shot custom voice styles, visit the [Audio Sample Demo](https://supertonic3.github.io/) to compare reference audio and generated speech across several use cases. To create your own Supertonic 3 voice-style JSON from reference audio, use [Supertonic Voice Builder](https://supertonic.supertone.ai/voice-builder); purchased Voice Builder styles include downloadable embeddings for both Supertonic 2 and Supertonic 3.
 ## Performance Highlights
 Supertonic 3 is designed for practical on-device inference: compact enough to run locally, while staying competitive with much larger open TTS systems.
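
The README context in this hunk mentions expression tags such as `<laugh>`, `<breath>`, and `<sigh>`. As a minimal sketch, a caller might pre-check input text for tags outside that list before synthesis. Everything here is an assumption for illustration: `KNOWN_TAGS`, `TAG_PATTERN`, and `find_unknown_tags` are hypothetical names and not part of the Supertonic package, and the README says "such as", so the model may accept tags beyond these three.

```python
import re
from typing import List

# Assumption: only the three tags named in the README excerpt are listed here;
# the model itself may accept others.
KNOWN_TAGS = {"laugh", "breath", "sigh"}

# Hypothetical pattern for simple lowercase tags like <laugh>;
# it is not taken from the Supertonic codebase.
TAG_PATTERN = re.compile(r"<([a-z_]+)>")


def find_unknown_tags(text: str) -> List[str]:
    """Return tag names found in `text` that are not in KNOWN_TAGS, in order of appearance."""
    return [name for name in TAG_PATTERN.findall(text) if name not in KNOWN_TAGS]


if __name__ == "__main__":
    sample = "Well <sigh> that was close. <laugh> Let's try again <cough>."
    print(find_unknown_tags(sample))  # prints ['cough']
```

A check like this only flags tags for review; whether an unlisted tag is rejected, ignored, or spoken literally depends on the model and is not stated in this diff.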