Inference speed
#10
by banank1989 - opened
Can it convert a 10-15 word sentence within 1 or 2 seconds? I tried, but it seemed too slow (10-15 seconds).
@banank1989 did you find any solution for speeding up the TTS? I am using a G5 instance and it takes around 5-6 seconds for 10 tokens. Please suggest anything you found for real-time TTS.
You can run the model with flash attention in streaming mode.
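A minimal sketch of that suggestion, assuming the `parler_tts` package, the public `parler-tts/parler-tts-mini-v1` checkpoint, and a GPU with `flash-attn` installed; `chunk_text` is a hypothetical helper for producing audio in short pieces so playback can start early:

```python
def chunk_text(text, max_words=15):
    # Hypothetical helper: split long input into short chunks so each
    # generate() call is fast and audio can be played incrementally,
    # instead of waiting for one long generation to finish.
    words = text.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]

def load_model(device="cuda"):
    # Imports kept local so chunk_text works without torch installed.
    import torch
    from parler_tts import ParlerTTSForConditionalGeneration

    # fp16 plus flash attention 2 is the main latency lever suggested in
    # the Parler-TTS inference guide; requires an Ampere-or-newer GPU.
    return ParlerTTSForConditionalGeneration.from_pretrained(
        "parler-tts/parler-tts-mini-v1",
        torch_dtype=torch.float16,
        attn_implementation="flash_attention_2",
    ).to(device)
```

You would then call `model.generate(...)` once per chunk and play each waveform as it arrives; the library also ships a streamer class for token-level streaming, documented in its INFERENCE.md.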
How do I do that? I have tried all the optimization methods they suggested. Is there any other way?
I tried applying https://github.com/huggingface/parler-tts/blob/main/INFERENCE.md to this model and it is not working at all.
Could you please point out where we are going wrong?