F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching
Paper β’ 2410.06885 β’ Published β’ 47
F5-TTS finetune on all ami data and ithuan trv data, using ipa as input.
g2p from this repo.
please refer source repo
Base model
SWivid/F5-TTS