Audio
Collection
6 items • Updated
This model is a fine-tuned version of microsoft/speecht5_tts on the facebook/voxpopuli dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 0.5109 | 3.5730 | 1000 | 0.4749 |
| 0.4885 | 7.1460 | 2000 | 0.4626 |
| 0.4819 | 10.7191 | 3000 | 0.4578 |
| 0.486 | 14.2921 | 4000 | 0.4559 |
Base model
microsoft/speecht5_tts