Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
frivasc
's Collections
Image to 3D
Speaker Diarization
Computer Vision
TTS - STT
Datasets
Robotics
LLM Models
Visual Transformers
TTS - STT
updated
Nov 20, 2025
Text to Speech and Speech to Text models
Upvote
-
SWivid/F5-TTS
Text-to-Speech
•
Updated
Mar 21, 2025
•
671k
•
1.16k
SWivid/E2-TTS
Text-to-Speech
•
Updated
Mar 12, 2025
•
111k
•
57
pyannote/segmentation-3.0
Voice Activity Detection
•
Updated
May 10, 2024
•
10.6M
•
913
maya-research/maya1
Text-to-Speech
•
Updated
Nov 12, 2025
•
24.9k
•
879
hexgrad/Kokoro-82M
Text-to-Speech
•
Updated
Apr 10, 2025
•
9.84M
•
•
6.02k
Upvote
-
Share collection
View history
Collection guide
Browse collections