-
Granary: Speech Recognition and Translation Dataset in 25 European Languages
Paper • 2505.13404 • Published • 3 -
MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages
Paper • 2410.01036 • Published • 15 -
YODAS: Youtube-Oriented Dataset for Audio and Speech
Paper • 2406.00899 • Published • 4
Patrick Cho
pcat
·
AI & ML interests
None yet
Recent Activity
updated a collection 3 days ago
Dataset-TTS updated a collection 3 days ago
Dataset-TTS liked a model 3 days ago
kyutai/tts-voicesOrganizations
None yet