Audio, Speech & Music - a rocari Collection

rocari 's Collections

Image Generation

Audio, Speech & Music

Agents, Planning & Tools

Audio, Speech & Music

updated Jan 4, 2024

facebook/seamless-m4t-v2-large

Automatic Speech Recognition • 2B • Updated Jan 4, 2024 • 74.7k • 972
openai/whisper-large-v3

Automatic Speech Recognition • 2B • Updated Aug 12, 2024 • 4.82M • • 5.61k
jonatasgrosman/whisper-large-pt-cv11

Automatic Speech Recognition • Updated Dec 22, 2022 • 17 • 16
openai/whisper-large-v2

Automatic Speech Recognition • 2B • Updated Feb 29, 2024 • 73k • 1.79k
Incremental FastPitch: Chunk-based High Quality Text to Speech

Paper • 2401.01755 • Published Jan 3, 2024 • 10