Speech to Speech - a alshell7 Collection

alshell7 's Collections

Speech to Speech

Small/Tiny Models

Speech to Speech

updated Feb 7

Qwen/Qwen2.5-Omni-3B

Any-to-Any • Updated Apr 30, 2025 • 453k • 332
Running on CPU Upgrade

Featured

1.31k

Open ASR Leaderboard

🏆

1.31k

Explore speech recognition model benchmarks and rankings
fishaudio/s1-mini

Text-to-Speech • Updated Feb 6 • 4.9k • 640
fluxions/vui

Text-to-Speech • Updated Jun 17, 2025 • 533 • 147
OpenMOSS-Team/MOSS-TTSD-v0

Text-to-Speech • 2B • Updated Jun 20, 2025 • 6 • 28
nvidia/audio-flamingo-3

Audio-Text-to-Text • Updated Nov 28, 2025 • 306 • 145
bosonai/higgs-audio-v2-generation-3B-base

Text-to-Speech • 6B • Updated 10 days ago • 507k • 668
Vyvo/VyvoTTS-v0-Qwen3-0.6B

Text-to-Speech • 0.8B • Updated Aug 9, 2025 • 360 • 25
nvidia/canary-1b-v2

Automatic Speech Recognition • Updated Dec 3, 2025 • 165k • 373
nvidia/diar_streaming_sortformer_4spk-v2

Automatic Speech Recognition • Updated Dec 31, 2025 • 23.2k • 113
microsoft/VibeVoice-1.5B

Text-to-Speech • 3B • Updated Jan 22 • 103k • 2.32k
stepfun-ai/Step-Audio-2-mini

Any-to-Any • Updated Feb 14 • 2.31k • 254
FireRedTeam/FireRedTTS2

Updated Sep 17, 2025 • 66
ThomasG/faster-whisper-large-v3-turbo-int8-fp16

Automatic Speech Recognition • Updated Feb 7 • 20