AudioPaLM: A Large Language Model That Can Speak and Listen Paper • 2306.12925 • Published Jun 22, 2023 • 56
distil-whisper/distil-large-v3 Automatic Speech Recognition • 0.8B • Updated Mar 6, 2025 • 1.2M • 375
distil-whisper/distil-large-v2 Automatic Speech Recognition • 0.8B • Updated Mar 6, 2025 • 8.67k • 514
distil-whisper/distil-medium.en Automatic Speech Recognition • 0.4B • Updated Mar 25, 2024 • 8.9k • 127
distil-whisper/distil-small.en Automatic Speech Recognition • 0.2B • Updated Mar 25, 2024 • 7.37k • 112
Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling Paper • 2311.00430 • Published Nov 1, 2023 • 56
nvidia/diar_sortformer_4spk-v1 Automatic Speech Recognition • 0.1B • Updated Dec 15, 2025 • 5.37k • 137
nvidia/stt_ar_fastconformer_hybrid_large_pcd_v1.0 Automatic Speech Recognition • Updated Oct 21, 2025 • 83.9k • 36
nvidia/parakeet-tdt-0.6b-v3 Automatic Speech Recognition • 0.6B • Updated about 3 hours ago • 346k • 783
Running on CPU Upgrade Agents Featured 1.31k Open ASR Leaderboard 🏆 1.31k Explore speech recognition model benchmarks and rankings