Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
MotorolaMobility 's Collections
Classifications
Small Text Models (SML)
Image Generation
ASR - Speech to Text
Multimodal

ASR - Speech to Text

updated Sep 5, 2024
Upvote
-

  • Paused
    Agents
    Featured
    2.76k

    Whisper

    πŸ“‰
    2.76k

    Transcribe audio files into text


  • qualcomm/Whisper-Small-En

    Other β€’ Updated Dec 16, 2025 β€’ 1.11k β€’ 6

    Note Size Parameters English-only Multilingual tiny 39 M βœ“ βœ“ base 74 M βœ“ βœ“ small 244 M βœ“ βœ“ medium 769 M βœ“ βœ“ large 1550 M x βœ“ large-v2 1550 M x βœ“


  • openai/whisper-medium

    Automatic Speech Recognition β€’ Updated Feb 29, 2024 β€’ 737k β€’ 283

  • openai/whisper-small

    Automatic Speech Recognition β€’ Updated Feb 29, 2024 β€’ 1.93M β€’ 549

  • aiola/whisper-medusa-v1

    Updated Aug 3, 2024 β€’ 39 β€’ 179

  • Robust Speech Recognition via Large-Scale Weak Supervision

    Paper β€’ 2212.04356 β€’ Published Dec 6, 2022 β€’ 53
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs