Ethio-ASR Logo

arXiv ๐Ÿ“– [ preprint ]

โš’๏ธ Model Description

Ethio-ASR is a suite of Automatic Speech Recognition (ASR) models for Ethiopian languages. This repo contains a monolingual Amharic ASR model based on wav2vec2โ€‘bert-2.0, fine-tuned on the Amharic subset of the WAXAL Speech Dataset.

  • Developed by: Ethio-ASR Team
  • Task: Speech Recognition (ASR)
  • Language: Amharic
  • License: CC-BY-4.0
  • Finetuned from: facebook/w2v-bert-2.0

๐Ÿ“ˆ Evaluation on WAXAL Test Set (Amharic)

๐Ÿ“Œ ASR model in this HF repo

Model # Params Amharic WER (โ†“)
Ethio-ASR (afrihubert) 94M 30.95
Ethio-ASR (mms-300) 300M 30.19
Ethio-ASR (mms-1b) 1B 26.14
Ethio-ASR (w2v-bert-2.0) 600M 22.92
Monolingual SFT (w2v-bert-2.0) ๐Ÿ“Œ 600M 22.37

๐ŸŽง Direct Use

from transformers import AutoModelForCTC, AutoProcessor
import torchaudio, torch

processor = AutoProcessor.from_pretrained("badrex/Ethio-ASR-amharic")
model = AutoModelForCTC.from_pretrained("badrex/Ethio-ASR-amharic")

audio, sr = torchaudio.load("audio.wav")
inputs = processor(audio.squeeze(), sampling_rate=sr, return_tensors="pt")

with torch.no_grad():
    logits = model(**inputs).logits

pred_ids = torch.argmax(logits, dim=-1)
transcription = processor.batch_decode(pred_ids)[0]

print(transcription)

๐Ÿ”ง Downstream Use

  • Voice assistants
  • Accessibility tools
  • Research baselines

๐Ÿšซ Outโ€‘ofโ€‘Scope Use

  • Languages other than Amharic
  • Highโ€‘stakes deployments without human review
  • Noisy audio without speech enhancement

โš ๏ธ Risks & Limitations

Performance might vary across dialects, genders, ages, and recording quality.

๐Ÿ“Œ Citation

@misc{ethio_asr_2026,
  author = {
    Abdullah, Badr M. and
    Azime, Israel Abebe and
    Tonja, Atnafu Lambebo and
    Alabi, Jesujoba O. and
    Alemu, Abel Mulat and
    Hagos, Eyob G. and
    Balcha, Bontu Fufa and
    Nerea, Mulubrhan A. and
    Yadeta, Debela Desalegn and
    Marilign, Dagnachew Mekonnen and
    Fentahun, Amanuel Temesgen and
    Kebede, Tadesse and
    Gebru, Israel D. and
    Woldeyohannis, Michael Melese and
    Sewunetie, Walelign Tewabe and
    Mรถbius, Bernd and
    Klakow, Dietrich
  },
  title = {Ethio-ASR: Joint Multilingual Speech Recognition and Language Identification for Ethiopian Languages},
  year = {2026},
  howpublished = {\url{https://huggingface.co/badrex/Ethio-ASR-multilingual-600M}}
}
Downloads last month
270
Safetensors
Model size
0.6B params
Tensor type
F32
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for badrex/Ethio-ASR-amharic

Finetuned
(464)
this model

Dataset used to train badrex/Ethio-ASR-amharic

Collection including badrex/Ethio-ASR-amharic

Paper for badrex/Ethio-ASR-amharic