Ethio-ASR ๐ช๐น ๐ฌ
Collection
A suite of multilingual CTC-based ASR models for Ethiopian languages โข 10 items โข Updated
Ethio-ASR is a suite of Automatic Speech Recognition (ASR) models for Ethiopian languages. This repo contains a monolingual Amharic ASR model based on wav2vec2โbert-2.0, fine-tuned on the Amharic subset of the WAXAL Speech Dataset.
๐ ASR model in this HF repo
| Model | # Params | Amharic WER (โ) |
|---|---|---|
| Ethio-ASR (afrihubert) | 94M | 30.95 |
| Ethio-ASR (mms-300) | 300M | 30.19 |
| Ethio-ASR (mms-1b) | 1B | 26.14 |
| Ethio-ASR (w2v-bert-2.0) | 600M | 22.92 |
| Monolingual SFT (w2v-bert-2.0) ๐ | 600M | 22.37 |
from transformers import AutoModelForCTC, AutoProcessor
import torchaudio, torch
processor = AutoProcessor.from_pretrained("badrex/Ethio-ASR-amharic")
model = AutoModelForCTC.from_pretrained("badrex/Ethio-ASR-amharic")
audio, sr = torchaudio.load("audio.wav")
inputs = processor(audio.squeeze(), sampling_rate=sr, return_tensors="pt")
with torch.no_grad():
logits = model(**inputs).logits
pred_ids = torch.argmax(logits, dim=-1)
transcription = processor.batch_decode(pred_ids)[0]
print(transcription)
Performance might vary across dialects, genders, ages, and recording quality.
@misc{ethio_asr_2026,
author = {
Abdullah, Badr M. and
Azime, Israel Abebe and
Tonja, Atnafu Lambebo and
Alabi, Jesujoba O. and
Alemu, Abel Mulat and
Hagos, Eyob G. and
Balcha, Bontu Fufa and
Nerea, Mulubrhan A. and
Yadeta, Debela Desalegn and
Marilign, Dagnachew Mekonnen and
Fentahun, Amanuel Temesgen and
Kebede, Tadesse and
Gebru, Israel D. and
Woldeyohannis, Michael Melese and
Sewunetie, Walelign Tewabe and
Mรถbius, Bernd and
Klakow, Dietrich
},
title = {Ethio-ASR: Joint Multilingual Speech Recognition and Language Identification for Ethiopian Languages},
year = {2026},
howpublished = {\url{https://huggingface.co/badrex/Ethio-ASR-multilingual-600M}}
}
Base model
facebook/w2v-bert-2.0