๐ Model Description
This is a Automatic Speech Recognition (ASR) model for Amharic, one of the official languages of Ethiopia. It is fineโtuned from Wav2Vec2โBERT 2.0 using the Ethio speech corpus.
- Developed by: Badr al-Absi
- Model type: Speech Recognition (ASR)
- Languages: Amharic
- License: CC-BY-4.0
- Finetuned from: facebook/w2v-bert-2.0
๐ง Direct Use
from transformers import Wav2Vec2BertProcessor, Wav2Vec2BertForCTC
import torchaudio, torch
processor = Wav2Vec2BertProcessor.from_pretrained("badrex/w2v-bert-2.0-amharic-asr")
model = Wav2Vec2BertForCTC.from_pretrained("badrex/w2v-bert-2.0-amharic-asr")
audio, sr = torchaudio.load("audio.wav")
inputs = processor(audio.squeeze(), sampling_rate=sr, return_tensors="pt")
with torch.no_grad():
logits = model(**inputs).logits
pred_ids = torch.argmax(logits, dim=-1)
transcription = processor.batch_decode(pred_ids)[0]
print(transcription)
๐ง Downstream Use
- Voice assistants
- Accessibility tools
- Research baselines
๐ซ OutโofโScope Use
- Other languages besides Amharic
- Highโstakes deployments without human review
- Noisy audio without further tuning
โ ๏ธ Risks & Limitations
Performance varies with accents, dialects, and recording quality.
๐ Citation
@misc{w2v_bert_ethiopian_asr,
author = {Badr M. Abdullah},
title = {Fine-tuning Wav2Vec2-BERT 2.0 for Ethiopian ASR},
year = {2025},
url = {https://huggingface.co/badrex/w2v-bert-2.0-amharic-asr}
}
- Downloads last month
- 11
Model tree for hadamard-2/w2v-bert-2.0-amharic-asr
Base model
facebook/w2v-bert-2.0