Telugu ASR
Collection
1 item β’ Updated
Telugu automatic speech recognition model (wav2vec2-based), trained on the Swecha Gonthuka dataset. It is evaluated on Telugu-only test sets with Character Error Rate (CER).
| Dataset | Test samples | CER (%) |
|---|---|---|
| FLEURS (te_in) | 304 | 6.32 |
| OpenSLR66 | 420 | 9.00 |
| Common Voice 22 (te) | 58 | 11.92 |
Note: For evaluation we used only those samples that contain no English words-Telugu text only-for each dataset, to allow a fair evaluation of model capability.
pip install transformers torch librosa
from transformers import pipeline
pipe = pipeline(
"automatic-speech-recognition",
model="viswamaicoe/swecha-gonthuka-asr",
feature_extractor="viswamaicoe/swecha-gonthuka-asr",
)
# From file (16 kHz mono WAV preferred)
text = pipe("audio.wav")
print(text) # {"text": "..."}
If you use this model in your work, please cite the Swecha Gonthuka dataset and this model:
@misc{swecha-gonthuka-asr,
title = {Swecha Gonthuka ASR: Telugu Speech Recognition},
author = {Viswam AI COE},
year = {2025},
howpublished = {\url{https://huggingface.co/viswamaicoe/swecha-gonthuka-asr}},
note = {Trained on Swecha Gonthuka dataset; wav2vec2-based Telugu ASR}
}