swecha-gonthuka-asr (ONNX)

This is an ONNX version of viswamaicoe/swecha-gonthuka-asr. It was automatically converted and uploaded using this Hugging Face Space.

Usage with Transformers.js

See the pipeline documentation for automatic-speech-recognition: https://huggingface.co/docs/transformers.js/api/pipelines#module_pipelines.AutomaticSpeechRecognitionPipeline


Swecha Gonthuka ASR (Telugu)

Telugu automatic speech recognition model (wav2vec2-based), trained on the Swecha Gonthuka dataset. It is evaluated on Telugu-only test sets with Character Error Rate (CER).

Model details

  • Training data: Swecha Gonthuka dataset
  • Language: Telugu (te)
  • Metric: CER (Character Error Rate) — text normalized to Telugu script + spaces before scoring.

Evaluation results

Dataset Test samples CER (%)
FLEURS (te_in) 304 6.32
OpenSLR66 420 9.00
Common Voice 22 (te) 58 11.92

Note: For evaluation we used only those samples that contain no English words-Telugu text only-for each dataset, to allow a fair evaluation of model capability.

Usage

Python (Transformers)

pip install transformers torch librosa
from transformers import pipeline

pipe = pipeline(
    "automatic-speech-recognition",
    model="viswamaicoe/swecha-gonthuka-asr",
    feature_extractor="viswamaicoe/swecha-gonthuka-asr",
)

# From file (16 kHz mono WAV preferred)
text = pipe("audio.wav")
print(text)  # {"text": "..."}

Responsible and ethical use

  • Intended use: This model is intended for Telugu automatic speech recognition in applications such as transcription, accessibility, and language preservation. Use it in accordance with applicable laws and platform policies.
  • Limitations: Performance may vary with accent, dialect, noise, and recording quality. Do not rely on it as the sole source for critical or legal transcriptions without human review.
  • Misuse: Do not use this model to transcribe private conversations without consent, to create misleading or harmful content, or for any purpose that violates privacy, consent, or local regulations.
  • Bias and fairness: As with any ASR system, outputs can reflect biases present in training data. Evaluate outputs in context and consider human review for high-stakes use cases.

Citation

If you use this model in your work, please cite the Swecha Gonthuka dataset and this model:

@misc{swecha-gonthuka-asr,
  title        = {Swecha Gonthuka ASR: Telugu Speech Recognition},
  author       = {Viswam AI COE},
  year         = {2025},
  howpublished = {\url{https://huggingface.co/viswamaicoe/swecha-gonthuka-asr}},
  note         = {Trained on Swecha Gonthuka dataset; wav2vec2-based Telugu ASR}
}
Downloads last month
45
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for therajasekhar/swecha-gonthuka-asr-ONNX