Bodo Whisper ASR (v1)

This repository contains a fine-tuned Whisper Automatic Speech Recognition (ASR) model for the Bodo language (ISO 639-3: brx).

The model is trained to convert Bodo speech audio into text, supporting low-resource language research and applications.


πŸ” Model Details

  • Model name: bodo-whisper-asr-v1
  • Base model: OpenAI Whisper
  • Task: Automatic Speech Recognition (ASR)
  • Language: Bodo (brx)
  • Sampling rate: 16 kHz
  • Framework: PyTorch
  • Model format: SafeTensors

πŸ“Š Training Information

  • Fine-tuned on a custom Bodo speech dataset
  • Audio normalized to 16 kHz mono
  • Trained using Hugging Face Transformers
  • Optimized for short and medium-length utterances

(Dataset details can be added later if you decide to make them public.)


πŸš€ Usage

Using Hugging Face Transformers

from transformers import pipeline

asr = pipeline(
    "automatic-speech-recognition",
    model="COCO1033/bodo-whisper-asr-v1"
)

result = asr("audio.wav")
print(result["text"])
Downloads last month
18
Safetensors
Model size
0.2B params
Tensor type
F32
Β·
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support