wav2vec2-uk-dido-yvanchyk - Ukrainian ASR

This model is a fine-tuned version of facebook/wav2vec2-xls-r-300m on the Dido Yvanchyk Audio Dataset v2.

Model Description

The model was fine-tuned for 50 epochs to improve ASR quality for Ukrainian conversational speech and local dialects.

Usage

from transformers import pipeline

pipe = pipeline("automatic-speech-recognition", model="KSE-RESEARCH-Group/wav2vec2-uk-dido-yvanchyk")
print(pipe("sample.wav")["text"])
Downloads last month
4
Safetensors
Model size
0.3B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for KSE-RESEARCH-Group/wav2vec2-uk-dido-yvanchyk

Finetuned
(833)
this model