This is model is a finetune of the openai/whisper-small model using approximately 750 hours of general conversational audio from Part 3 of the National Speech Corpus. These are the final results on the evaluation set (~95 hours of audio):

  • Validation Loss: 0.386770
  • WER: 14.257934
Downloads last month
2
Safetensors
Model size
0.2B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Xycone/whisper-small-SGspeech-finetune

Finetuned
(3443)
this model