rngzhi/cs3264-project
Viewer • Updated • 3.54k • 23
How to use rngzhi/cs3264-project with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="rngzhi/cs3264-project") # Load model directly
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq
processor = AutoProcessor.from_pretrained("rngzhi/cs3264-project")
model = AutoModelForSpeechSeq2Seq.from_pretrained("rngzhi/cs3264-project")This model is a fine-tuned version of openai/whisper-small on the National Speech Corpus(partial) dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer |
|---|---|---|---|---|
| 0.0068 | 5.01 | 500 | 0.1508 | 5.4137 |
| 0.001 | 11.01 | 1000 | 0.1691 | 5.0832 |
| 0.0003 | 16.02 | 1500 | 0.1769 | 5.1060 |
| 0.0006 | 22.01 | 2000 | 0.1840 | 5.0946 |
| 0.0005 | 28.0 | 2500 | 0.1891 | 5.1174 |
| 0.0003 | 33.02 | 3000 | 0.1933 | 5.2086 |
| 0.0005 | 39.01 | 3500 | 0.1962 | 5.2997 |
| 0.0002 | 45.0 | 4000 | 0.1991 | 5.3339 |
| 0.0002 | 50.02 | 4500 | 0.2010 | 5.3681 |
| 0.0003 | 56.01 | 5000 | 0.2020 | 5.3795 |
Base model
openai/whisper-small