Robust Speech Recognition Event
Collection
The event ran from January 24 to February 7, 2022. Participants used the wav2vec2 model series to develop cutting-edge speech recognition models. • 14 items • Updated • 1
This model is a fine-tuned version of facebook/wav2vec2-xls-r-1b on the common_voice dataset. It achieves the following results on the evaluation set:
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer | Cer |
|---|---|---|---|---|---|
| 3.663 | 7.69 | 200 | 0.7898 | 0.6039 | 0.1848 |
| 0.7424 | 15.38 | 400 | 1.0215 | 0.5615 | 0.1924 |
| 0.4494 | 23.08 | 600 | 1.0901 | 0.5249 | 0.1932 |
| 0.5075 | 30.77 | 800 | 1.1013 | 0.5079 | 0.1935 |
| 0.4671 | 38.46 | 1000 | 1.1034 | 0.4916 | 0.1827 |
| 0.1928 | 46.15 | 1200 | 0.9550 | 0.4551 | 0.1643 |
Base model
facebook/wav2vec2-xls-r-1b