File size: 1,432 Bytes
4693bac | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 | ---
license: cc-by-nc-sa-4.0
language:
- bg
- cs
- da
- el
- es
- et
- fi
- hr
- hu
- it
- lt
- lv
- mt
- nl
- pl
- pt
- ro
- sk
- sl
- sv
---
# SpidR VP-20
SpidR VP-20 is a SpidR model pretrained pretrained on a subset of 6k hours and 20 languages of VoxPopuli
(all EU languages except English, French, and German)
for the [DiscoPhon benchmark](https://benchmarks.cognitive-ml.fr/discophon).
It was pretrained using the [`spidr`](https://github.com/facebookresearch/spidr) library.
You can load it with:
```python
from spidr.models import SpidR
from torch.hub import load_state_dict_from_url
state_dict = load_state_dict_from_url("https://huggingface.co/coml/spidr-vp20/resolve/main/final.pt")
model = SpidR().eval()
model.load_state_dict(state_dict)
```
## Files:
- `config.json`: Model configuration.
- `final.pt`: Model checkpoint.
- `full_checkpoint.pt`: Full checkpoint, with model, optimizer, etc.
## Citing
Please cite the DiscoPhon paper
```bibtex
@misc{poli2026discophon,
title={{DiscoPhon}: Benchmarking the Unsupervised Discovery of Phoneme Inventories With Discrete Speech Units},
author={Maxime Poli and Manel Khentout and Angelo Ortiz Tandazo and Ewan Dunbar and Emmanuel Chemla and Emmanuel Dupoux},
year={2026},
eprint={2603.18612},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2603.18612},
}
```
along with [SpidR](https://openreview.net/forum?id=E7XAFBpfZs).
|