---
license: cc-by-nc-sa-4.0
language:
- bg
- cs
- da
- el
- es
- et
- fi
- hr
- hu
- it
- lt
- lv
- mt
- nl
- pl
- pt
- ro
- sk
- sl
- sv
---

# SpidR VP-20

SpidR VP-20 is a SpidR model pretrained on a 6k-hour subset of VoxPopuli covering 20 languages (all EU languages except English, French, and German) for the [DiscoPhon benchmark](https://benchmarks.cognitive-ml.fr/discophon).

It was pretrained using the [`spidr`](https://github.com/facebookresearch/spidr) library. You can load it with:

```python
from spidr.models import SpidR
from torch.hub import load_state_dict_from_url

# Download (and cache) the pretrained weights from the Hugging Face Hub.
state_dict = load_state_dict_from_url("https://huggingface.co/coml/spidr-vp20/resolve/main/final.pt")
# Build the model in evaluation mode, then load the weights into it.
model = SpidR().eval()
model.load_state_dict(state_dict)
```

## Files:

- `config.json`: Model configuration.
- `final.pt`: Model checkpoint.
- `full_checkpoint.pt`: Full checkpoint, with model, optimizer, etc.

## Citing

Please cite the DiscoPhon paper

```bibtex
@misc{poli2026discophon,
  title={{DiscoPhon}: Benchmarking the Unsupervised Discovery of Phoneme Inventories With Discrete Speech Units},
  author={Maxime Poli and Manel Khentout and Angelo Ortiz Tandazo and Ewan Dunbar and Emmanuel Chemla and Emmanuel Dupoux},
  year={2026},
  eprint={2603.18612},
  archivePrefix={arXiv},
  primaryClass={cs.CL},
  url={https://arxiv.org/abs/2603.18612},
}
```

along with [SpidR](https://openreview.net/forum?id=E7XAFBpfZs).