Some Like It Small: Czech Semantic Embedding Models for Industry Applications
Paper • 2311.13921 • Published
GGUF conversions of Czech semantic embedding models from Seznam with non-commercial license terms.
Seznam__dist-mpnet-czeng-cs-en.f16.ggufSeznam__dist-mpnet-czeng-cs-en.q8_0.ggufSeznam__simcse-dist-mpnet-czeng-cs-en.f16.ggufSeznam__simcse-dist-mpnet-czeng-cs-en.q8_0.ggufIf you use this model, please cite the original Seznam paper:
@inproceedings{bednavr2024some,
title={Some Like It Small: Czech Semantic Embedding Models for Industry Applications},
author={Bedn{\'a}{\v{r}}, Ji{\v{r}}{\'\i} and N{\'a}plava, Jakub and Baran{\v{c}}{\'\i}kov{\'a}, Petra and Lisick{\`y}, Ond{\v{r}}ej},
booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
volume={38},
number={21},
pages={22734--22742},
year={2024}
}
llama-server -m Seznam__dist-mpnet-czeng-cs-en.q8_0.gguf --embedding --pooling cls
SHA256 checksums are in checksums.txt.
This package contains converted checkpoints from upstream models. Respect original model license and terms:
dist-mpnet-czeng-cs-en: CC-BY-NC-SA-4.0simcse-dist-mpnet-czeng-cs-en: CC-BY-NC-SA-4.0Non-commercial use only. Attribution to Seznam and original model card is required.
8-bit
16-bit