Some Like It Small: Czech Semantic Embedding Models for Industry Applications
Paper • 2311.13921 • Published
GGUF conversions of Czech semantic embedding models from Seznam, prepared for llama.cpp embedding inference.
Seznam__retromae-small-cs.f16.gguf
Seznam__retromae-small-cs.q8_0.gguf
Seznam__dist-mpnet-paracrawl-cs-en.f16.gguf
Seznam__dist-mpnet-paracrawl-cs-en.q8_0.gguf
Seznam__simcse-retromae-small-cs.f16.gguf
Seznam__simcse-retromae-small-cs.q8_0.gguf
Seznam__simcse-dist-mpnet-paracrawl-cs-en.f16.gguf
Seznam__simcse-dist-mpnet-paracrawl-cs-en.q8_0.gguf

If you use these models, please cite the original Seznam paper:
@inproceedings{bednavr2024some,
title={Some Like It Small: Czech Semantic Embedding Models for Industry Applications},
author={Bedn{\'a}{\v{r}}, Ji{\v{r}}{\'\i} and N{\'a}plava, Jakub and Baran{\v{c}}{\'\i}kov{\'a}, Petra and Lisick{\`y}, Ond{\v{r}}ej},
booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
volume={38},
number={21},
pages={22734--22742},
year={2024}
}
Run the embedding server:
llama-server -m Seznam__retromae-small-cs.q8_0.gguf --embedding --pooling cls
Query the OpenAI-compatible embeddings endpoint:
curl http://127.0.0.1:8080/v1/embeddings \
-H "Content-Type: application/json" \
-d '{"input":["Ahoj svete","Semantic search test"]}'
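The same request can be sent from Python. The following is a minimal sketch using only the standard library, not an official client. It assumes the server is reachable at the address used in the curl example and that the response follows the OpenAI embeddings layout (a `data` list whose items carry `index` and `embedding`). The helper names `build_request`, `parse_embeddings`, `cosine_similarity`, and `embed` are hypothetical.

```python
import json
import math
import urllib.request

# Assumption: llama-server listens on the same address as in the curl example.
ENDPOINT = "http://127.0.0.1:8080/v1/embeddings"


def build_request(texts, endpoint=ENDPOINT):
    """Build a POST request carrying the same JSON body as the curl example."""
    body = json.dumps({"input": list(texts)}).encode("utf-8")
    return urllib.request.Request(
        endpoint,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def parse_embeddings(payload):
    """Extract vectors from an OpenAI-style response, ordered by 'index'.

    Assumption: payload looks like {"data": [{"index": 0, "embedding": [...]}, ...]}.
    """
    items = sorted(payload["data"], key=lambda item: item["index"])
    return [item["embedding"] for item in items]


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def embed(texts, endpoint=ENDPOINT):
    """Send texts to the running server and return one vector per input."""
    with urllib.request.urlopen(build_request(texts, endpoint)) as response:
        return parse_embeddings(json.load(response))
```

With the server running, `embed(["Ahoj svete", "Semantic search test"])` should return two vectors, and `cosine_similarity` can then be used to compare them.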
SHA256 checksums are in checksums.txt.
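Downloaded files can be checked against that list before use. The sketch below assumes checksums.txt uses the `sha256sum` line format (`<hex digest>  <filename>`), which this README does not confirm, and that it sits in the same directory as the GGUF files. The helper names `sha256_of`, `parse_checksums`, and `verify` are hypothetical.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size=1 << 20):
    """Hash a file in chunks so large GGUF files are not loaded into memory at once."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def parse_checksums(text):
    """Map filename -> expected digest.

    Assumption: each line is '<hex digest>  <filename>', as written by sha256sum.
    """
    expected = {}
    for line in text.splitlines():
        parts = line.split(maxsplit=1)
        if len(parts) == 2:
            digest, name = parts
            # sha256sum prefixes the name with '*' in binary mode.
            expected[name.strip().lstrip("*")] = digest.lower()
    return expected


def verify(directory, checksums_name="checksums.txt"):
    """Return {filename: True/False} for every entry in the checksum file."""
    directory = Path(directory)
    expected = parse_checksums((directory / checksums_name).read_text())
    results = {}
    for name, digest in expected.items():
        target = directory / name
        # Missing files count as failures rather than raising.
        results[name] = target.is_file() and sha256_of(target) == digest
    return results
```

Calling `verify(".")` in the download directory returns a mapping from each listed file to whether its digest matched.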
This repo contains converted checkpoints from upstream models. Respect original model licenses and terms:
retromae-small-cs: CC-BY-4.0
dist-mpnet-paracrawl-cs-en: CC-BY-4.0
simcse-retromae-small-cs: CC-BY-4.0
simcse-dist-mpnet-paracrawl-cs-en: CC-BY-4.0

Attribution to Seznam and original model cards is required.