Czech Semantic Embeddings (GGUF)

GGUF conversions of Czech semantic embedding models from Seznam, prepared for llama.cpp embedding inference.

Included Models

  • Seznam__retromae-small-cs.f16.gguf
  • Seznam__retromae-small-cs.q8_0.gguf
  • Seznam__dist-mpnet-paracrawl-cs-en.f16.gguf
  • Seznam__dist-mpnet-paracrawl-cs-en.q8_0.gguf
  • Seznam__simcse-retromae-small-cs.f16.gguf
  • Seznam__simcse-retromae-small-cs.q8_0.gguf
  • Seznam__simcse-dist-mpnet-paracrawl-cs-en.f16.gguf
  • Seznam__simcse-dist-mpnet-paracrawl-cs-en.q8_0.gguf

Upstream Sources

Citation

If you use these models, please cite the original Seznam paper:

@inproceedings{bednavr2024some,
  title={Some Like It Small: Czech Semantic Embedding Models for Industry Applications},
  author={Bedn{\'a}{\v{r}}, Ji{\v{r}}{\'\i} and N{\'a}plava, Jakub and Baran{\v{c}}{\'\i}kov{\'a}, Petra and Lisick{\`y}, Ond{\v{r}}ej},
  booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
  volume={38},
  number={21},
  pages={22734--22742},
  year={2024}
}

Usage (llama.cpp)

Run embedding server:

llama-server -m Seznam__retromae-small-cs.q8_0.gguf --embedding --pooling cls

OpenAI-compatible embeddings endpoint:

curl http://127.0.0.1:8080/v1/embeddings \
  -H "Content-Type: application/json" \
  -d '{"input":["Ahoj svete","Semantic search test"]}'

File Integrity

SHA256 checksums are in checksums.txt.

License

This repo contains converted checkpoints from upstream models. Respect original model licenses and terms:

  • retromae-small-cs: CC-BY-4.0
  • dist-mpnet-paracrawl-cs-en: CC-BY-4.0
  • simcse-retromae-small-cs: CC-BY-4.0
  • simcse-dist-mpnet-paracrawl-cs-en: CC-BY-4.0

Attribution to Seznam and original model cards is required.

Downloads last month
17
GGUF
Model size
24.3M params
Architecture
bert
Hardware compatibility
Log In to add your hardware

8-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Paper for Veol-CZ/czech-semantic-embeddings-c