Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks
Paper • 1908.10084 • Published • 12
This is a sentence-transformers model fine-tuned from atasoglu/turkish-base-bert-uncased-mean-nli-stsb-tr. It is specifically optimized for semantic search and sentence similarity in formal Turkish legal documents such as regulations, directives, and academic institution bylaws.
atasoglu/turkish-base-bert-uncased-mean-nli-stsb-trThe model was trained on yusufbaykaloglu/turkish-university-mevzuat, a curated dataset of official Turkish university regulations. Sentence pairs were labeled with semantic similarity scores and sampled from structured legal sections.
It was specifically developed to:
| sentence_0 | sentence_1 | label |
|---|---|---|
| "Yönetim Kurulu görevleri..." | "Müdürün önerisi ve Rektörün onayı..." | 0.8 |
| "Merkez faaliyetleri..." | "Danışma Kurulu görevleri..." | 0.1 |
from sentence_transformers import SentenceTransformer
model = SentenceTransformer("nezahatkorkmaz/mevzuat-bert-turkish")
sentences = [
"(1) İhalesi yapılacak her iş için bir onay belgesi hazırlanır...",
"Yönetim Kurulunun görevleri"
]
embeddings = model.encode(sentences)
@inproceedings{reimers-2019-sentence-bert,
title={Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks},
author={Reimers, Nils and Gurevych, Iryna},
booktitle={EMNLP},
year={2019},
url={https://arxiv.org/abs/1908.10084}
}
This model inherits the license of the base model: CC BY-SA 4.0.
Developed by Nezahat Korkmaz.
Base model
ytu-ce-cosmos/turkish-base-bert-uncased