Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks
Paper • 1908.10084 • Published • 12
This model was finetuned with Unsloth.
This is a sentence-transformers model trained on the french triplet ds and french custom triplet ds datasets. It maps sentences & paragraphs to a 1024-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
SentenceTransformer(
(0): Transformer({'max_seq_length': 256, 'do_lower_case': False, 'architecture': 'PeftModelForFeatureExtraction'})
(1): Pooling({'word_embedding_dimension': 1024, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': True, 'include_prompt': True})
(2): Normalize()
)
First install the Sentence Transformers library:
pip install -U sentence-transformers
Then you can load this model and run inference.
from sentence_transformers import SentenceTransformer
# Download from the 🤗 Hub
model = SentenceTransformer("thomasavare/Qwen3-Embedding-0.6B-med-v0")
# Run inference
sentences = [
'Plan de soins post-opératoires après une chirurgie de kyste osseux anévrismaux',
'Les exercices de renforcement et de musculation de la hanche ont été commencés tôt, et à la cinquième semaine, on a commencé à marcher avec des béquilles, et quatre semaines plus tard, on a abandonné les béquilles et on a encouragé le patient à marcher de façon autonome.',
"Le patient a été conseillé de continuer à faire un suivi régulier auprès de son fournisseur de soins de santé primaires et de ses dentistes pour gérer les problèmes postopératoires et s'assurer qu'il n'y a pas de récidive de la maladie.",
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 1024]
# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities)
# tensor([[ 1.0000, 0.3342, 0.0209],
# [ 0.3342, 1.0000, -0.0792],
# [ 0.0209, -0.0792, 1.0000]])
anchor, positive, and negative| anchor | positive | negative | |
|---|---|---|---|
| type | string | string | string |
| details |
|
|
|
| anchor | positive | negative |
|---|---|---|
masse kystique sur l' utérus |
Le patient s'est présenté aux urgences avec une douleur abdominale sévère dans le quadrant inférieur gauche, qui a été diagnostiquée comme une masse kystique sur la paroi antérieure gauche de l'utérus. |
Symptômes: masse pelvienne à croissance rapide et taux sériques accrus de marqueurs tumoraux |
Plan de soins post-démarrage pour les patients atteints d' une BPAN |
La patiente sera suivie en consultation externe avec une surveillance étroite de sa nutrition et de ses habitudes de comportement pour s'assurer qu'elle ne revienne pas à ses comportements antérieurs. |
Le patient a été libéré le 30e jour d'hospitalisation avec de l'aspirine seule. |
Comment l'état du patient a- t- il réagi au traitement? |
Le patient a répondu positivement au traitement prescrit par la diéthylcarbamazine et aucun suivi n' est nécessaire. |
Les symptômes du patient se sont résolus après avoir reçu un traitement et subi des échocardiogrammes de suivi. |
MultipleNegativesRankingLoss with these parameters:{
"scale": 20.0,
"similarity_fct": "cos_sim",
"gather_across_devices": false
}
anchor, positive, and negative| anchor | positive | negative | |
|---|---|---|---|
| type | string | string | string |
| details |
|
|
|
| anchor | positive | negative |
|---|---|---|
Cholera |
Maladie infectieuse |
Mittelschmerz |
Choléra |
Cholera |
Astringents et détergents locaux |
Maladie infectieuse |
Choléra |
collision avec tout objet, fixe ou mobile ou en mouvement |
MultipleNegativesRankingLoss with these parameters:{
"scale": 20.0,
"similarity_fct": "cos_sim",
"gather_across_devices": false
}
per_device_train_batch_size: 512learning_rate: 2e-05warmup_steps: 0.05bf16: Trueproject: icd10-embeddingstrackio_space_id: thomasavare/icd10-embeddingswarmup_ratio: 0.05prompts: {'anchor': 'Instruct : Represent the disease in a standardized clinical concept\nQuery :', 'positive': 'Instruct : Represent the disease in a standardized clinical concept\nQuery :', 'negative': 'Instruct : Represent the disease in a standardized clinical concept\nQuery :'}batch_sampler: no_duplicatesper_device_train_batch_size: 512num_train_epochs: 3max_steps: -1learning_rate: 2e-05lr_scheduler_type: linearlr_scheduler_kwargs: Nonewarmup_steps: 0.05optim: adamw_torch_fusedoptim_args: Noneweight_decay: 0.0adam_beta1: 0.9adam_beta2: 0.999adam_epsilon: 1e-08optim_target_modules: Nonegradient_accumulation_steps: 1average_tokens_across_devices: Truemax_grad_norm: 1.0label_smoothing_factor: 0.0bf16: Truefp16: Falsebf16_full_eval: Falsefp16_full_eval: Falsetf32: Nonegradient_checkpointing: Falsegradient_checkpointing_kwargs: Nonetorch_compile: Falsetorch_compile_backend: Nonetorch_compile_mode: Noneuse_liger_kernel: Falseliger_kernel_config: Noneuse_cache: Falseneftune_noise_alpha: Nonetorch_empty_cache_steps: Noneauto_find_batch_size: Falselog_on_each_node: Truelogging_nan_inf_filter: Trueinclude_num_input_tokens_seen: nolog_level: passivelog_level_replica: warningdisable_tqdm: Falseproject: icd10-embeddingstrackio_space_id: thomasavare/icd10-embeddingseval_strategy: noper_device_eval_batch_size: 8prediction_loss_only: Trueeval_on_start: Falseeval_do_concat_batches: Trueeval_use_gather_object: Falseeval_accumulation_steps: Noneinclude_for_metrics: []batch_eval_metrics: Falsesave_only_model: Falsesave_on_each_node: Falseenable_jit_checkpoint: Falsepush_to_hub: Falsehub_private_repo: Nonehub_model_id: Nonehub_strategy: every_savehub_always_push: Falsehub_revision: Noneload_best_model_at_end: Falseignore_data_skip: Falserestore_callback_states_from_checkpoint: Falsefull_determinism: Falseseed: 42data_seed: Noneuse_cpu: Falseaccelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}parallelism_config: Nonedataloader_drop_last: Falsedataloader_num_workers: 0dataloader_pin_memory: Truedataloader_persistent_workers: Falsedataloader_prefetch_factor: Noneremove_unused_columns: Truelabel_names: Nonetrain_sampling_strategy: randomlength_column_name: lengthddp_find_unused_parameters: Noneddp_bucket_cap_mb: Noneddp_broadcast_buffers: Falseddp_backend: Noneddp_timeout: 1800fsdp: []fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}deepspeed: Nonedebug: []skip_memory_metrics: Truedo_predict: Falseresume_from_checkpoint: Nonewarmup_ratio: 0.05local_rank: -1prompts: {'anchor': 'Instruct : Represent the disease in a standardized clinical concept\nQuery :', 'positive': 'Instruct : Represent the disease in a standardized clinical concept\nQuery :', 'negative': 'Instruct : Represent the disease in a standardized clinical concept\nQuery :'}batch_sampler: no_duplicatesmulti_dataset_batch_sampler: proportionalrouter_mapping: {}learning_rate_mapping: {}| Epoch | Step | Training Loss |
|---|---|---|
| 0.1055 | 100 | 3.5800 |
| 0.2110 | 200 | 2.9560 |
| 0.3165 | 300 | 2.7041 |
| 0.4219 | 400 | 2.5284 |
| 0.5274 | 500 | 2.3684 |
| 0.6329 | 600 | 2.1859 |
| 0.7384 | 700 | 2.0552 |
| 0.8439 | 800 | 1.9008 |
| 0.9494 | 900 | 1.8054 |
| 1.0549 | 1000 | 1.6683 |
| 1.1603 | 1100 | 1.5583 |
| 1.2658 | 1200 | 1.4234 |
| 1.3713 | 1300 | 1.4087 |
| 1.4768 | 1400 | 1.3005 |
| 1.5823 | 1500 | 1.2523 |
| 1.6878 | 1600 | 1.1142 |
| 1.7932 | 1700 | 1.0612 |
| 1.8987 | 1800 | 0.9720 |
| 2.0042 | 1900 | 1.0359 |
| 2.1097 | 2000 | 0.9402 |
| 2.2152 | 2100 | 0.9496 |
| 2.3207 | 2200 | 0.8979 |
| 2.4262 | 2300 | 0.9448 |
| 2.5316 | 2400 | 0.8915 |
| 2.6371 | 2500 | 0.8848 |
| 2.7426 | 2600 | 0.8776 |
| 2.8481 | 2700 | 0.7764 |
| 2.9536 | 2800 | 0.7138 |
@inproceedings{reimers-2019-sentence-bert,
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
author = "Reimers, Nils and Gurevych, Iryna",
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
month = "11",
year = "2019",
publisher = "Association for Computational Linguistics",
url = "https://arxiv.org/abs/1908.10084",
}
@misc{henderson2017efficient,
title={Efficient Natural Language Response Suggestion for Smart Reply},
author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
year={2017},
eprint={1705.00652},
archivePrefix={arXiv},
primaryClass={cs.CL}
}