Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks
Paper • 1908.10084 • Published • 12
This model was finetuned with Unsloth.
This is a sentence-transformers model trained on the french triplet ds and french custom triplet ds datasets. It maps sentences & paragraphs to a 384-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
SentenceTransformer(
(0): Transformer({'max_seq_length': 256, 'do_lower_case': False, 'architecture': 'BertModel'})
(1): Pooling({'word_embedding_dimension': 384, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
(2): Normalize()
)
First install the Sentence Transformers library:
pip install -U sentence-transformers
Then you can load this model and run inference.
from sentence_transformers import SentenceTransformer
# Download from the 🤗 Hub
model = SentenceTransformer("thomasavare/all-MiniLM-L6-v2-med-v0")
# Run inference
sentences = [
'Plan de soins post-opératoires après une chirurgie de kyste osseux anévrismaux',
'Les exercices de renforcement et de musculation de la hanche ont été commencés tôt, et à la cinquième semaine, on a commencé à marcher avec des béquilles, et quatre semaines plus tard, on a abandonné les béquilles et on a encouragé le patient à marcher de façon autonome.',
"Le patient a été conseillé de continuer à faire un suivi régulier auprès de son fournisseur de soins de santé primaires et de ses dentistes pour gérer les problèmes postopératoires et s'assurer qu'il n'y a pas de récidive de la maladie.",
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 384]
# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities)
# tensor([[1.0000, 0.3373, 0.4268],
# [0.3373, 1.0000, 0.4083],
# [0.4268, 0.4083, 1.0000]])
anchor, positive, and negative| anchor | positive | negative | |
|---|---|---|---|
| type | string | string | string |
| details |
|
|
|
| anchor | positive | negative |
|---|---|---|
masse kystique sur l' utérus |
Le patient s'est présenté aux urgences avec une douleur abdominale sévère dans le quadrant inférieur gauche, qui a été diagnostiquée comme une masse kystique sur la paroi antérieure gauche de l'utérus. |
Symptômes: masse pelvienne à croissance rapide et taux sériques accrus de marqueurs tumoraux |
Plan de soins post-démarrage pour les patients atteints d' une BPAN |
La patiente sera suivie en consultation externe avec une surveillance étroite de sa nutrition et de ses habitudes de comportement pour s'assurer qu'elle ne revienne pas à ses comportements antérieurs. |
Le patient a été libéré le 30e jour d'hospitalisation avec de l'aspirine seule. |
Comment l'état du patient a- t- il réagi au traitement? |
Le patient a répondu positivement au traitement prescrit par la diéthylcarbamazine et aucun suivi n' est nécessaire. |
Les symptômes du patient se sont résolus après avoir reçu un traitement et subi des échocardiogrammes de suivi. |
MultipleNegativesRankingLoss with these parameters:{
"scale": 20.0,
"similarity_fct": "cos_sim",
"gather_across_devices": false
}
anchor, positive, and negative| anchor | positive | negative | |
|---|---|---|---|
| type | string | string | string |
| details |
|
|
|
| anchor | positive | negative |
|---|---|---|
Cholera |
Maladie infectieuse |
Mittelschmerz |
Choléra |
Cholera |
Astringents et détergents locaux |
Maladie infectieuse |
Choléra |
collision avec tout objet, fixe ou mobile ou en mouvement |
MultipleNegativesRankingLoss with these parameters:{
"scale": 20.0,
"similarity_fct": "cos_sim",
"gather_across_devices": false
}
per_device_train_batch_size: 512learning_rate: 2e-05warmup_steps: 0.05bf16: Trueproject: icd10-embeddingstrackio_space_id: thomasavare/icd10-embeddingswarmup_ratio: 0.05prompts: {'anchor': 'Instruct : Represent the disease in a standardized clinical concept\nQuery :', 'positive': 'Instruct : Represent the disease in a standardized clinical concept\nQuery :', 'negative': 'Instruct : Represent the disease in a standardized clinical concept\nQuery :'}batch_sampler: no_duplicatesper_device_train_batch_size: 512num_train_epochs: 3max_steps: -1learning_rate: 2e-05lr_scheduler_type: linearlr_scheduler_kwargs: Nonewarmup_steps: 0.05optim: adamw_torch_fusedoptim_args: Noneweight_decay: 0.0adam_beta1: 0.9adam_beta2: 0.999adam_epsilon: 1e-08optim_target_modules: Nonegradient_accumulation_steps: 1average_tokens_across_devices: Truemax_grad_norm: 1.0label_smoothing_factor: 0.0bf16: Truefp16: Falsebf16_full_eval: Falsefp16_full_eval: Falsetf32: Nonegradient_checkpointing: Falsegradient_checkpointing_kwargs: Nonetorch_compile: Falsetorch_compile_backend: Nonetorch_compile_mode: Noneuse_liger_kernel: Falseliger_kernel_config: Noneuse_cache: Falseneftune_noise_alpha: Nonetorch_empty_cache_steps: Noneauto_find_batch_size: Falselog_on_each_node: Truelogging_nan_inf_filter: Trueinclude_num_input_tokens_seen: nolog_level: passivelog_level_replica: warningdisable_tqdm: Falseproject: icd10-embeddingstrackio_space_id: thomasavare/icd10-embeddingseval_strategy: noper_device_eval_batch_size: 8prediction_loss_only: Trueeval_on_start: Falseeval_do_concat_batches: Trueeval_use_gather_object: Falseeval_accumulation_steps: Noneinclude_for_metrics: []batch_eval_metrics: Falsesave_only_model: Falsesave_on_each_node: Falseenable_jit_checkpoint: Falsepush_to_hub: Falsehub_private_repo: Nonehub_model_id: Nonehub_strategy: every_savehub_always_push: Falsehub_revision: Noneload_best_model_at_end: Falseignore_data_skip: Falserestore_callback_states_from_checkpoint: Falsefull_determinism: Falseseed: 42data_seed: Noneuse_cpu: Falseaccelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}parallelism_config: Nonedataloader_drop_last: Falsedataloader_num_workers: 0dataloader_pin_memory: Truedataloader_persistent_workers: Falsedataloader_prefetch_factor: Noneremove_unused_columns: Truelabel_names: Nonetrain_sampling_strategy: randomlength_column_name: lengthddp_find_unused_parameters: Noneddp_bucket_cap_mb: Noneddp_broadcast_buffers: Falseddp_backend: Noneddp_timeout: 1800fsdp: []fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}deepspeed: Nonedebug: []skip_memory_metrics: Truedo_predict: Falseresume_from_checkpoint: Nonewarmup_ratio: 0.05local_rank: -1prompts: {'anchor': 'Instruct : Represent the disease in a standardized clinical concept\nQuery :', 'positive': 'Instruct : Represent the disease in a standardized clinical concept\nQuery :', 'negative': 'Instruct : Represent the disease in a standardized clinical concept\nQuery :'}batch_sampler: no_duplicatesmulti_dataset_batch_sampler: proportionalrouter_mapping: {}learning_rate_mapping: {}| Epoch | Step | Training Loss |
|---|---|---|
| 0.1055 | 100 | 5.7088 |
| 0.2110 | 200 | 4.9171 |
| 0.3165 | 300 | 4.8061 |
| 0.4219 | 400 | 4.7313 |
| 0.5274 | 500 | 4.7004 |
| 0.6329 | 600 | 4.6518 |
| 0.7384 | 700 | 4.6307 |
| 0.8439 | 800 | 4.6992 |
| 0.9494 | 900 | 4.5778 |
| 1.0549 | 1000 | 4.4946 |
| 1.1603 | 1100 | 4.6055 |
| 1.2658 | 1200 | 4.5647 |
| 1.3713 | 1300 | 4.5245 |
| 1.4768 | 1400 | 4.5631 |
| 1.5823 | 1500 | 4.5186 |
| 1.6878 | 1600 | 4.5509 |
| 1.7932 | 1700 | 4.5756 |
| 1.8987 | 1800 | 4.6112 |
| 2.0042 | 1900 | 4.4410 |
| 2.1097 | 2000 | 4.6082 |
| 2.2152 | 2100 | 4.5329 |
| 2.3207 | 2200 | 4.5414 |
| 2.4262 | 2300 | 4.5330 |
| 2.5316 | 2400 | 4.5384 |
| 2.6371 | 2500 | 4.5075 |
| 2.7426 | 2600 | 4.5156 |
| 2.8481 | 2700 | 4.5750 |
| 2.9536 | 2800 | 4.6071 |
@inproceedings{reimers-2019-sentence-bert,
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
author = "Reimers, Nils and Gurevych, Iryna",
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
month = "11",
year = "2019",
publisher = "Association for Computational Linguistics",
url = "https://arxiv.org/abs/1908.10084",
}
@misc{henderson2017efficient,
title={Efficient Natural Language Response Suggestion for Smart Reply},
author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
year={2017},
eprint={1705.00652},
archivePrefix={arXiv},
primaryClass={cs.CL}
}