Update custom_st.py

#11

by mohamed99akram - opened 12 days ago

base: refs/heads/main ← from: refs/pr/11


mohamed99akram

12 days ago

trust_remote_code gets passed twice when using:

model = SentenceTransformer(
    "jinaai/jina-embeddings-v5-text-nano",
    trust_remote_code=True
)

This raises an error.
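For readers unfamiliar with this failure mode, here is a minimal sketch of how a keyword argument can end up being passed twice. It does not reproduce the actual contents of custom_st.py; `from_pretrained`, `load_buggy`, and `load_fixed` are hypothetical stand-ins used only for illustration, and the guard shown is one possible approach rather than necessarily the change made in this PR.

```python
def from_pretrained(name, trust_remote_code=False, **kwargs):
    """Hypothetical stand-in for a loader that accepts trust_remote_code."""
    return {"name": name, "trust_remote_code": trust_remote_code, **kwargs}


def load_buggy(name, **tokenizer_kwargs):
    # Passes the flag explicitly AND forwards the caller's kwargs unchanged.
    # If those kwargs also contain trust_remote_code, Python raises
    # TypeError: ... got multiple values for keyword argument 'trust_remote_code'
    return from_pretrained(name, trust_remote_code=True, **tokenizer_kwargs)


def load_fixed(name, **tokenizer_kwargs):
    # Copy the kwargs and drop any caller-supplied flag,
    # so that it is passed exactly once.
    kwargs = dict(tokenizer_kwargs)
    kwargs.pop("trust_remote_code", None)
    return from_pretrained(name, trust_remote_code=True, **kwargs)


if __name__ == "__main__":
    try:
        load_buggy("some/model", trust_remote_code=True)
    except TypeError as exc:
        print("buggy:", exc)
    print("fixed:", load_fixed("some/model", trust_remote_code=True))
```

The buggy variant only fails when the caller also supplies the flag, which would explain why the problem appears specifically when trust_remote_code=True is passed to SentenceTransformer.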

Update custom_st.py (63606bde)

tomaarsen

10 days ago

Hello!

This indeed works:

from sentence_transformers import SentenceTransformer
import torch

model = SentenceTransformer(
    "jinaai/jina-embeddings-v5-text-nano",
    trust_remote_code=True,
    model_kwargs={"dtype": torch.bfloat16},  # Recommended for GPUs
    revision="refs/pr/11",
)

query_embeddings = model.encode(
    sentences=["Overview of climate change impacts on coastal cities"],
    task="retrieval",
    prompt_name="query",
)
document_embeddings = model.encode(
    sentences=[
        "Climate change has led to rising sea levels, increased frequency of extreme weather events..."
    ],
    task="retrieval",
    prompt_name="document",
)

similarity = model.similarity(query_embeddings, document_embeddings)
print(similarity)
# tensor([[0.5529]])

I would recommend merging this. Without the revision argument, the same script loads the model from main and fails with TypeError: transformers.models.auto.tokenization_auto.AutoTokenizer.from_pretrained() got multiple values for keyword argument 'trust_remote_code'. See also https://github.com/huggingface/sentence-transformers/issues/3717

Tom Aarsen

jupyterjazz

Jina AI org 10 days ago

Thanks for the fix! I think jina-embeddings-v5-text-small has the same issue. Feel free to open a PR there too, otherwise I'll push the fix shortly.

jupyterjazz changed pull request status to merged 10 days ago

michael-guenther

Jina AI org 10 days ago

Thanks for reporting! I applied the fix to the small version as well: https://huggingface.co/jinaai/jina-embeddings-v5-text-small/discussions/19
