jina-embeddings-v5-text: Task-Targeted Embedding Distillation Paper • 2602.15547 • Published Feb 17 • 26
jina-embeddings-v5-text Collection Our 5th-gen embeddings: two lightweight multilingual models with SOTA performance in retrieval, matching, clustering, and classification. • 29 items • Updated Feb 27 • 38
Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 Feb 20 • 501
Article Training and Finetuning Reranker Models with Sentence Transformers v4 Mar 26, 2025 • 188
jina-reranker-v3: Last but Not Late Interaction for Document Reranking Paper • 2509.25085 • Published Sep 29, 2025 • 9
Article Transformers v5: Simple model definitions powering the AI ecosystem +2 Dec 1, 2025 • 307
mmBERT: a modern multilingual encoder Collection mmBERT is trained on 3T tokens from over 1800 languages, showing SoTA scores on benchmarks and exceptional low-resource performance • 16 items • Updated Sep 9, 2025 • 53
Article Welcome EmbeddingGemma, Google's new efficient embedding model +4 Sep 4, 2025 • 273
Article Training and Finetuning Sparse Embedding Models with Sentence Transformers v5 Jul 1, 2025 • 137
gpt-oss Collection Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7, 2025 • 428