e5-omni: Explicit Cross-modal Alignment for Omni-modal Embeddings Paper • 2601.03666 • Published Jan 7 • 5
view article Article Multimodal Embedding & Reranker Models with Sentence Transformers 8 days ago • 45
Beyond Hard Negatives: The Importance of Score Distribution in Knowledge Distillation for Dense Retrieval Paper • 2604.04734 • Published 11 days ago • 12
Improving Semantic Proximity in Information Retrieval through Cross-Lingual Alignment Paper • 2604.05684 • Published 10 days ago • 9
view article Article BidirLM: Turning Generative LLMs into the Best Open-Source Omnimodal Encoders 9 days ago • 24
MILCO: Multilingual Learned Sparse Retrieval Collection MILCO maps queries and documents from different languages into a shared English lexical space via a multilingual connector. • 3 items • Updated 21 days ago • 3
view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 15 days ago • 853
jina-embeddings-v5-text Collection Our 5th-gen embeddings: two lightweight multilingual models with SOTA performance in retrieval, matching, clustering, and classification. • 29 items • Updated Feb 27 • 38
pplx-embed Collection Diffusion-Pretrained Dense and Contextual Embeddings • 7 items • Updated Feb 26 • 96
KoViDoRe Benchmark (BEIR) v2 Collection Korean Vision Document Retrieval Benchmark • 4 items • Updated Mar 2 • 6
view article Article Nano-BEIR: A Multilingual Information Retrieval Benchmark with Quality-Enhanced Queries Dec 22, 2025 • 9
SONAR-LLM: Autoregressive Transformer that Thinks in Sentence Embeddings and Speaks in Tokens Paper • 2508.05305 • Published Aug 7, 2025 • 47
HyperCLOVA X SEED Collection HyperCLOVA X SEED is NAVER's lightweight open-source lineup with a strong focus on Korean language performance • 6 items • Updated Dec 24, 2025 • 42
EXAONE-Deep Collection EXAONE reasoning model series of 2.4B, 7.8B, and 32B, optimized for reasoning tasks including math and coding • 10 items • Updated Jul 7, 2025 • 97
Magpie-Llama3.1 Datasets Collection Dataset built with Meta Llama 3.1 70B. • 6 items • Updated Jan 13, 2025 • 4