Collections

Collections including paper arxiv:2308.03281

Towards General Text Embeddings with Multi-stage Contrastive Learning

Paper • 2308.03281 • Published Aug 7, 2023 • 3

Representation Learning

Text and Code Embeddings by Contrastive Pre-Training

Paper • 2201.10005 • Published Jan 24, 2022
Towards General Text Embeddings with Multi-stage Contrastive Learning

Paper • 2308.03281 • Published Aug 7, 2023 • 3
Nomic Embed: Training a Reproducible Long Context Text Embedder

Paper • 2402.01613 • Published Feb 2, 2024 • 17
Piccolo2: General Text Embedding with Multi-task Hybrid Loss Training

Paper • 2405.06932 • Published May 11, 2024 • 20

Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation

Paper • 2310.05737 • Published Oct 9, 2023 • 6
SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models

Paper • 2308.16692 • Published Aug 31, 2023 • 1
Towards General Text Embeddings with Multi-stage Contrastive Learning

Paper • 2308.03281 • Published Aug 7, 2023 • 3
ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings

Paper • 2305.11554 • Published May 19, 2023 • 2

General Text Embedding Models Released by Tongyi Lab of Alibaba Group

Alibaba-NLP/gte-Qwen2-7B-instruct

Sentence Similarity • 8B • Updated Mar 24, 2025 • 355k • 479
Alibaba-NLP/gte-Qwen2-1.5B-instruct

Sentence Similarity • 2B • Updated May 28, 2025 • 337k • 229
Alibaba-NLP/gte-multilingual-base

Sentence Similarity • 0.3B • Updated Jul 5, 2025 • 2.18M • 357
Alibaba-NLP/gte-multilingual-reranker-base

Text Ranking • 0.3B • Updated Jul 5, 2025 • 104k • 176

Towards General Text Embeddings with Multi-stage Contrastive Learning

Paper • 2308.03281 • Published Aug 7, 2023 • 3
NEFTune: Noisy Embeddings Improve Instruction Finetuning

Paper • 2310.05914 • Published Oct 9, 2023 • 14
EELBERT: Tiny Models through Dynamic Embeddings

Paper • 2310.20144 • Published Oct 31, 2023 • 3
Dynamic Word Embeddings for Evolving Semantic Discovery

Paper • 1703.00607 • Published Mar 2, 2017 • 1
