Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
lightonai
's Collections
DenseOn & LateOn
LightOnOCR-2 🦉
ColBERT-Zero 🐶
LateOn-Code 💻
OriOn 💫
PyLate 🐕
LightOnOCR 🦉
Embeddings datasets ⚡️
Ettin
ModernBERT
PAGnol 🇫🇷
RITA 🧿
Mamba 🐍
ArabicWeb24-ablation-models
Embeddings datasets ⚡️
updated
14 days ago
This collection gather datasets for embeddings pre-training and fine-tuning.
Upvote
4
lightonai/embeddings-pre-training
Viewer
•
Updated
5 days ago
•
1.38B
•
1.07k
•
27
lightonai/nanobeir-multilingual
Viewer
•
Updated
Sep 16, 2025
•
522k
•
534
•
11
lightonai/embeddings_supervised
Viewer
•
Updated
Oct 23, 2025
•
3.43M
•
179
•
10
Upvote
4
Share collection
View history
Collection guide
Browse collections