Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Mead0w1ark
/
multilingual-e5-small-hs-codes
like
0
Sentence Similarity
sentence-transformers
Safetensors
bert
feature-extraction
dense
Generated from Trainer
dataset_size:9829
loss:MultipleNegativesRankingLoss
text-embeddings-inference
arxiv:
1908.10084
arxiv:
1705.00652
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
multilingual-e5-small-hs-codes
1.21 GB
Ctrl+K
Ctrl+K
1 contributor
History:
13 commits
Mead0w1ark
Upload umap_data.json with huggingface_hub
472236b
verified
about 2 months ago
1_Pooling
Upload fine-tuned multilingual-e5-small for HS code classification
about 2 months ago
.gitattributes
Safe
1.68 kB
Add precomputed UMAP projection data (314K points)
about 2 months ago
README.md
Safe
15.4 kB
Upload fine-tuned multilingual-e5-small for HS code classification
about 2 months ago
config.json
Safe
755 Bytes
Upload fine-tuned multilingual-e5-small for HS code classification
about 2 months ago
config_sentence_transformers.json
Safe
277 Bytes
Upload fine-tuned multilingual-e5-small for HS code classification
about 2 months ago
embeddings.npy
482 MB
xet
Upload embeddings.npy with huggingface_hub
about 2 months ago
knn_classifier.pkl
pickle
Detected Pickle imports (3)
"numpy._core.numeric._frombuffer"
,
"numpy.dtype"
,
"sklearn.neighbors._classification.KNeighborsClassifier"
How to fix it?
12.1 MB
xet
Upload knn_classifier.pkl with huggingface_hub
about 2 months ago
label_encoder.pkl
pickle
Detected Pickle imports (4)
"sklearn.preprocessing._label.LabelEncoder"
,
"numpy._core.multiarray._reconstruct"
,
"numpy.ndarray"
,
"numpy.dtype"
How to fix it?
1.31 kB
xet
Add label_encoder.pkl
about 2 months ago
metadata.json
498 Bytes
Upload metadata.json with huggingface_hub
about 2 months ago
model.safetensors
Safe
471 MB
xet
Upload fine-tuned multilingual-e5-small for HS code classification
about 2 months ago
modules.json
Safe
349 Bytes
Upload fine-tuned multilingual-e5-small for HS code classification
about 2 months ago
sentence_bert_config.json
Safe
57 Bytes
Upload fine-tuned multilingual-e5-small for HS code classification
about 2 months ago
tokenizer.json
Safe
17.1 MB
xet
Upload fine-tuned multilingual-e5-small for HS code classification
about 2 months ago
tokenizer_config.json
Safe
349 Bytes
Upload fine-tuned multilingual-e5-small for HS code classification
about 2 months ago
training_data.csv
Safe
124 MB
xet
Add training_data.csv (314K rows)
about 2 months ago
umap_data.json
107 MB
xet
Upload umap_data.json with huggingface_hub
about 2 months ago