Sentence Similarity
sentence-transformers
Safetensors
bert
feature-extraction
text-embeddings-inference
Instructions to use yahyaabd/indosbert-bps-custom-tokenizer with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- sentence-transformers
How to use yahyaabd/indosbert-bps-custom-tokenizer with sentence-transformers:
from sentence_transformers import SentenceTransformer model = SentenceTransformer("yahyaabd/indosbert-bps-custom-tokenizer") sentences = [ "That is a happy person", "That is a happy dog", "That is a very happy person", "Today is a sunny day" ] embeddings = model.encode(sentences) similarities = model.similarity(embeddings, embeddings) print(similarities.shape) # [4, 4] - Notebooks
- Google Colab
- Kaggle
File size: 1,668 Bytes
6f6c413 4463ad1 6f6c413 4463ad1 6f6c413 4463ad1 6f6c413 4463ad1 6f6c413 4463ad1 6f6c413 4463ad1 6f6c413 4463ad1 6f6c413 4463ad1 6f6c413 4463ad1 6f6c413 4463ad1 6f6c413 4463ad1 6f6c413 4463ad1 6f6c413 4463ad1 6f6c413 4463ad1 6f6c413 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 | {
"BPP": 30551,
"BPS": 30521,
"BRS": 30566,
"Badan Pusat Statistik": 30522,
"CIF": 30556,
"EKSPOR": 30573,
"Ekspor": 30552,
"FOB": 30555,
"HLS": 30543,
"HS": 30554,
"IHK": 30525,
"IHP": 30527,
"IHPB": 30528,
"IMK": 30532,
"IMPOR": 30574,
"IPH": 30526,
"IPM": 30537,
"ITB": 30531,
"ITK": 30530,
"Impor": 30553,
"KATALOG": 30580,
"KB": 30542,
"KCI": 30562,
"KEGIATAN": 30577,
"KKI": 30561,
"KKL": 30560,
"KRT": 30563,
"Katalog": 30571,
"LPE": 30533,
"LTN": 30549,
"LTT": 30548,
"Metadata": 30570,
"NTP": 30529,
"NTUP": 30550,
"PDB": 30524,
"PDRB": 30523,
"PKL": 30559,
"PUBLIKASI": 30576,
"Publikasi": 30567,
"RLS": 30544,
"RT": 30564,
"RW": 30565,
"SAKERNAS": 30575,
"SDGI": 30541,
"SDKI": 30540,
"SEKTORAL": 30579,
"SP2020": 30538,
"ST2013": 30547,
"ST2023": 30546,
"STATISTIK": 30578,
"SUPAS": 30539,
"SURVEI": 30581,
"SUTAS": 30545,
"Sakernas": 30557,
"Sektoral": 30569,
"Survei": 30572,
"TPAK": 30558,
"TPK": 30534,
"TPT": 30535,
"UMP": 30536,
"_Statistik": 30568,
"bpp": 30604,
"brs": 30613,
"cif": 30606,
"fob": 30605,
"hls": 30596,
"ihk": 30583,
"imk": 30587,
"iph": 30584,
"ipm": 30591,
"itk": 30586,
"kci": 30611,
"kki": 30610,
"kkl": 30609,
"krt": 30612,
"lpe": 30588,
"ltn": 30602,
"ltt": 30601,
"metadata": 30615,
"ntp": 30585,
"ntup": 30603,
"pdrb": 30582,
"rls": 30597,
"sakernas": 30607,
"sdgi": 30595,
"sdki": 30594,
"sektoral": 30614,
"sp2020": 30592,
"st2013": 30600,
"st2023": 30599,
"supas": 30593,
"sutas": 30598,
"tpak": 30608,
"tpk": 30589,
"tpt": 30590
}
|