Harrier-OSS-v1-0.6B - ONNX INT8 (single-file)

Re-export of microsoft/harrier-oss-v1-0.6b as a single-file ONNX INT8 model for use with Transformers.js v4 and ONNX Runtime.

Property       Value
Dimensions     1024 (same as pplx-embed-v1)
MTEB v2        69.0
Languages      94
Max tokens     32,768
Pooling        Last-token + L2 norm
Architecture   Qwen3 (vocab 151,936)
Size           600 MB (INT8)
License        MIT
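
The "Last-token + L2 norm" pooling listed above can be sketched in plain Python as follows. This is a minimal illustration, not the export's actual post-processing code: the function names and the nested-list input layout (batch, sequence, dimension) are assumptions made for the example, and a real pipeline would apply the same steps to the model's last hidden state tensor.

```python
import math
from typing import List, Sequence

Vector = List[float]


def last_token_pool(hidden: Sequence[Sequence[Sequence[float]]],
                    mask: Sequence[Sequence[int]]) -> List[Vector]:
    """Pick the hidden state of the last non-padding token of each sequence.

    hidden: [batch][seq_len][dim] hidden states (hypothetical layout).
    mask:   [batch][seq_len] attention mask, 1 = real token, 0 = padding.
    """
    pooled = []
    for tokens, flags in zip(hidden, mask):
        real = [i for i, flag in enumerate(flags) if flag]
        if not real:
            raise ValueError("attention mask has no real tokens")
        # The highest index with mask == 1 is the last real token,
        # whether padding sits on the left or on the right.
        pooled.append([float(x) for x in tokens[real[-1]]])
    return pooled


def l2_normalize(vec: Sequence[float], eps: float = 1e-12) -> Vector:
    """Scale a vector to unit Euclidean length (eps guards against zero norm)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / max(norm, eps) for x in vec]


def pool_and_normalize(hidden, mask) -> List[Vector]:
    """Last-token pooling followed by L2 normalization."""
    return [l2_normalize(v) for v in last_token_pool(hidden, mask)]


# Toy batch: one sequence of three 2-D token states, the last one is padding.
toy_hidden = [[[1.0, 0.0], [3.0, 4.0], [9.0, 9.0]]]
toy_mask = [[1, 1, 0]]
toy_embedding = pool_and_normalize(toy_hidden, toy_mask)[0]
```

Because the outputs have unit length, the cosine similarity between two embeddings reduces to their dot product.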

Important: Queries must be prefixed with an instruction. See search_instructions.json.
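
As a rough sketch of what prefixing a query can look like, the helper below builds the query string before tokenization. The template "Instruct: ...\nQuery: ..." is an assumption borrowed from other instruction-tuned embedding models, and `format_query` is a hypothetical helper; the authoritative instruction texts and format are those in search_instructions.json and the upstream model card. Documents are passed through unchanged here, on the assumption that only queries need the prefix.

```python
# ASSUMPTION: this template is illustrative only. Check search_instructions.json
# and the upstream model card for the exact wording and layout.
ASSUMED_TEMPLATE = "Instruct: {instruction}\nQuery: {query}"


def format_query(query: str, instruction: str,
                 template: str = ASSUMED_TEMPLATE) -> str:
    """Return the query text with its task instruction prepended."""
    instruction = instruction.strip()
    if not instruction:
        raise ValueError("an instruction is required for queries")
    return template.format(instruction=instruction, query=query)


def format_document(text: str) -> str:
    """Documents are embedded as-is in this sketch (no instruction prefix)."""
    return text


# Hypothetical instruction text, used only to show the resulting string.
example = format_query(
    "how do I run an INT8 ONNX model in the browser?",
    "Given a web search query, retrieve relevant passages",
)
```

Keeping the template as a parameter makes it easy to swap in the exact format once it is confirmed against search_instructions.json.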

Potential drop-in replacement for pplx-embed-v1 (same 1024D, same vocab family).

Export by Deposium.
