embedl/all-MiniLM-L6-v2-quantized-trt

Tags: Sentence Similarity, TensorRT, ONNX, quantization, edge, embedl

all-MiniLM-L6-v2-quantized-trt
225 MB
  • 1 contributor
History: 5 commits
dann-od
Clean up some documentation
69a8f8f verified 1 day ago
  • .gitattributes
    1.56 kB
    First verion of model card 1 day ago
  • README.md
    10.6 kB
    Clean up some documentation 1 day ago
  • embedl_all-MiniLM-L6-v2_int8.onnx
    90 MB
    xet
    First verion of model card 1 day ago
  • embedl_all-MiniLM-L6-v2_int8.pt2
    135 MB
    xet
    First verion of model card 1 day ago
  • infer_pt2.py
    2.51 kB
    Minor fix to infer_pt2.py 1 day ago
  • infer_trt.py
    5.42 kB
    First verion of model card 1 day ago
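
The files listed above can be fetched programmatically. The following is a minimal sketch, assuming the `huggingface_hub` package is installed and the repository is publicly readable. The file names and sizes are copied from the listing; the helper name `download_files` and the default `local_dir` are illustrative choices, not part of the repository.

```python
from pathlib import Path

REPO_ID = "embedl/all-MiniLM-L6-v2-quantized-trt"

# Sizes in MB as shown in the file listing (model artifacts only).
LISTED_MODEL_SIZES_MB = {
    "embedl_all-MiniLM-L6-v2_int8.onnx": 90,
    "embedl_all-MiniLM-L6-v2_int8.pt2": 135,
}

# Helper scripts shipped alongside the model files.
SCRIPT_FILES = ["infer_pt2.py", "infer_trt.py"]


def download_files(filenames, local_dir="all-MiniLM-L6-v2-quantized-trt"):
    """Download the named files from the repository and return their local paths.

    `local_dir` is a hypothetical target directory chosen for this sketch.
    """
    # Imported lazily so the constants above can be used without the dependency.
    from huggingface_hub import hf_hub_download

    paths = []
    for name in filenames:
        path = hf_hub_download(repo_id=REPO_ID, filename=name, local_dir=local_dir)
        paths.append(Path(path))
    return paths
```

Calling `download_files(list(LISTED_MODEL_SIZES_MB) + SCRIPT_FILES)` would fetch both model artifacts and both inference scripts. The two listed model sizes add up to the 225 MB shown for the repository as a whole.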