DistilBERT Amazon Shoe Reviews - ONNX Quantized

Dynamically quantized ONNX export of juliensimon/distilbert-amazon-shoe-reviews, produced with Optimum for faster CPU inference.

Video walkthrough: Accelerate Transformer inference on CPU with Optimum and ONNX

Model Details

Base model: juliensimon/distilbert-amazon-shoe-reviews
Task: Star rating prediction (1–5 stars) from shoe reviews
Format: ONNX with dynamic quantization (INT8 weights)
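To make "INT8 weights" concrete, here is a minimal pure-Python sketch of symmetric per-tensor weight quantization. It is an illustration only: the helper names `quantize_int8` and `dequantize` are hypothetical, and the scheme ONNX Runtime actually applies to this model (per-channel or per-tensor, symmetric or asymmetric) may differ. In dynamic quantization, weights are converted ahead of time like this, while activation ranges are computed at inference time.

```python
def quantize_int8(weights):
    """Symmetric per-tensor quantization (illustrative, not the ONNX Runtime code).

    Maps floats to integers in [-127, 127] using a single scale factor.
    """
    max_abs = max(abs(w) for w in weights)
    # Guard against an all-zero tensor, where any scale would do.
    scale = max_abs / 127 if max_abs > 0 else 1.0
    quantized = [max(-127, min(127, round(w / scale))) for w in weights]
    return quantized, scale


def dequantize(quantized, scale):
    """Recover approximate float values from the INT8 representation."""
    return [q * scale for q in quantized]


# Toy weights chosen for illustration; they are not taken from the model.
weights = [0.5, -1.27, 0.02, 1.0]
quantized, scale = quantize_int8(weights)
restored = dequantize(quantized, scale)

# Rounding error per weight is bounded by half a quantization step.
max_error = max(abs(w - r) for w, r in zip(weights, restored))
print(quantized, scale, max_error)
```

Each weight is stored in one byte instead of four, which is where the smaller model size and much of the CPU speedup come from, at the cost of a rounding error of at most half a quantization step per weight.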

Usage

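from optimum.onnxruntime import ORTModelForSequenceClassification
from transformers import AutoTokenizer, pipeline

# Hub repository that holds both the quantized ONNX model and its tokenizer
model_id = "juliensimon/distilbert-amazon-shoe-reviews-onnx-quantized"

# Load the ONNX model (runs on ONNX Runtime) and the matching tokenizer
model = ORTModelForSequenceClassification.from_pretrained(model_id)
tokenizer = AutoTokenizer.from_pretrained(model_id)

# Wrap both in a standard Transformers text-classification pipeline
classifier = pipeline("text-classification", model=model, tokenizer=tokenizer)

# Returns a list with one dict per input, holding the predicted label and its score
classifier("These shoes are incredibly comfortable!")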
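The pipeline returns a list of dicts with `label` and `score` keys. The sketch below shows one way to turn such a prediction into a star rating. It rests on an assumption this card does not confirm: that labels are named `LABEL_0` to `LABEL_4` and that `LABEL_k` means `k + 1` stars. The helper `to_star_rating` is hypothetical and the example prediction is made up; check `model.config.id2label` before relying on the mapping.

```python
def to_star_rating(prediction):
    """Convert one pipeline prediction dict into (stars, score).

    ASSUMPTION: labels look like 'LABEL_0'..'LABEL_4' and LABEL_k means
    k + 1 stars. Verify against model.config.id2label for the real model.
    """
    label = prediction["label"]
    # Take the integer after the last underscore, e.g. 'LABEL_4' -> 4.
    index = int(label.rsplit("_", 1)[-1])
    return index + 1, prediction["score"]


# Made-up prediction in the pipeline's output shape; not real model output.
example_output = [{"label": "LABEL_4", "score": 0.93}]
stars, score = to_star_rating(example_output[0])
print(f"{stars} stars (score {score:.2f})")
```

In real use, `example_output` would be replaced by the value returned from `classifier(...)`.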

Related Models
