whisper-large-v3-turbo-onnx-base
This is the base ONNX (FP32) export of openai/whisper-large-v3-turbo.
Model Details
- Base Model: openai/whisper-large-v3-turbo
- Format: ONNX (BASE)
- Architecture: arm64
Size Comparison
| Version | Size |
|---|---|
| Base ONNX (FP32) (this model) | 4198.51 MB |
| FP16 ONNX | 2099.54 MB |
| INT8 Quantized ONNX | 1459.58 MB |
| Compression (this model vs. Base FP32) | 1.00x |
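The compression figure can be reproduced from the sizes in the table. The sketch below assumes the ratio is defined as the base FP32 size divided by the size of the variant in question, which matches the 1.00x reported for this model; the names `SIZES_MB` and `compression_ratios` are illustrative only.

```python
# Sizes copied from the table above, in MB.
SIZES_MB = {
    "Base ONNX (FP32)": 4198.51,
    "FP16 ONNX": 2099.54,
    "INT8 Quantized ONNX": 1459.58,
}


def compression_ratios(sizes, baseline="Base ONNX (FP32)"):
    """Return baseline_size / variant_size for every variant, rounded to 2 places.

    Assumption: "compression" means size relative to the FP32 base export.
    """
    base = sizes[baseline]
    return {name: round(base / size, 2) for name, size in sizes.items()}


ratios = compression_ratios(SIZES_MB)
for name, ratio in ratios.items():
    print(f"{name}: {ratio:.2f}x")
```

Under that assumption the FP16 export is about half the size of this model, and the INT8 export is a little under three times smaller.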
Usage
from optimum.onnxruntime import ORTModelForSpeechSeq2Seq
from transformers import AutoProcessor
model = ORTModelForSpeechSeq2Seq.from_pretrained("kostasang/whisper-large-v3-turbo-onnx-base")
processor = AutoProcessor.from_pretrained("kostasang/whisper-large-v3-turbo-onnx-base")
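The two calls above only load the model and processor. A minimal sketch of a full transcription step might look like the following, assuming the repository loads through `optimum` as shown, that PyTorch is installed (for `return_tensors="pt"`), and that the input is a mono waveform already sampled at 16 kHz. The helper name `transcribe` and its parameters are hypothetical, not part of this model card.

```python
REPO_ID = "kostasang/whisper-large-v3-turbo-onnx-base"
SAMPLE_RATE = 16_000  # Whisper feature extractors expect 16 kHz mono audio


def transcribe(audio, sampling_rate=SAMPLE_RATE, repo_id=REPO_ID):
    """Transcribe one mono waveform (1-D sequence of floats) and return the text.

    Hypothetical helper: loads the model on every call, which is fine for a
    demo but wasteful in a real service.
    """
    # Fail early instead of silently feeding audio at the wrong rate.
    if sampling_rate != SAMPLE_RATE:
        raise ValueError(
            f"expected {SAMPLE_RATE} Hz audio, got {sampling_rate} Hz; resample first"
        )

    # Imported lazily so that defining this helper does not pull in heavy packages.
    from optimum.onnxruntime import ORTModelForSpeechSeq2Seq
    from transformers import AutoProcessor

    model = ORTModelForSpeechSeq2Seq.from_pretrained(repo_id)
    processor = AutoProcessor.from_pretrained(repo_id)

    # Convert the waveform to log-mel input features, then decode generated tokens.
    inputs = processor(audio, sampling_rate=sampling_rate, return_tensors="pt")
    generated_ids = model.generate(inputs.input_features)
    return processor.batch_decode(generated_ids, skip_special_tokens=True)[0]
```

Calling `transcribe(waveform)` with a NumPy array or list of floats downloads the roughly 4.2 GB FP32 weights on first use, so expect the first call to be slow.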