parakeet-tdt-0.6b-v3 ONNX
ONNX export of nvidia/parakeet-tdt-0.6b-v3 for use with onnx-asr.
Multilingual speech-to-text model supporting 25 European languages with automatic language detection.
Usage
import onnx_asr
model = onnx_asr.load_model("nemo-conformer-tdt", "path/to/this/repo")
print(model.recognize("audio.wav"))
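To transcribe several recordings with one loaded model, the single-file call above can be wrapped in a loop. This is a minimal sketch: `transcribe_dir` and its `pattern` argument are hypothetical names that are not part of onnx-asr. It assumes only that the loaded model exposes `recognize(path)` as shown in the usage snippet.

```python
from pathlib import Path


def transcribe_dir(model, audio_dir, pattern="*.wav"):
    """Run model.recognize() on every matching file in audio_dir.

    `model` is any object with a recognize(path) method, such as the one
    returned by onnx_asr.load_model(...). This helper is illustrative
    and is not provided by onnx-asr.
    """
    results = {}
    # Sort so the output order is stable across runs and platforms.
    for wav in sorted(Path(audio_dir).glob(pattern)):
        results[wav.name] = model.recognize(str(wav))
    return results


# Example use (requires onnx-asr and the model files):
#   import onnx_asr
#   model = onnx_asr.load_model("nemo-conformer-tdt", "path/to/this/repo")
#   for name, text in transcribe_dir(model, "recordings").items():
#       print(name, text)
```

Loading the model once and reusing it avoids re-reading the ~2.4 GB encoder for each file.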
Files
| File | Size | Description |
|---|---|---|
| encoder-model.onnx | ~2.4 GB | FastConformer encoder |
| decoder_joint-model.onnx | ~69 MB | TDT decoder + joint network |
| vocab.txt | ~100 KB | Tokenizer vocabulary (8193 tokens) |
| config.json | ~124 B | Model configuration |
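Because the encoder is a multi-gigabyte download, it can be worth confirming that all four files are present before loading. The sketch below checks a local directory against the file names in the table; `missing_files` and `EXPECTED_FILES` are hypothetical names introduced here for illustration, and the check looks only at file presence, not at size or integrity.

```python
from pathlib import Path

# File names taken from the table above.
EXPECTED_FILES = (
    "encoder-model.onnx",
    "decoder_joint-model.onnx",
    "vocab.txt",
    "config.json",
)


def missing_files(repo_dir):
    """Return the expected file names that are absent from repo_dir."""
    root = Path(repo_dir)
    return [name for name in EXPECTED_FILES if not (root / name).is_file()]


# Example use:
#   missing = missing_files("path/to/this/repo")
#   if missing:
#       raise FileNotFoundError(f"Missing model files: {missing}")
```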
Requirements
pip install "onnx-asr[cpu]"
Supported Languages
Bulgarian, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, German, Greek, Hungarian, Italian, Latvian, Lithuanian, Maltese, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Spanish, Swedish, Ukrainian.