Parakeet TDT v3 - CoreML INT8 (iOS)

CoreML INT8 conversion of NVIDIA Parakeet-TDT 0.6B v3, optimized for iOS deployment on the Apple Neural Engine. The encoder uses EnumeratedShapes to accept variable-length audio input.
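
With EnumeratedShapes, a Core ML model accepts only the fixed set of input shapes chosen at conversion time, so variable-length audio has to be fitted to one of them before inference. The following is a minimal sketch of that idea, not the code used by this model: the lengths in `enumeratedLengths` are hypothetical (this card does not list the actual shapes), the unit is assumed to be raw samples at 16 kHz, and `bucketLength` and `padToBucket` are illustrative names.

```swift
// Hypothetical enumerated input lengths, in samples at 16 kHz (1 s, 5 s, 10 s, 30 s).
// The real shapes are fixed at conversion time and are not listed in this card.
let enumeratedLengths = [16_000, 80_000, 160_000, 480_000]

/// Returns the smallest enumerated length that can hold `count` samples,
/// or nil when the audio is longer than every enumerated shape.
func bucketLength(for count: Int, in lengths: [Int]) -> Int? {
    return lengths.sorted().first(where: { $0 >= count })
}

/// Zero-pads `samples` up to the chosen bucket so the input matches one of
/// the enumerated shapes. Returns nil when no shape is large enough.
func padToBucket(_ samples: [Float], lengths: [Int]) -> [Float]? {
    guard let target = bucketLength(for: samples.count, in: lengths) else {
        return nil
    }
    return samples + [Float](repeating: 0, count: target - samples.count)
}
```

Audio longer than the largest shape would need to be split into chunks first. A wrapper library may already handle both steps internally, in which case none of this is needed on the caller's side.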

Models

| Model | Description | Compute | Quantization |
|---|---|---|---|
| encoder.mlmodelc | FastConformer encoder (24L, 1024 hidden) | Neural Engine | INT8 palettized |
| decoder.mlmodelc | LSTM prediction network (2L, 640 hidden) | Neural Engine | FP16 |
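
For readers unfamiliar with palettization: each weight is stored as a small index into a lookup table of shared values, and with 8-bit indices the table holds at most 256 entries. The sketch below illustrates only that lookup step with made-up numbers. `PalettizedWeights` is an illustrative type, not part of Core ML or of this model, and how Core ML executes palettized layers on the Neural Engine is not shown.

```swift
/// A palettized weight tensor: a lookup table of shared values plus one
/// 8-bit index per weight. All names and values here are illustrative.
struct PalettizedWeights {
    let lookupTable: [Float]   // at most 256 entries for 8-bit indices
    let indices: [UInt8]       // one index per original weight

    /// Reconstructs approximate weights by looking up each index.
    func dequantized() -> [Float] {
        return indices.map { lookupTable[Int($0)] }
    }
}

// Toy example: a 4-entry table shared by 5 weights.
let toyWeights = PalettizedWeights(
    lookupTable: [-0.5, 0.0, 0.25, 1.0],
    indices: [3, 0, 0, 2, 1]
)
let restored = toyWeights.dequantized()
```

Storage drops from 16 or 32 bits per weight to 8 bits per weight plus the table, at the cost of restricting each layer's weights to the values in its table.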

Usage

Used by the ParakeetASR module in speech-swift:

// Load the pretrained CoreML model.
let model = try await ParakeetASRModel.fromPretrained()
// Transcribe audio samples recorded at 16 kHz.
let text = try model.transcribeAudio(samples, sampleRate: 16000)
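
The snippet above does not show where `samples` comes from. Assuming `transcribeAudio` expects mono Float samples normalized to roughly [-1, 1] (an assumption, since this card does not state the input type), 16-bit PCM audio could be prepared along these lines. `floatSamples(fromPCM16:)` is a hypothetical helper, not part of speech-swift.

```swift
/// Converts signed 16-bit PCM to Float samples in [-1, 1).
/// Dividing by 32768 maps Int16.min to exactly -1.0.
func floatSamples(fromPCM16 pcm: [Int16]) -> [Float] {
    return pcm.map { Float($0) / 32768.0 }
}

// Toy input: silence, half scale, and the most negative value.
let converted = floatSamples(fromPCM16: [0, 16384, -32768])
```

Audio recorded at a different rate would also need resampling to 16 kHz, unless the library does that itself based on the `sampleRate` argument.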
