CoreML Speech Models
Collection
Speech AI models for Apple Neural Engine via CoreML. iOS/macOS ready. ASR, TTS, VAD, diarization. β’ 18 items β’ Updated β’ 1
CoreML conversion of Silero VAD v5 for Apple Neural Engine.
| Detail | Value |
|---|---|
| Architecture | STFT β Conv1d encoder β LSTM β decoder |
| Parameters | ~309K |
| Input | 512 samples (32ms @ 16kHz) |
| Output | Speech probability (0.0β1.0) |
| Size | ~4.2 MB |
let vad = try await SileroVADModel.fromPretrained(backend: .coreML)
let prob = vad.processChunk(samples)
| Variant | Backend | Model ID |
|---|---|---|
| MLX | GPU | aufklarer/Silero-VAD-v5-MLX |
| CoreML | Neural Engine | aufklarer/Silero-VAD-v5-CoreML |