aufklarer
/

Silero-VAD-v5-CoreML

Voice Activity Detection

silero_vad_v5_coreml

Model card Files Files and versions

Silero-VAD-v5 — CoreML

CoreML conversion of Silero VAD v5 for Apple Neural Engine.

Model Details

Detail	Value
Architecture	STFT → Conv1d encoder → LSTM → decoder
Parameters	~309K
Input	512 samples (32ms @ 16kHz)
Output	Speech probability (0.0–1.0)
Size	~4.2 MB

Usage

let vad = try await SileroVADModel.fromPretrained(backend: .coreML)
let prob = vad.processChunk(samples)

Variants

Variant	Backend	Model ID
MLX	GPU	aufklarer/Silero-VAD-v5-MLX
CoreML	Neural Engine	aufklarer/Silero-VAD-v5-CoreML

Links

Swift library: soniqo/speech-swift
Original model: snakers4/silero-vad

Guide: soniqo.audio/guides/vad
Docs: soniqo.audio
GitHub: soniqo/speech-swift

Downloads last month: 2,114

Inference Providers NEW

Voice Activity Detection

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including aufklarer/Silero-VAD-v5-CoreML

CoreML Speech Models

Speech AI models for Apple Neural Engine via CoreML. iOS/macOS ready. ASR, TTS, VAD, diarization. • 18 items • Updated about 8 hours ago • 1