MLX Speech Models
Collection
Speech AI models for Apple Silicon via MLX. ASR, TTS, VAD, diarization, speaker embedding. • 29 items • Updated • 1
MLX 4-bit quantized conversion of Qwen/Qwen3-ASR-0.6B for Apple Silicon inference.
Used by speech-swift Qwen3ASR module:
let model = try await Qwen3ASRModel.fromPretrained()
let text = model.transcribe(audio: samples, sampleRate: 16000)
audio transcribe audio.wav
4-bit
Base model
Qwen/Qwen3-ASR-0.6B