MLX 4-bit quantized conversion of Qwen/Qwen3-TTS-12Hz-0.6B-CustomVoice for Apple Silicon inference.
Used by the Qwen3TTS module in speech-swift:
// Load the CustomVoice variant of the model.
let model = try await Qwen3TTSModel.fromPretrained(
    modelId: TTSModelVariant.customVoice.rawValue
)
// Synthesize speech with the named speaker.
let audio = try model.synthesize("Hello!", speaker: "Chelsie")
From the command line:
audio speak "Hello!" --model custom-voice --speaker Chelsie -o output.wav
Quantization: 4-bit
Base model: Qwen/Qwen3-TTS-12Hz-0.6B-CustomVoice
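To put the 4-bit figure in context, here is a rough back-of-the-envelope sketch of weight storage. It is not a measurement of this repository. The parameter count is the nominal 0.6B from the model name, and the group size of 64 with one 16-bit scale and one 16-bit bias per group is an assumption about how the conversion was done, as is the simplification that every parameter is quantized. The helper `estimatedBytes` is hypothetical and exists only for this illustration.

```swift
import Foundation

/// Approximate storage for weights under group-wise quantization.
/// Assumptions (not stated in the model card): every parameter is quantized,
/// and each group of `groupSize` weights stores one 16-bit scale and one
/// 16-bit bias alongside the packed values.
func estimatedBytes(parameterCount: Double, bits: Double, groupSize: Double) -> Double {
    let overheadBitsPerWeight = 32.0 / groupSize  // scale + bias, 16 bits each
    return parameterCount * (bits + overheadBitsPerWeight) / 8.0
}

// Nominal size taken from the "0.6B" in the model name (an approximation).
let parameters = 0.6e9

// Baseline: the same parameters stored as 16-bit floats.
let fp16Bytes = parameters * 16.0 / 8.0

// 4-bit weights with an assumed group size of 64.
let q4Bytes = estimatedBytes(parameterCount: parameters, bits: 4, groupSize: 64)

print(String(format: "16-bit: %.2f GB", fp16Bytes / 1e9))
print(String(format: "4-bit:  %.2f GB", q4Bytes / 1e9))
```

Under these assumptions the weights shrink from about 1.2 GB to roughly a third of a gigabyte. The actual download size may differ, since some layers are often left unquantized and the repository can include additional components.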