Qwen3 ASR 1.7B โ€” MLX 8-bit

MLX 8-bit quantized conversion of Qwen/Qwen3-ASR-1.7B for Apple Silicon inference.

Usage

Used by speech-swift Qwen3ASR module:

let model = try await Qwen3ASRModel.fromPretrained(
    modelId: "aufklarer/Qwen3-ASR-1.7B-MLX-8bit"
)
let text = model.transcribe(audio: samples, sampleRate: 16000)
audio transcribe --model large audio.wav

Model Details

  • Architecture: Qwen3-ASR encoder-decoder (Whisper-style audio encoder + Qwen3 text decoder)
  • Parameters: 1.7B
  • Quantization: 8-bit (MLX)
  • Size: ~2.3 GB
  • Languages: Multilingual (EN, ZH, JA, KO, FR, DE, ES, and more)

Links

Downloads last month
65
Safetensors
Model size
0.8B params
Tensor type
BF16
ยท
U32
ยท
MLX
Hardware compatibility
Log In to add your hardware

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for Alkd/Qwen3-ASR-1.7B-MLX-8bit

Quantized
(17)
this model