Mega-ASR MLX Q4

This is a private MLX conversion of zhifeixie/Mega-ASR.

The checkpoint was produced by merging the mega-asr-merged LoRA adapter from zhifeixie/Mega-ASR into the bundled Qwen3-ASR-1.7B base checkpoint, then converting the merged weights to the mlx-audio Qwen3-ASR layout.

Conversion

  • Base/source repo: zhifeixie/Mega-ASR
  • Adapter: mega-asr-merged
  • Format: MLX / mlx-audio
  • Quantization: affine Q4, group_size=64, bits=4
  • Text model and token embedding are quantized; audio tower remains full precision.

Use With mlx-audio

pip install -U mlx-audio
python -m mlx_audio.stt.generate --model runfuture/Mega-ASR-MLX-Q4 --audio audio.wav
Downloads last month
2
Safetensors
Model size
0.6B params
Tensor type
U32
·
BF16
·
MLX
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for runfuture/Mega-ASR-MLX-Q4

Quantized
(1)
this model