Mega-ASR-MLX-Q4 / README.md
runfuture's picture
Update README.md
a750058 verified
---
license: apache-2.0
tags:
- mlx
- speech-to-text
- speech
- transcription
- asr
- stt
- mlx-audio
library_name: mlx-audio
pipeline_tag: automatic-speech-recognition
base_model:
- zhifeixie/Mega-ASR
---
# Mega-ASR MLX Q4
This is a private MLX conversion of `zhifeixie/Mega-ASR`.
The checkpoint was produced by merging the `mega-asr-merged` LoRA adapter from
`zhifeixie/Mega-ASR` into the bundled `Qwen3-ASR-1.7B` base checkpoint, then
converting the merged weights to the `mlx-audio` Qwen3-ASR layout.
## Conversion
- Base/source repo: `zhifeixie/Mega-ASR`
- Adapter: `mega-asr-merged`
- Format: MLX / `mlx-audio`
- Quantization: affine Q4, `group_size=64`, `bits=4`
- Text model and token embedding are quantized; audio tower remains full precision.
## Use With mlx-audio
```bash
pip install -U mlx-audio
python -m mlx_audio.stt.generate --model runfuture/Mega-ASR-MLX-Q4 --audio audio.wav
```