runfuture
/

Mega-ASR-MLX-Q4

Automatic Speech Recognition

4-bit precision

Model card Files Files and versions

Mega-ASR-MLX-Q4 / README.md

runfuture's picture

Update README.md

a750058 verified about 19 hours ago

|

history blame contribute delete

904 Bytes

	---
	license: apache-2.0
	tags:
	- mlx
	- speech-to-text
	- speech
	- transcription
	- asr
	- stt
	- mlx-audio
	library_name: mlx-audio
	pipeline_tag: automatic-speech-recognition
	base_model:
	- zhifeixie/Mega-ASR
	---

	# Mega-ASR MLX Q4

	This is a private MLX conversion of `zhifeixie/Mega-ASR`.

	The checkpoint was produced by merging the `mega-asr-merged` LoRA adapter from
	`zhifeixie/Mega-ASR` into the bundled `Qwen3-ASR-1.7B` base checkpoint, then
	converting the merged weights to the `mlx-audio` Qwen3-ASR layout.

	## Conversion

	- Base/source repo: `zhifeixie/Mega-ASR`
	- Adapter: `mega-asr-merged`
	- Format: MLX / `mlx-audio`
	- Quantization: affine Q4, `group_size=64`, `bits=4`
	- Text model and token embedding are quantized; audio tower remains full precision.

	## Use With mlx-audio

	```bash
	pip install -U mlx-audio
	python -m mlx_audio.stt.generate --model runfuture/Mega-ASR-MLX-Q4 --audio audio.wav
	```