Automatic Speech Recognition
LiteRT
qwen
qwen3
chinese
cantonese
on-device
soniqo
speech-cloud
speech-core
File size: 860 Bytes
80df339
{
  "model": "Qwen3-ASR-0.6B",
  "component": "audio_encoder",
  "format": "tflite",
  "quantization": "int8",
  "sample_rate": 16000,
  "mel_frames_per_second": 100,
  "input_mel_frames": 1000,
  "input_mel_bins": 128,
  "output_tokens": 125,
  "output_dim": 1024,
  "encoder": {
    "num_layers": 18,
    "d_model": 896,
    "num_heads": 14,
    "ffn_dim": 3584
  },
  "inputs": {
    "mel": {
      "shape": [1, 128, 1000],
      "dtype": "float32"
    }
  },
  "outputs": {
    "audio_embeddings": {
      "shape": [1, 125, 1024],
      "dtype": "float32"
    }
  },
  "note": "This is the audio encoder only. The text decoder is a Qwen3-0.6B LLM; run it through LiteRT-LM (separate runtime) with the encoder outputs as cross-attention context. Supports 30 languages + 22 Chinese dialects."
}
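The scalar fields in this config imply a fixed input window and a fixed downsampling ratio, which a caller may want to check before feeding audio to the encoder. The following is a minimal sketch, not part of the repository: it inlines a subset of the config and uses the hypothetical helpers derived_quantities, check_shapes, and fit_to_window. The hop size is only the value implied by sample_rate and mel_frames_per_second, and zero-padding short inputs is an assumption; the real preprocessing may pad differently.

```python
import json

# Subset of the config above, inlined so the sketch is self-contained.
CONFIG = json.loads("""
{
  "sample_rate": 16000,
  "mel_frames_per_second": 100,
  "input_mel_frames": 1000,
  "input_mel_bins": 128,
  "output_tokens": 125,
  "output_dim": 1024,
  "inputs": {"mel": {"shape": [1, 128, 1000], "dtype": "float32"}},
  "outputs": {"audio_embeddings": {"shape": [1, 125, 1024], "dtype": "float32"}}
}
""")


def derived_quantities(cfg):
    """Return the window length, implied hop size, and token rate."""
    fps = cfg["mel_frames_per_second"]
    window_seconds = cfg["input_mel_frames"] / fps
    return {
        "window_seconds": window_seconds,
        "window_samples": int(window_seconds * cfg["sample_rate"]),
        # Implied by the frame rate; the actual feature extractor may differ.
        "hop_samples": cfg["sample_rate"] // fps,
        "frames_per_token": cfg["input_mel_frames"] // cfg["output_tokens"],
        "tokens_per_second": cfg["output_tokens"] / window_seconds,
    }


def check_shapes(cfg):
    """Check that the declared tensor shapes agree with the scalar fields."""
    mel_shape = cfg["inputs"]["mel"]["shape"]
    out_shape = cfg["outputs"]["audio_embeddings"]["shape"]
    return (
        mel_shape == [1, cfg["input_mel_bins"], cfg["input_mel_frames"]]
        and out_shape == [1, cfg["output_tokens"], cfg["output_dim"]]
    )


def fit_to_window(mel, n_frames, pad_value=0.0):
    """Pad or truncate a [bins][frames] nested list to n_frames columns.

    pad_value=0.0 is an assumption made for illustration only.
    """
    return [
        row[:n_frames] + [pad_value] * max(0, n_frames - len(row))
        for row in mel
    ]


if __name__ == "__main__":
    for name, value in derived_quantities(CONFIG).items():
        print(name, value)
    print("shapes consistent:", check_shapes(CONFIG))
```

With the values above, the encoder sees a 10-second window (1000 frames at 100 frames per second) and emits one embedding per 8 mel frames, or 12.5 embeddings per second of audio.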