Automatic Speech Recognition
Transformers
Safetensors
Chinese
English
Yue Chinese
qwen2
text-generation
text-generation-inference
Instructions to use XiaomiMiMo/MiMo-V2.5-ASR with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use XiaomiMiMo/MiMo-V2.5-ASR with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("automatic-speech-recognition", model="XiaomiMiMo/MiMo-V2.5-ASR")# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("XiaomiMiMo/MiMo-V2.5-ASR") model = AutoModelForCausalLM.from_pretrained("XiaomiMiMo/MiMo-V2.5-ASR") - Notebooks
- Google Colab
- Kaggle
Consolidate Results section to single summary chart
Browse files
README.md
CHANGED
|
@@ -63,23 +63,11 @@ Automatic speech recognition systems are expected to faithfully transcribe speec
|
|
| 63 |
|
| 64 |
## Results
|
| 65 |
|
| 66 |
-
MiMo-V2.5-ASR has been evaluated across a broad set of benchmarks spanning standard Mandarin and English, Chinese dialects,
|
| 67 |
|
| 68 |
-
|
| 69 |
|
| 70 |
-
|
| 71 |
-
|
| 72 |
-
### Standard English
|
| 73 |
-
|
| 74 |
-

|
| 75 |
-
|
| 76 |
-
### Chinese Dialects
|
| 77 |
-
|
| 78 |
-

|
| 79 |
-
|
| 80 |
-
### Singing & Code-Switch
|
| 81 |
-
|
| 82 |
-

|
| 83 |
|
| 84 |
## Model Download
|
| 85 |
|
|
|
|
| 63 |
|
| 64 |
## Results
|
| 65 |
|
| 66 |
+
MiMo-V2.5-ASR has been evaluated across a broad set of benchmarks spanning standard Mandarin and English, Chinese dialects, lyric recognition, and internal business scenarios. The chart below summarizes the average performance of MiMo-V2.5-ASR across these scenarios.
|
| 67 |
|
| 68 |
+

|
| 69 |
|
| 70 |
+
For per-benchmark numbers and specific qualitative cases, please refer to our [blog](https://xiaomimimo.github.io/MiMo-V2.5-ASR-Demo).
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 71 |
|
| 72 |
## Model Download
|
| 73 |
|