Update README.md
README.md
CHANGED
|
@@ -17,6 +17,20 @@ tasks: []
|
|
| 17 |
[Huggingface](https://huggingface.co/DataoceanAI)
|
| 18 |
[Modelscope](https://www.modelscope.cn/organization/DataoceanAI)
|
| 19 |
|
| 20 |
**Dolphin-CN-Dialect** is a multi-dialect ASR model developed by Dataocean AI and Tsinghua University, with a strong focus on Chinese dialect recognition and real-world deployment scenarios. Compared with the previous Dolphin series, Dolphin-CN-Dialect introduces significant improvements in tokenizer design, dialect-balanced training, streaming capability, hotword biasing, and deployment efficiency.
|
| 21 |
|
| 22 |
The model supports Mandarin Chinese and 22 Chinese dialects, while also maintaining multilingual ASR capability inherited from Dolphin. Dolphin-CN-Dialect supports both streaming and non-streaming inference, enabling practical deployment in latency-sensitive applications such as real-time transcription and industrial speech recognition systems.
|
|
@@ -51,7 +65,7 @@ Experimental results show that Dolphin-CN-Dialect achieves:
|
|
| 51 |

|
| 52 |
|
| 53 |
|
| 54 |
-
See details in the [Paper](https://arxiv.org/abs/
|
| 55 |
|
| 56 |
|
| 57 |
## Setup
|
|
|
|
| 17 |
[Huggingface](https://huggingface.co/DataoceanAI)
|
| 18 |
[Modelscope](https://www.modelscope.cn/organization/DataoceanAI)
|
| 19 |
|
| 20 |
+
# Repository Notice
|
| 21 |
+
|
| 22 |
+
This model is officially maintained by **Dataocean AI**.
|
| 23 |
+
|
| 24 |
+
To ensure compatibility with existing user code and download links, we keep two official repositories for the same model:
|
| 25 |
+
|
| 26 |
+
- Original / legacy repository: DataoceanAI
|
| 27 |
+
- Organization / enterprise repository: DataoceanAI1
|
| 28 |
+
|
| 29 |
+
Both repositories are maintained by the same team and contain the same model files.
|
| 30 |
+
DataoceanAI1 is the newly created enterprise organization account, while DataoceanAI is kept to avoid breaking existing user download scripts and links.
|
| 31 |
+
|
| 32 |
+
Please do not regard either repository as an unofficial copy or unauthorized redistribution.
|
| 33 |
+
|
| 34 |
**Dolphin-CN-Dialect** is a multi-dialect ASR model developed by Dataocean AI and Tsinghua University, with a strong focus on Chinese dialect recognition and real-world deployment scenarios. Compared with the previous Dolphin series, Dolphin-CN-Dialect introduces significant improvements in tokenizer design, dialect-balanced training, streaming capability, hotword biasing, and deployment efficiency.
|
| 35 |
|
| 36 |
The model supports Mandarin Chinese and 22 Chinese dialects, while also maintaining multilingual ASR capability inherited from Dolphin. Dolphin-CN-Dialect supports both streaming and non-streaming inference, enabling practical deployment in latency-sensitive applications such as real-time transcription and industrial speech recognition systems.
|
|
|
|
| 65 |

|
| 66 |
|
| 67 |
|
| 68 |
+
See details in the [Paper](https://arxiv.org/abs/2605.08961).
|
| 69 |
|
| 70 |
|
| 71 |
## Setup
|