DataoceanAI committed
Commit c67224c (verified)
1 Parent(s): 1694937

Update README.md

Files changed (1)
  1. README.md +15 -1
README.md CHANGED
@@ -17,6 +17,20 @@ tasks: []
17
  [Huggingface](https://huggingface.co/DataoceanAI)
18
  [Modelscope](https://www.modelscope.cn/organization/DataoceanAI)
19
 
20
  **Dolphin-CN-Dialect** is a multi-dialect ASR model developed by Dataocean AI and Tsinghua University, with a strong focus on Chinese dialect recognition and real-world deployment scenarios. Compared with the previous Dolphin series, Dolphin-CN-Dialect introduces significant improvements in tokenizer design, dialect-balanced training, streaming capability, hotword biasing, and deployment efficiency.
21
 
22
  The model supports Mandarin Chinese and 22 Chinese dialects, while also maintaining multilingual ASR capability inherited from Dolphin. Dolphin-CN-Dialect supports both streaming and non-streaming inference, enabling practical deployment in latency-sensitive applications such as real-time transcription and industrial speech recognition systems.
@@ -51,7 +65,7 @@ Experimental results show that Dolphin-CN-Dialect achieves:
51
  ![Dolphin-CN-Dialect feature poster](Dolphin-CN-Dialect.png)
52
 
53
 
54
- See details in the [Paper](https://arxiv.org/abs/2503.20212).
55
 
56
 
57
  ## Setup
 
17
  [Huggingface](https://huggingface.co/DataoceanAI)
18
  [Modelscope](https://www.modelscope.cn/organization/DataoceanAI)
19
 
20
+ # Repository Notice
21
+
22
+ This model is officially maintained by **Dataocean AI**.
23
+
24
+ To ensure compatibility with existing user code and download links, we keep two official repositories for the same model:
25
+
26
+ - Original / legacy repository: DataoceanAI
27
+ - Organization / enterprise repository: DataoceanAI1
28
+
29
+ Both repositories are maintained by the same team and contain the same model files.
30
+ DataoceanAI1 is the newly created enterprise organization account, while DataoceanAI is kept to avoid breaking existing user download scripts and links.
31
+
32
+ Please do not regard either repository as an unofficial copy or unauthorized redistribution.
33
+
34
  **Dolphin-CN-Dialect** is a multi-dialect ASR model developed by Dataocean AI and Tsinghua University, with a strong focus on Chinese dialect recognition and real-world deployment scenarios. Compared with the previous Dolphin series, Dolphin-CN-Dialect introduces significant improvements in tokenizer design, dialect-balanced training, streaming capability, hotword biasing, and deployment efficiency.
35
 
36
  The model supports Mandarin Chinese and 22 Chinese dialects, while also maintaining multilingual ASR capability inherited from Dolphin. Dolphin-CN-Dialect supports both streaming and non-streaming inference, enabling practical deployment in latency-sensitive applications such as real-time transcription and industrial speech recognition systems.
 
65
  ![Dolphin-CN-Dialect feature poster](Dolphin-CN-Dialect.png)
66
 
67
 
68
+ See details in the [Paper](https://arxiv.org/abs/2605.08961).
69
 
70
 
71
  ## Setup