Update README.md
Browse files
README.md
CHANGED
|
@@ -19,14 +19,14 @@ Calibration uses natural top-k routing rather than forcing all experts to activa
|
|
| 19 |
|
| 20 |
### Calibration dataset
|
| 21 |
|
| 22 |
-
|
| 23 |
-
|
| 24 |
-
1. **Coding
|
| 25 |
-
2. **Broad
|
| 26 |
-
3. **Deep
|
| 27 |
-
4. **Image
|
| 28 |
-
5. **Audio
|
| 29 |
-
6. **Video
|
| 30 |
|
| 31 |
### Requirements
|
| 32 |
|
|
|
|
| 19 |
|
| 20 |
### Calibration dataset
|
| 21 |
|
| 22 |
+
Six calibration passes were run:
|
| 23 |
+
|
| 24 |
+
1. **Coding** β Agentic coding samples (tool calling, multi-turn code generation, function calling) with English and Chinese system prompts.
|
| 25 |
+
2. **Broad** β Large-scale diverse samples drawn from WildChat-NonToxic and LMSYS-Chat covering real user conversations across a wide range of topics and languages.
|
| 26 |
+
3. **Deep** β Long-context samples (>8K tokens) from coding and diverse sources to exercise deep-sequence expert activation patterns.
|
| 27 |
+
4. **Image** β Image question-answering prompts, with the input images drawn from a large collection of public, high quality image datasets.
|
| 28 |
+
5. **Audio** β Medium-size dataset of mostly speech.
|
| 29 |
+
6. **Video** β Diverse set of video question-answering prompts, with a wide variety of input videos of different durations and resolutions.
|
| 30 |
|
| 31 |
### Requirements
|
| 32 |
|