Rekipjan/uyghurtext
Viewer • Updated • 53.3k • 37
⚠️ Known Issue: "Over-learning"
This is a fine-tuned version of the DeepSeek-R1-Distill-Qwen-1.5B. It is specifically optimized for multilingual instruction following, with a heavy emphasis on Uyghur (ئۇيغۇرچە), Chinese, and English cross-domain reasoning.
The model was trained on a high-diversity corpus of 53,273 samples, covering 40+ domains including daily life, traditional crafts, history, reasoning, and AI basics.
| Category | Sample Count | Percentage |
|---|---|---|
| Dialog (پاراڭلىشىش) | 16,385 | 30.76% |
| Reasoning (تەپەككۇر) | 15,484 | 29.07% |
| QA (سوئال-جاۋاب) | 15,409 | 28.92% |
| Creative (ئىجادىيەت) | 5,076 | 9.53% |
| Translation (تەرجىمە) | 919 | 1.73% |
llamafactory-cliObservation: After 3 full epochs, the loss plateaued at step 7,000. Testing indicates the model is "over-baked" (Overfitting).
License: This model is licensed under the Apache 2.0 License, following the base model's licensing terms.
Base model
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B