Update README.md
README.md
CHANGED
@@ -48,8 +48,7 @@ The model was trained using a three-phase **Curriculum Learning** strategy:
 Integration of social media, forums, and informal web data to handle slang and modern registers.
 3. **Phase 3: Long-Context Extension (20.4B tokens)**
 Fine-tuning on dense, long-form documents to stabilize the 64k context window.
-
-> **Alignment:** Supervised Fine-Tuning (SFT) was performed on **2 million samples**, including localized knowledge distillation and the **"Hebrew IFEval"** dataset.
+4. **Alignment:** Supervised Fine-Tuning (SFT) was performed on **2 million samples**, including localized knowledge distillation and the **"Hebrew IFEval"** dataset.
 
 ---
 