Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -48,8 +48,7 @@ The model was trained using a three-phase **Curriculum Learning** strategy:
    Integration of social media, forums, and informal web data to handle slang and modern registers.
 3. **Phase 3: Long-Context Extension (20.4B tokens)**
    Fine-tuning on dense, long-form documents to stabilize the 64k context window.
-> **Alignment:** Supervised Fine-Tuning (SFT) was performed on **2 million samples**, including localized knowledge distillation and the **"Hebrew IFEval"** dataset.
 ---

    Integration of social media, forums, and informal web data to handle slang and modern registers.
 3. **Phase 3: Long-Context Extension (20.4B tokens)**
    Fine-tuning on dense, long-form documents to stabilize the 64k context window.
+4. **Alignment:** Supervised Fine-Tuning (SFT) was performed on **2 million samples**, including localized knowledge distillation and the **"Hebrew IFEval"** dataset.
 ---