Update README.md
README.md
CHANGED
@@ -89,7 +89,7 @@ Keeping this in mind:
 
 ## Results
 
-I trained StableLM-3B-4e1t repeatedly on [https://huggingface.co/datasets/euclaise/TinyCoT
+I trained StableLM-3B-4e1t repeatedly on [TinyCoT](https://huggingface.co/datasets/euclaise/TinyCoT), along with 1000 examples from [reddit-instruct-curated](https://huggingface.co/datasets/euclaise/reddit-instruct-curated) and 1000 examples from [oasst2-curated](https://huggingface.co/datasets/sablo/oasst2_curated).
 
 I trained once with ReMask/ReMask-CoT, once without regularization to match Masked Thought (w/ partial label-masking for CoT), and once with SFT.
 