Update README.md
README.md
CHANGED
@@ -15,13 +15,13 @@ Feel free to explore, download, and experiment with our latest releases!
 
 ## The Boldt Model Family
 
-Our models are trained on
+Our models are trained on our German *Dense-Core* subset of FineWeb-2, utilizing a multi-epoch training recipe on high-quality data.
 
 | Model | Parameters | Context Window | Description |
 | :--- | :--- | :--- | :--- |
 | [**Boldt-DC-350M**](https://huggingface.co/Boldt/Boldt-DC-350M) | 350M | 2048 | Ultra-lightweight base model for constrained environments. |
 | [**Boldt-DC-1B**](https://huggingface.co/Boldt/Boldt-DC-1B) | 1B | 2048 | Highly optimized 1B base model with top-tier German performance. |
-| [**Boldt-1B**](https://huggingface.co/Boldt/Boldt-1B) | 1B | 4096 | Extended context and vocabulary, augmented with 6B tokens of high-quality news. |
+| [**Boldt-1B**](https://huggingface.co/Boldt/Boldt-1B) | 1B | 4096 | Extended context and vocabulary, augmented with 6B tokens of high-quality German news data. |
 | [**Boldt-1B-IT-Preview**](https://huggingface.co/Boldt/Boldt-1B-IT-Preview) | 1B | 4096 | Instruction-tuned preview model for chat and zero-shot tasks. |
 
 ## Research & Artifacts