alanakbik commited on
Commit
2960ed1
Β·
verified Β·
1 Parent(s): 3308b6d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -15,13 +15,13 @@ Feel free to explore, download, and experiment with our latest releases! πŸš€
15
 
16
  ## 🌟 The Boldt Model Family
17
 
18
- Our models are trained on the German *Dense-Core* subset of FineWeb-2, utilizing a multi-epoch training recipe on high-quality data.
19
 
20
  | Model | Parameters | Context Window | Description |
21
  | :--- | :--- | :--- | :--- |
22
  | [**Boldt-DC-350M**](https://huggingface.co/Boldt/Boldt-DC-350M) | 350M | 2048 | Ultra-lightweight base model for constrained environments. |
23
  | [**Boldt-DC-1B**](https://huggingface.co/Boldt/Boldt-DC-1B) | 1B | 2048 | Highly optimized 1B base model with top-tier German performance. |
24
- | [**Boldt-1B**](https://huggingface.co/Boldt/Boldt-1B) | 1B | 4096 | Extended context and vocabulary, augmented with 6B tokens of high-quality news. |
25
  | [**Boldt-1B-IT-Preview**](https://huggingface.co/Boldt/Boldt-1B-IT-Preview) | 1B | 4096 | Instruction-tuned preview model for chat and zero-shot tasks. |
26
 
27
  ## πŸ“– Research & Artifacts
 
15
 
16
  ## 🌟 The Boldt Model Family
17
 
18
+ Our models are trained on our German *Dense-Core* subset of FineWeb-2, utilizing a multi-epoch training recipe on high-quality data.
19
 
20
  | Model | Parameters | Context Window | Description |
21
  | :--- | :--- | :--- | :--- |
22
  | [**Boldt-DC-350M**](https://huggingface.co/Boldt/Boldt-DC-350M) | 350M | 2048 | Ultra-lightweight base model for constrained environments. |
23
  | [**Boldt-DC-1B**](https://huggingface.co/Boldt/Boldt-DC-1B) | 1B | 2048 | Highly optimized 1B base model with top-tier German performance. |
24
+ | [**Boldt-1B**](https://huggingface.co/Boldt/Boldt-1B) | 1B | 4096 | Extended context and vocabulary, augmented with 6B tokens of high-quality German news data. |
25
  | [**Boldt-1B-IT-Preview**](https://huggingface.co/Boldt/Boldt-1B-IT-Preview) | 1B | 4096 | Instruction-tuned preview model for chat and zero-shot tasks. |
26
 
27
  ## πŸ“– Research & Artifacts