pipeline_tag: text-generation
library_name: transformers
---

# Boldt-350M

<img src="logo.png" width="500">
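The card metadata declares `transformers` as the library. A minimal text-generation sketch might look like the following; note that the repo id and the prompt are placeholder assumptions for illustration, not taken from the card.

```python
# Hypothetical usage sketch — the repo id "Boldt/Boldt-350M" and the German
# prompt are assumptions; substitute the actual Hub repo id of the model.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Boldt/Boldt-350M"  # assumed repo id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Tokenize a German prompt and generate up to 64 new tokens.
inputs = tokenizer("Wie heißt die Hauptstadt von Deutschland?", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```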
## Evaluation

We evaluate Boldt-350M on our [modernized German benchmark suite](https://huggingface.co/collections/Boldt/german-llm-benchmarks). It comprises the German subset of [Global MMLU](https://huggingface.co/datasets/CohereLabs/Global-MMLU) and updated translations of widely used English benchmarks, produced using [Tower+ 72B](https://huggingface.co/Unbabel/Tower-Plus-72B) to address issues we identified in existing German benchmark translations.

Despite being trained on substantially less data, Boldt-350M outperforms other German-capable SLMs of similar size on our evaluation suite, and it performs competitively with larger (around 1B-parameter) German and multilingual models.

| Category | Model | Tokens | MMLU | ARC-C | ARC-E | H-Swag | LAMBADA | OBQA | Avg. |
|----------|--------|--------|------|-------|-------|--------|----------|------|------|