Danish Foundation Models

community

https://foundationmodels.dk

danish-foundation-models

Recent Activity

KennethEnevoldsen new activity about 1 hour ago

danish-foundation-models/swedish-dynaword:Add Dalpilen 1860's

V4ldeLund new activity about 3 hours ago

danish-foundation-models/swedish-dynaword:Change contributing.md

V4ldeLund new activity about 3 hours ago

danish-foundation-models/swedish-dynaword:Add Dalpilen 1860's

danish-foundation-models's collections (8)

Dynawords

A collection of dynawords targeting various languages.

danish-foundation-models/norwegian-dynaword

Viewer • Updated about 3 hours ago • 8.73M • 508 • 5
danish-foundation-models/danish-dynaword

Viewer • Updated about 3 hours ago • 11.3M • 5.57k • 18

AI-Arenaen

[Unreleased] Datasets related to AI-Arenaen

danish-foundation-models/ai-arenaen-conversations

Viewer • Updated about 7 hours ago • 130 • 2
danish-foundation-models/ai-arenaen-reactions

Viewer • Updated 3 days ago • 32 • 2
danish-foundation-models/ai-arenaen-conversations-raw

Updated Dec 25, 2025 • 2
danish-foundation-models/ai-arenaen-votes

Viewer • Updated about 7 hours ago • 78 • 2

Papers

Dynaword: From One-shot to Continuously Developed Datasets

Paper • 2508.02271 • Published Aug 4, 2025 • 15

Danish Benchmarks

Benchmarks for evaluating Danish models.

EuroEval Leaderboard 📊

Running • 7 • The robust European language model benchmark.

ScandEval: A Benchmark for Scandinavian Natural Language Processing

Paper • 2304.00906 • Published Apr 3, 2023 • 4

MTEB Leaderboard 🥇

Running on CPU Upgrade • 7.25k • Embedding Leaderboard

The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text Embedding

Paper • 2406.02396 • Published Jun 4, 2024

EuroEval Compatible Datasets

A collection of EuroEval-compatible datasets, which can be run using: `euroeval --dataset {dataset name} --model {model name}`

giannor/dala

Viewer • Updated Feb 13 • 8.68k • 76
giannor/dala_large

Viewer • Updated Feb 13 • 7.66k • 36
giannor/dala_medium

Viewer • Updated Feb 13 • 7.66k • 13
TIGER-Lab/MMLU-Pro

Benchmark • Updated Mar 11 • 12.1k • 107k • 466
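
The `euroeval` command from the collection description can be assembled programmatically when several datasets are to be evaluated against one model. The following is a minimal sketch, not part of EuroEval itself: the helper `build_euroeval_command` is hypothetical, and the dataset names `dala` and `dala_large` are assumed placeholders (the exact names EuroEval expects may differ from the repository names listed above). Only the `--dataset` and `--model` flags are taken from the description. The sketch prints the commands instead of running them.

```python
import shlex


def build_euroeval_command(dataset: str, model: str) -> list[str]:
    """Build the argument list for one run (hypothetical helper).

    Only the --dataset and --model flags come from the collection
    description; nothing else about the CLI is assumed here.
    """
    if not dataset or not model:
        raise ValueError("both dataset and model must be non-empty")
    return ["euroeval", "--dataset", dataset, "--model", model]


# Assumed placeholder dataset names; check the EuroEval documentation
# for the names it actually accepts.
datasets_to_run = ["dala", "dala_large"]
model_name = "google/gemma-2-9b-it"  # one of the models listed on this page

commands = [build_euroeval_command(d, model_name) for d in datasets_to_run]
for argv in commands:
    # shlex.join produces a shell-safe string for display or logging.
    print(shlex.join(argv))
```

With the `euroeval` package installed, each argument list could be passed to `subprocess.run(argv, check=True)` to launch the evaluation.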

Dynaword Paper artifacts

This is a collection of artifacts released as part of the paper "Dynaword: From One-shot to Continuously Developed Datasets".

Dynaword: From One-shot to Continuously Developed Datasets

Paper • 2508.02271 • Published Aug 4, 2025 • 15
danish-foundation-models/danish-dynaword

Viewer • Updated about 3 hours ago • 11.3M • 5.57k • 18
danish-foundation-models/gemma-3-1b-cpt-dynaword-matched-v1

Text Generation • 1.0B • Updated Nov 4, 2025 • 2
danish-foundation-models/gemma-3-1b-scratch-dynaword-full-v1

Text Generation • 1.0B • Updated Nov 4, 2025 • 7

Danish Text Datasets

High-quality Danish text datasets for pre-training, fine-tuning, and similar uses.

DDSC/angry-tweets

Viewer • Updated Jul 20, 2023 • 3.46k • 674 • 3
DDSC/europarl

Viewer • Updated Jul 1, 2022 • 957 • 99 • 2
DDSC/lcc

Viewer • Updated Jul 20, 2023 • 499 • 3.74k • 3
strombergnlp/bajer_danish_misogyny

Updated May 16, 2023 • 19 • 2

State-of-the-art Danish Models

These are state-of-the-art models for Danish within their respective domains (each domain is noted below the model).

mistralai/Mistral-Small-3.1-24B-Instruct-2503

Updated Dec 22, 2025 • 533k • 1.36k
google/gemma-3-27b-it

Image-Text-to-Text • 27B • Updated Mar 21, 2025 • 728k • 1.95k
google/gemma-3n-E4B-it

Image-Text-to-Text • Updated Jul 14, 2025 • 39.9k • 900
google/gemma-2-9b-it

Text Generation • 9B • Updated Aug 27, 2024 • 328k • 789

Team members 20
