Multilingual MT-Bench LGAI-EXAONE/KoMT-Bench Viewer • Updated Aug 8, 2024 • 80 • 138 • 41 StudentLLM/Korean_MT-Bench_questions Viewer • Updated Jul 10, 2023 • 80 • 12 • 1 naive-puzzle/japanese-mt-bench Viewer • Updated Aug 2, 2024 • 380 • 21 • 1 karakuri-ai/corrected-mt-bench-ja Viewer • Updated Jul 11, 2024 • 80 • 70 • 1
Foundation Corpus HuggingFaceFW/fineweb-edu Viewer • Updated Jul 11, 2025 • 3.5B • 349k • 1.02k mlfoundations/dclm-baseline-1.0 Preview • Updated Jul 22, 2024 • 151k • 262 Zyphra/Zyda-2 Preview • Updated Aug 6, 2025 • 133k • 90 LLM360/TxT360 Updated May 26, 2025 • 35.3k • 248
Benchmark openai/MMMLU Viewer • Updated Oct 16, 2024 • 393k • 7.43k • 519 google/IFEval Viewer • Updated Aug 14, 2024 • 541 • 87.3k • 145
Reward Datasets nvidia/HelpSteer2 Viewer • Updated Dec 18, 2024 • 21.4k • 3.46k • 443 Skywork/Skywork-Reward-Preference-80K-v0.1 Viewer • Updated Oct 25, 2024 • 82k • 124 • 47
Math MathGenie/MathCode-Pile Viewer • Updated Oct 16, 2024 • 719k • 255 • 25 TIGER-Lab/MathInstruct Viewer • Updated May 15, 2024 • 262k • 12k • 301
benchmark-target tasksource/tasksource_dpo_pairs Viewer • Updated Jul 1, 2024 • 5.13M • 108 • 21 euclaise/SuperMC Viewer • Updated Jan 25, 2024 • 278k • 16 • 2 argilla/ifeval-like-data Viewer • Updated Oct 17, 2024 • 606k • 645 • 47
Benchmark-Korean skt/kobest_v1 Viewer • Updated Mar 28, 2024 • 23.4k • 3.05k • 54 HAERAE-HUB/KMMLU Viewer • Updated Mar 5, 2024 • 244k • 6.79k • 97 HAERAE-HUB/KMMLU-HARD Viewer • Updated Mar 9, 2024 • 4.33k • 877 • 13 maywell/LogicKor Preview • Updated Jun 9, 2024 • 53 • 21
Korean Datasets [Good] HAERAE-HUB/Korean-Human-Judgements Viewer • Updated Jun 30, 2024 • 694 • 29 • 38
Multilingual MT-Bench LGAI-EXAONE/KoMT-Bench Viewer • Updated Aug 8, 2024 • 80 • 138 • 41 StudentLLM/Korean_MT-Bench_questions Viewer • Updated Jul 10, 2023 • 80 • 12 • 1 naive-puzzle/japanese-mt-bench Viewer • Updated Aug 2, 2024 • 380 • 21 • 1 karakuri-ai/corrected-mt-bench-ja Viewer • Updated Jul 11, 2024 • 80 • 70 • 1
Math MathGenie/MathCode-Pile Viewer • Updated Oct 16, 2024 • 719k • 255 • 25 TIGER-Lab/MathInstruct Viewer • Updated May 15, 2024 • 262k • 12k • 301
Foundation Corpus HuggingFaceFW/fineweb-edu Viewer • Updated Jul 11, 2025 • 3.5B • 349k • 1.02k mlfoundations/dclm-baseline-1.0 Preview • Updated Jul 22, 2024 • 151k • 262 Zyphra/Zyda-2 Preview • Updated Aug 6, 2025 • 133k • 90 LLM360/TxT360 Updated May 26, 2025 • 35.3k • 248
benchmark-target tasksource/tasksource_dpo_pairs Viewer • Updated Jul 1, 2024 • 5.13M • 108 • 21 euclaise/SuperMC Viewer • Updated Jan 25, 2024 • 278k • 16 • 2 argilla/ifeval-like-data Viewer • Updated Oct 17, 2024 • 606k • 645 • 47
Benchmark openai/MMMLU Viewer • Updated Oct 16, 2024 • 393k • 7.43k • 519 google/IFEval Viewer • Updated Aug 14, 2024 • 541 • 87.3k • 145
Benchmark-Korean skt/kobest_v1 Viewer • Updated Mar 28, 2024 • 23.4k • 3.05k • 54 HAERAE-HUB/KMMLU Viewer • Updated Mar 5, 2024 • 244k • 6.79k • 97 HAERAE-HUB/KMMLU-HARD Viewer • Updated Mar 9, 2024 • 4.33k • 877 • 13 maywell/LogicKor Preview • Updated Jun 9, 2024 • 53 • 21
Reward Datasets nvidia/HelpSteer2 Viewer • Updated Dec 18, 2024 • 21.4k • 3.46k • 443 Skywork/Skywork-Reward-Preference-80K-v0.1 Viewer • Updated Oct 25, 2024 • 82k • 124 • 47
Korean Datasets [Good] HAERAE-HUB/Korean-Human-Judgements Viewer • Updated Jun 30, 2024 • 694 • 29 • 38