Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
thunder-research-group 's Collections
SNU Thunder-LLM Korean Benchmark Suite
SNU Thunder-LLM English Benchmark Suite
SNU Thunder-LLM Dataset Suite
Post-Training Datasets
Negation Benchmarks

SNU Thunder-LLM Korean Benchmark Suite

updated 3 days ago
Upvote
1

  • thunder-research-group/SNU_Ko-LAMBADA

    Viewer • Updated Jun 13, 2025 • 2.26k • 128

  • thunder-research-group/SNU_Ko-WinoGrande

    Viewer • Updated Jun 13, 2025 • 1.27k • 46

  • thunder-research-group/SNU_Ko-ARC

    Viewer • Updated Jun 13, 2025 • 3.54k • 11

  • thunder-research-group/SNU_Ko-GSM8K

    Viewer • Updated Oct 16, 2025 • 1.32k • 35 • 1

  • thunder-research-group/SNU_Ko-IFEval

    Viewer • Updated Jun 13, 2025 • 841 • 233

  • thunder-research-group/SNU_Ko-EQ-Bench

    Viewer • Updated Jun 13, 2025 • 171 • 25

  • skt/kobest_v1

    Viewer • Updated Mar 28, 2024 • 23.4k • 3.33k • 54

    Note We use hellaswag > test set for evaluation


  • HAERAE-HUB/KMMLU

    Viewer • Updated Mar 5, 2024 • 244k • 8.49k • 97

  • HYU-NLP/KR-HumanEval

    Viewer • Updated Jun 3, 2025 • 328 • 28

    Note We use v1 for evaluation


  • LGCNS/KorQuAD_2.0

    Viewer • Updated Aug 7, 2025 • 93.7k • 180 • 2

  • thunder-research-group/SNU_Ko-MuSR

    Viewer • Updated Nov 24, 2025 • 750 • 22

  • thunder-research-group/SNU_Thunder-KoNUBench

    Viewer • Updated about 14 hours ago • 4.78k • 25 • 1
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs