Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
thunder-research-group
's Collections
SNU Thunder-LLM Korean Benchmark Suite
SNU Thunder-LLM English Benchmark Suite
SNU Thunder-LLM Dataset Suite
Post-Training Datasets
Negation Benchmarks
SNU Thunder-LLM English Benchmark Suite
updated
3 days ago
Upvote
-
thunder-research-group/SNU_Thunder-NUBench
Viewer
•
Updated
3 days ago
•
5.08k
•
34
•
3
allenai/winogrande
Viewer
•
Updated
Jul 11, 2025
•
81.4k
•
212k
•
79
allenai/ai2_arc
Viewer
•
Updated
Dec 21, 2023
•
7.79k
•
407k
•
330
openai/gsm8k
Benchmark
•
Updated
about 1 month ago
•
17.6k
•
823k
•
1.27k
google/IFEval
Viewer
•
Updated
Aug 14, 2024
•
541
•
103k
•
148
Rowan/hellaswag
Viewer
•
Updated
Jul 10, 2025
•
60k
•
303k
•
167
cais/mmlu
Viewer
•
Updated
Mar 8, 2024
•
231k
•
439k
•
713
openai/openai_humaneval
Viewer
•
Updated
Jan 4, 2024
•
164
•
222k
•
381
allenai/openbookqa
Viewer
•
Updated
Jan 4, 2024
•
11.9k
•
111k
•
127
Upvote
-
Share collection
View history
Collection guide
Browse collections