Hugging Face
alessandrobondielli's Collections
Datasets-ScaleLLM • Updated Jul 1, 2025
truthfulqa/truthful_qa • Viewer • Updated Jan 4, 2024 • 1.63k • 88.8k • 278
allenai/qasc • Viewer • Updated Jan 4, 2024 • 9.98k • 4.09k • 23
Anthropic/model-written-evals • Viewer • Updated Dec 21, 2022 • 3.25k • 1.71k • 62
yesilhealth/Health_Benchmarks • Viewer • Updated Apr 20, 2025 • 7.54k • 870 • 10
maveriq/bigbenchhard • Viewer • Updated Sep 29, 2023 • 6.51k • 1.21k • 43
Note: Filter out the subsets that have no choice field.
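The filtering step in the note could be sketched roughly as follows. This is a minimal illustration, not the collection author's method: the helper `subsets_with_field`, the subset names, and the column lists are all hypothetical, and the field name `choice` is taken from the note as written and may not match the actual column name in the dataset.

```python
from typing import Dict, Iterable, List


def subsets_with_field(
    columns_by_subset: Dict[str, Iterable[str]], field: str = "choice"
) -> List[str]:
    """Return the sorted names of subsets whose columns include `field`."""
    return sorted(
        name for name, cols in columns_by_subset.items() if field in set(cols)
    )


# Hypothetical subset names and columns, for illustration only.
example_columns = {
    "subset_a": ["question", "choice", "target"],
    "subset_b": ["question", "target"],
}

kept = subsets_with_field(example_columns)
print(kept)
```

In practice the mapping would need to be built from the real dataset, for instance by listing its configurations with `datasets.get_dataset_config_names` and reading `column_names` from each loaded subset. How the subsets are actually organized and named is an assumption to verify against the dataset card.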
tau/commonsense_qa • Viewer • Updated Jan 4, 2024 • 12.1k • 64.4k • 145
allenai/sciq • Viewer • Updated Jan 4, 2024 • 13.7k • 80.4k • 136
allenai/openbookqa • Viewer • Updated Jan 4, 2024 • 11.9k • 111k • 126
allenai/ai2_arc • Viewer • Updated Dec 21, 2023 • 7.79k • 407k • 327
TIGER-Lab/MMLU-Pro • Benchmark • Updated Mar 11 • 12.1k • 113k • 468