LLM evals and benchmark datasets - a davidberenstein1957 Collection

davidberenstein1957's Collections

LLM evals and benchmark datasets

updated May 13, 2025