lmarena-ai's Collections
SearchArena
Arena-Hard-Auto
Prompt-to-Leaderboard
Arena-Hard-Auto
Updated Apr 24, 2025
An automatic evaluation tool for LLMs.
Arena Hard Viewer
Browse and view model judgments in benchmarks
lmarena-ai/arena-hard-auto
Updated May 1, 2025 • 682 • 7