stephen-flood 's Collections Benchmarks
updated
Viewer
• Updated • 2.09k • 64
• 4
Viewer
• Updated • 5.82M • 6.97k
• 43
Viewer
• Updated • 231k • 417k
• 710
Benchmark
• Updated • 17.6k • 775k
• 1.25k
Viewer
• Updated • 19.6k • 9
lighteval/legal_summarization
Viewer
• Updated • 26.9k • 154
• 28
Viewer
• Updated • 1.6k • 132
• 2
lighteval/synthetic_reasoning
Viewer
• Updated • 33k • 160
• 8
lighteval/synthetic_reasoning_natural
Viewer
• Updated • 22k • 81
• 15
Viewer
• Updated • 90.3k • 75
• 3
lighteval/GPT3_unscramble
Viewer
• Updated • 50k • 6
• 1
lighteval/aimo_progress_prize_1
Viewer
• Updated • 10 • 11
Viewer
• Updated • 1.7k • 12
Viewer
• Updated • 72.5k • 5.52k
• 147
Viewer
• Updated • 860k • 35.7k
• 559
Text Classification
• Updated • 48.9k
• 82
Jofthomas/hermes-function-calling-thinking-V1
Viewer
• Updated • 3.57k • 509
• 74
NousResearch/hermes-function-calling-v1
Viewer
• Updated • 11.6k • 11.7k
• 395
Viewer
• Updated • 15.7k • 49
• 7
Viewer
• Updated • 621M • 12.8k
• 87
open-web-math/open-web-math
Viewer
• Updated • 6.32M • 18k
• 333