QED-Nano / .eval_results /MathArena--aime_2026.yaml
lewtun's picture
lewtun HF Staff
Add MathArena evaluation result for aime/aime_2026 (#3)
1016dde
- dataset:
id: MathArena/aime_2026
task_id: MathArena/aime_2026
value: 82.5
date: '2026-03-17'
source:
url: https://matharena.ai/?comp=aime--aime_2026
name: Official MathArena Evaluation