GLM-5 / .eval_results /MathArena--aime_2026.yaml
ZHANGYUXUAN-zR's picture
Add MathArena evaluation result for aime/aime_2026 (#44)
8c33dc9
raw
history blame contribute delete
210 Bytes
- dataset:
id: MathArena/aime_2026
task_id: MathArena/aime_2026
value: 95.83
date: '2026-02-18'
source:
url: https://matharena.ai/?comp=aime--aime_2026
name: Official MathArena Evaluation