YAML Metadata Error: Invalid content in Eval Result file .eval_results/mmlu-pro.yaml

Task ID "mmlu_pro" does not match any task in dataset "TIGER-Lab/MMLU-Pro". Available: none
Commit 377bae8 by lckr: Add community evaluation results for GPQA, MMLU-PRO, SWE-BENCH_VERIFIED (#4)
- dataset:
    id: TIGER-Lab/MMLU-Pro
    task_id: mmlu_pro
  value: 83.4
  source:
    url: https://huggingface.co/arcee-ai/Trinity-Large-Thinking
    name: Model Card
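
The error reported for this file amounts to a lookup of each entry's task_id against the tasks registered for its dataset. The sketch below illustrates that kind of check in Python. It is not the Hub's actual validation code: the function name find_unknown_tasks and the available_tasks mapping are hypothetical, and the entry is written as a plain dict mirroring the YAML above, so no YAML parser is needed.

```python
# Hypothetical helper, not part of any Hugging Face library.
def find_unknown_tasks(entries, available_tasks):
    """Return one message per entry whose task_id is not registered for its dataset.

    entries: list of dicts shaped like the eval-results entries above.
    available_tasks: assumed mapping of dataset id -> list of known task ids.
    """
    problems = []
    for entry in entries:
        dataset = entry.get("dataset", {})
        dataset_id = dataset.get("id")
        task_id = dataset.get("task_id")
        known = available_tasks.get(dataset_id, [])
        if task_id not in known:
            listed = ", ".join(known) if known else "none"
            problems.append(
                f'Task ID "{task_id}" does not match any task in dataset '
                f'"{dataset_id}". Available: {listed}'
            )
    return problems


# The single entry from this file, as a plain dict.
entries = [
    {
        "dataset": {"id": "TIGER-Lab/MMLU-Pro", "task_id": "mmlu_pro"},
        "value": 83.4,
        "source": {
            "url": "https://huggingface.co/arcee-ai/Trinity-Large-Thinking",
            "name": "Model Card",
        },
    }
]

# Assumption: the dataset has no registered tasks, matching "Available: none".
for message in find_unknown_tasks(entries, {"TIGER-Lab/MMLU-Pro": []}):
    print(message)
```

Under this reading, the entry itself is well-formed, and the mismatch comes from the dataset side listing no tasks at all.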