YAML Metadata Error: Invalid content in Eval Result file .eval_results/mmlu-pro.yaml

Task ID "mmlu_pro" does not match any task in dataset "TIGER-Lab/MMLU-Pro". Available: none

Add community evaluation results for GPQA, MMLU-PRO, SWE-BENCH_VERIFIED (#4)

377bae8 21 days ago

169 Bytes

- dataset:
    id: TIGER-Lab/MMLU-Pro
    task_id: mmlu_pro
  value: 83.4
  source:
    url: https://huggingface.co/arcee-ai/Trinity-Large-Thinking
    name: Model Card
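
The error reported for this file comes down to a membership check: the entry's `task_id` has to appear among the tasks registered for the dataset, and here none are listed. A minimal Python sketch of that check follows. It is not the Hub's actual validator; the function name `check_task_id` and the `available_tasks` argument are hypothetical, and the empty task list is assumed from the "Available: none" part of the message.

```python
def check_task_id(entry, available_tasks):
    """Return None if the entry's task_id is known, else an error message.

    Hypothetical helper for illustration; not the Hub's real validation code.
    """
    dataset = entry["dataset"]
    task_id = dataset["task_id"]
    if task_id in available_tasks:
        return None
    # Mirror the wording of the error shown for this file.
    listed = ", ".join(available_tasks) if available_tasks else "none"
    return (
        f'Task ID "{task_id}" does not match any task in dataset '
        f'"{dataset["id"]}". Available: {listed}'
    )


# The entry from .eval_results/mmlu-pro.yaml, written out as a dict.
entry = {
    "dataset": {"id": "TIGER-Lab/MMLU-Pro", "task_id": "mmlu_pro"},
    "value": 83.4,
    "source": {
        "url": "https://huggingface.co/arcee-ai/Trinity-Large-Thinking",
        "name": "Model Card",
    },
}

# Assumption: the dataset exposes no registered tasks ("Available: none").
message = check_task_id(entry, available_tasks=[])
print(message)
```

Under this reading, no value of `task_id` can pass while the dataset has no registered tasks, so the fix would lie with the dataset's task registration rather than with this file alone.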