Ill-Ness commited on
Commit
6087469
·
verified ·
1 Parent(s): ea12140

Delete .eval_results/gpqa_diamond.yaml

Browse files
Files changed (1) hide show
  1. .eval_results/gpqa_diamond.yaml +0 -9
.eval_results/gpqa_diamond.yaml DELETED
@@ -1,9 +0,0 @@
1
- - dataset:
2
- id: Idavidrein/gpqa
3
- task_id: gpqa_diamond
4
- value: 0.226562
5
- source:
6
- url: https://huggingface.co/datasets/Idavidrein/gpqa
7
- name: Local Modal GPQA Diamond benchmark
8
- user: Surpem
9
- notes: local multiple-choice accuracy run; no dataset examples included in report