Add community evaluation results for GPQA, MMLU-PRO

by nielsr HF Staff - opened about 20 hours ago

←

Files changed (2) hide show

.eval_results/gpqa.yaml ADDED Viewed

+- dataset:
+    id: Idavidrein/gpqa
+    task_id: diamond
+  value: 71
+  source:
+    url: https://huggingface.co/Zyphra/ZAYA1-8B
+    name: Model Card

.eval_results/mmlu-pro.yaml ADDED Viewed

+- dataset:
+    id: TIGER-Lab/MMLU-Pro
+    task_id: mmlu_pro
+  value: 74.2
+  source:
+    url: https://huggingface.co/Zyphra/ZAYA1-8B
+    name: Model Card