GRM2-3b / .eval_results /gpqa.yaml
DedeProGames's picture
Update .eval_results/gpqa.yaml
45fe928 verified
raw
history blame contribute delete
215 Bytes
- dataset:
id: Idavidrein/gpqa
task_id: diamond
value: 83.8
date: '2026-04-06'
source:
url: https://huggingface.co/OrionLLM/GRM2-3b
name: Model Card
user: DedeProGames
notes: "With tools"