DedeProGames commited on
Commit
3f5fcc3
·
verified ·
1 Parent(s): aef7b2b

Fix benchmark

Browse files
.eval_results/swe_bench_verified.yaml CHANGED
@@ -1,7 +1,7 @@
1
  - dataset:
2
  id: SWE-bench/SWE-bench_Verified
3
  task_id: swe_bench_%_resolved
4
- value: 78.7
5
  date: '2026-04-23'
6
  source:
7
  url: https://huggingface.co/OrionLLM/GRM-2.6-Plus
 
1
  - dataset:
2
  id: SWE-bench/SWE-bench_Verified
3
  task_id: swe_bench_%_resolved
4
+ value: 77.7
5
  date: '2026-04-23'
6
  source:
7
  url: https://huggingface.co/OrionLLM/GRM-2.6-Plus