nielsr HF Staff commited on
Commit
e55481e
·
verified ·
1 Parent(s): 305ae30

Update .eval_results/swe_bench_verified.yaml

Browse files
.eval_results/swe_bench_verified.yaml CHANGED
@@ -1,8 +1,9 @@
1
  - dataset:
2
  id: SWE-bench/SWE-bench_Verified
3
  task_id: swe_bench_%_resolved
4
- value: 73.1
5
  source:
6
- url: https://huggingface.co/deepseek-ai/DeepSeek-V3.2/blob/main/assets/paper.pdf
7
- name: DeepSeek-V3.2 technical report
8
- user: nielsr
 
 
1
  - dataset:
2
  id: SWE-bench/SWE-bench_Verified
3
  task_id: swe_bench_%_resolved
4
+ value: 70.0
5
  source:
6
+ url: https://www.swebench.com/
7
+ name: SWE-Bench official evaluation
8
+ user: nielsr
9
+ notes: high reasoning