SaylorTwift HF Staff committed on
Commit
883f633
·
verified ·
1 Parent(s): 2755962

Add Claw-Eval evaluation results

Files changed (1)
  1. .eval_results/claw_eval.yaml +20 -0
.eval_results/claw_eval.yaml ADDED
@@ -0,0 +1,20 @@
+ - dataset:
+     id: claw-eval/Claw-Eval
+     task_id: general
+   value: 61.5
+   date: '2026-04-23'
+   notes: Pass³% | N=3 | 161 tasks
+   source:
+     url: https://claw-eval.github.io
+     name: Claw-Eval Leaderboard
+     user: tobiaslee
+ - dataset:
+     id: claw-eval/Claw-Eval
+     task_id: multi_turn
+   value: 65.8
+   date: '2026-04-23'
+   notes: Pass³% | N=3 | 38 tasks
+   source:
+     url: https://claw-eval.github.io
+     name: Claw-Eval Leaderboard
+     user: tobiaslee
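
As a quick sanity check on the two entries above, the following minimal Python sketch splits each `notes` string into its parts and computes the whole number of tasks that the reported `value` would imply. It assumes `value` is a percentage of the task count given in `notes`; the diff does not state this. The helper names `parse_notes` and `implied_passes` are hypothetical and not part of any Claw-Eval or Hugging Face tooling.

```python
import re


def parse_notes(notes: str) -> dict:
    """Split a notes string such as 'Pass³% | N=3 | 161 tasks' into fields."""
    metric, trials, tasks = [part.strip() for part in notes.split("|")]
    return {
        "metric": metric,                                   # e.g. 'Pass³%'
        "trials": int(trials.split("=")[1]),                # the N=... field
        "tasks": int(re.match(r"(\d+)", tasks).group(1)),   # leading task count
    }


def implied_passes(value: float, notes: str) -> int:
    """Nearest whole number of tasks consistent with the reported percentage.

    Assumption: `value` is a percentage of the task count in `notes`.
    """
    return round(value / 100 * parse_notes(notes)["tasks"])


# (task_id, value, notes) copied from the two entries in the diff.
entries = [
    ("general", 61.5, "Pass³% | N=3 | 161 tasks"),
    ("multi_turn", 65.8, "Pass³% | N=3 | 38 tasks"),
]

for task_id, value, notes in entries:
    print(task_id, parse_notes(notes), implied_passes(value, notes))
```

Under that assumption the figures are self-consistent: 99 of 161 tasks rounds to 61.5% and 25 of 38 rounds to 65.8%.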