ZHANGYUXUAN-zR RiddleHe commited on
Commit
04670d2
·
1 Parent(s): ec61e8d

Add YC-Bench evaluation results (avg $1,208,190) (#70)

Browse files

- Add YC-Bench evaluation results (avg $1,208,190) (91265655e21519a9dc811ba9115328e941f105d5)


Co-authored-by: Riddle He <RiddleHe@users.noreply.huggingface.co>

Files changed (1) hide show
  1. .eval_results/yc-bench.yaml +9 -0
.eval_results/yc-bench.yaml ADDED
@@ -0,0 +1,9 @@
 
 
 
 
 
 
 
 
 
 
1
+ - dataset:
2
+ id: collinear-ai/yc-bench
3
+ task_id: medium
4
+ value: 1208190
5
+ date: "2026-03-24"
6
+ source:
7
+ url: https://github.com/collinear-ai/yc-bench
8
+ name: "YC-Bench eval"
9
+ notes: "avg final funds (USD) across seeds 1,2,3. GLM-5 (via OpenRouter z-ai/glm-5)"