GLM-5 / .eval_results /yc-bench.yaml
ZHANGYUXUAN-zR's picture
Add YC-Bench evaluation results (avg $1,208,190) (#70)
04670d2
raw
history blame contribute delete
272 Bytes
- dataset:
id: collinear-ai/yc-bench
task_id: medium
value: 1208190
date: "2026-03-24"
source:
url: https://github.com/collinear-ai/yc-bench
name: "YC-Bench eval"
notes: "avg final funds (USD) across seeds 1,2,3. GLM-5 (via OpenRouter z-ai/glm-5)"