athena129 committed on
Commit
a3f982b
·
verified ·
1 Parent(s): 7ddb714

Remove emoji from headline result table

  1. README.md +1 -1
README.md CHANGED
@@ -216,7 +216,7 @@ Evaluated under the [Foundation-Sec-8B protocol (arXiv:2504.21039 §B.3-B.4)](ht
 
  | Benchmark | Metric | CyberSecQwen-4B | Foundation-Sec-Instruct-8B | Δ |
  |---|---|---:|---:|---:|
- | **CTI-MCQ** (2,500 items) | strict_acc, 5-trial mean ± std | **0.5868 ± 0.0029** | 0.4996 | **+8.7 pp** |
+ | **CTI-MCQ** (2,500 items) | strict_acc, 5-trial mean ± std | **0.5868 ± 0.0029** | 0.4996 | **+8.7 pp** |
  | **CTI-RCM** (1,000 items) | strict_acc, 5-trial mean ± std | **0.6664 ± 0.0023** | 0.6850 | -1.9 pp |
 
  Parseable rates were 100% on CTI-RCM and 98.1% on CTI-MCQ — the model produces well-formed outputs in the expected response convention.
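
The Δ column in the table can be checked against the two accuracy columns. The following is a minimal sketch, assuming Δ is the plain difference of the reported mean accuracies expressed in percentage points and rounded to one decimal. `delta_pp` is a hypothetical helper written for this check, not something taken from the repository.

```python
def delta_pp(model_acc: float, baseline_acc: float) -> float:
    """Return model minus baseline accuracy in percentage points, 1 decimal."""
    return round((model_acc - baseline_acc) * 100, 1)


# Mean accuracies copied from the table rows in the diff.
cti_mcq = delta_pp(0.5868, 0.4996)  # CyberSecQwen-4B vs Foundation-Sec-Instruct-8B
cti_rcm = delta_pp(0.6664, 0.6850)

print(f"CTI-MCQ: {cti_mcq:+.1f} pp")
print(f"CTI-RCM: {cti_rcm:+.1f} pp")
```

Under that assumption the script reproduces the +8.7 pp and -1.9 pp entries shown in the table.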