File size: 620 Bytes
a517e9c
ab29e65
 
 
 
a517e9c
09df299
a517e9c
ab29e65
 
 
 
 
 
 
 
 
 
 
a517e9c
 
ab29e65
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
---
title: RefusalBench
emoji: 🧬
colorFrom: red
colorTo: indigo
sdk: gradio
sdk_version: 5.50.0
app_file: app.py
pinned: true
license: mit
tags:
  - benchmark
  - llm-evaluation
  - ai-safety
  - biosecurity
  - refusal
  - leaderboard
datasets:
  - appliedscientific/refusalbench
---

# RefusalBench

Interactive leaderboard for the RefusalBench benchmark — a reproducible, evergreen evaluation of frontier LLM refusal on biological research prompts.

**Paper:** [arXiv:2605.21545](https://arxiv.org/abs/2605.21545)  
**GitHub:** [AppliedScientific/refusalbench](https://github.com/AppliedScientific/refusalbench)