---
title: RefusalBench
emoji: 🧬
colorFrom: red
colorTo: indigo
sdk: gradio
sdk_version: 5.50.0
app_file: app.py
pinned: true
license: mit
tags:
- benchmark
- llm-evaluation
- ai-safety
- biosecurity
- refusal
- leaderboard
datasets:
- appliedscientific/refusalbench
---
# RefusalBench
Interactive leaderboard for RefusalBench, a reproducible, evergreen benchmark of frontier LLM refusal on biological research prompts.
**Paper:** [arXiv:2605.21545](https://arxiv.org/abs/2605.21545)
**GitHub:** [AppliedScientific/refusalbench](https://github.com/AppliedScientific/refusalbench)