OpenSec: Measuring Incident Response Agent Calibration Under Adversarial Evidence
Paper • 2601.21083 • Published
OpenSec is a dual-control reinforcement learning (RL) environment, dataset, and evaluation suite that measures agent calibration on incident response tasks.