Dynamic Risk Assessments for Offensive Cybersecurity Agents Paper • 2505.18384 • Published May 23, 2025 • 8
SafeArena: Evaluating the Safety of Autonomous Web Agents Paper • 2503.04957 • Published Mar 6, 2025 • 21
Tab-MIA: A Benchmark Dataset for Membership Inference Attacks on Tabular Data in LLMs Paper • 2507.17259 • Published Jul 23, 2025