OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data Paper • 2603.15594 • Published 29 days ago • 149
Derail Yourself: Multi-turn LLM Jailbreak Attack through Self-discovered Clues Paper • 2410.10700 • Published Oct 14, 2024 • 3
CodeAttack: Revealing Safety Generalization Challenges of Large Language Models via Code Completion Paper • 2403.07865 • Published Mar 12, 2024 • 1