EvoCorps: An Evolutionary Multi-Agent Framework for Depolarizing Online Discourse Paper • 2602.08529 • Published Feb 9 • 2
Weak-to-Strong Jailbreaking on Large Language Models Paper • 2401.17256 • Published Jan 30, 2024 • 16