DEBATE: A Large-Scale Benchmark for Role-Playing LLM Agents in Multi-Agent, Long-Form Debates Paper • 2510.25110 • Published Oct 29, 2025