Prepare Reasoning Language Models for Multi-Agent Debate with Self-Debate Reinforcement Learning Paper • 2601.22297 • Published Jan 29 • 3