Demystifying Reinforcement Learning for Long-Horizon Tool-Using Agents: A Comprehensive Recipe Paper • 2603.21972 • Published 26 days ago • 5
LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning Paper • 2603.21065 • Published 27 days ago • 77