view article Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 Mar 10 • 126
Towards a Mechanistic Understanding of Propositional Logical Reasoning in Large Language Models Paper • 2601.04260 • Published Jan 7 • 1
SeeUPO: Sequence-Level Agentic-RL with Convergence Guarantees Paper • 2602.06554 • Published Feb 6 • 6