GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning Paper • 2604.02721 • Published 11 days ago • 351
Lie to Me: How Faithful Is Chain-of-Thought Reasoning in Reasoning Models? Paper • 2603.22582 • Published 21 days ago • 7