MemPO: Self-Memory Policy Optimization for Long-Horizon Agents Paper • 2603.00680 • Published 6 days ago • 1