MemPO: Self-Memory Policy Optimization for Long-Horizon Agents • Paper • 2603.00680 • Published 5 days ago