Toward Autonomous Long-Horizon Engineering for ML Research Paper • 2604.13018 • Published 4 days ago • 30
Low-rank Optimization Trajectories Modeling for LLM RLVR Acceleration Paper • 2604.11446 • Published 5 days ago • 3
SWE Agent Series Collection Models trained by SWE-Master and SWE-World, including both policy models and verifiers. • 13 items • Updated 25 days ago • 3
SWE-World: Building Software Engineering Agents in Docker-Free Environments Paper • 2602.03419 • Published Feb 3 • 41
SWE-Master: Unleashing the Potential of Software Engineering Agents via Post-Training Paper • 2602.03411 • Published Feb 3 • 39
From Trial-and-Error to Improvement: A Systematic Analysis of LLM Exploration Mechanisms in RLVR Paper • 2508.07534 • Published Aug 11, 2025 • 1
Unveiling the Flaws: Exploring Imperfections in Synthetic Data and Mitigation Strategies for Large Language Models Paper • 2406.12397 • Published Jun 18, 2024