Think Before Recommend: Unleashing the Latent Reasoning Power for
Sequential Recommendation
Paper
• 2503.22675
• Published • 36
Exploring Data Scaling Trends and Effects in Reinforcement Learning from
Human Feedback
Paper
• 2503.22230
• Published • 45
ReSum: Unlocking Long-Horizon Search Intelligence via Context
Summarization
Paper
• 2509.13313
• Published • 80
WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon
Agents
Paper
• 2509.13309
• Published • 67
Towards General Agentic Intelligence via Environment Scaling
Paper
• 2509.13311
• Published • 72
WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic
Data and Scalable Reinforcement Learning
Paper
• 2509.13305
• Published • 91
Scaling Agents via Continual Pre-training
Paper
• 2509.13310
• Published • 117
WebWeaver: Structuring Web-Scale Evidence with Dynamic Outlines for
Open-Ended Deep Research
Paper
• 2509.13312
• Published • 106
DeepResearchGym: A Free, Transparent, and Reproducible Evaluation
Sandbox for Deep Research
Paper
• 2505.19253
• Published • 34