Turing Test on Screen: A Benchmark for Mobile GUI Agent Humanization Paper • 2604.09574 • Published Feb 24 • 21
Externalization in LLM Agents: A Unified Review of Memory, Skills, Protocols and Harness Engineering Paper • 2604.08224 • Published 6 days ago • 48
StepORLM: A Self-Evolving Framework With Generative Process Supervision For Operations Research Language Models Paper • 2509.22558 • Published Sep 26, 2025 • 4
MuonRec: Shifting the Optimizer Paradigm Beyond Adam in Scalable Generative Recommendation Paper • 2603.00416 • Published Feb 28 • 19
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B Text Generation • 33B • Updated Feb 24, 2025 • 1.02M • 1.53k