-
Can Large Language Models Understand Context?
Paper • 2402.00858 • Published • 24 -
OLMo: Accelerating the Science of Language Models
Paper • 2402.00838 • Published • 85 -
Self-Rewarding Language Models
Paper • 2401.10020 • Published • 153 -
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity
Paper • 2401.17072 • Published • 25
Collections
Discover the best community collections!
Collections including paper arxiv:2602.02474
-
Balancing Specialized and General Skills in LLMs: The Impact of Modern Tuning and Data Strategy
Paper • 2310.04945 • Published • 1 -
Skill-it! A Data-Driven Skills Framework for Understanding and Training Language Models
Paper • 2307.14430 • Published • 3 -
SoK: Agentic Skills -- Beyond Tool Use in LLM Agents
Paper • 2602.20867 • Published • 1 -
MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents
Paper • 2602.02474 • Published • 62
-
CAR-bench: Evaluating the Consistency and Limit-Awareness of LLM Agents under Real-World Uncertainty
Paper • 2601.22027 • Published • 85 -
Reinforcement World Model Learning for LLM-based Agents
Paper • 2602.05842 • Published • 27 -
Accurate Failure Prediction in Agents Does Not Imply Effective Failure Prevention
Paper • 2602.03338 • Published • 26 -
MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents
Paper • 2602.02474 • Published • 62
-
STEP3-VL-10B Technical Report
Paper • 2601.09668 • Published • 195 -
Advancing Open-source World Models
Paper • 2601.20540 • Published • 135 -
Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization
Paper • 2512.24615 • Published • 119 -
SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning
Paper • 2602.08234 • Published • 74
-
dLLM: Simple Diffusion Language Modeling
Paper • 2602.22661 • Published • 152 -
OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data
Paper • 2603.15594 • Published • 149 -
Qianfan-OCR: A Unified End-to-End Model for Document Intelligence
Paper • 2603.13398 • Published • 153 -
Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders
Paper • 2603.06569 • Published • 118
-
GLM-5: from Vibe Coding to Agentic Engineering
Paper • 2602.15763 • Published • 144 -
Recurrent-Depth VLA: Implicit Test-Time Compute Scaling of Vision-Language-Action Models via Latent Iterative Reasoning
Paper • 2602.07845 • Published • 71 -
LLaDA2.1: Speeding Up Text Diffusion via Token Editing
Paper • 2602.08676 • Published • 70 -
MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents
Paper • 2602.02474 • Published • 62
-
Agentic Reasoning for Large Language Models
Paper • 2601.12538 • Published • 203 -
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence
Paper • 2511.18538 • Published • 304 -
Agent Learning via Early Experience
Paper • 2510.08558 • Published • 277 -
Weak-Driven Learning: How Weak Agents make Strong Agents Stronger
Paper • 2602.08222 • Published • 290
-
VisMem: Latent Vision Memory Unlocks Potential of Vision-Language Models
Paper • 2511.11007 • Published • 15 -
O-Mem: Omni Memory System for Personalized, Long Horizon, Self-Evolving Agents
Paper • 2511.13593 • Published • 28 -
General Agentic Memory Via Deep Research
Paper • 2511.18423 • Published • 170 -
MemEvolve: Meta-Evolution of Agent Memory Systems
Paper • 2512.18746 • Published • 31
-
Can Large Language Models Understand Context?
Paper • 2402.00858 • Published • 24 -
OLMo: Accelerating the Science of Language Models
Paper • 2402.00838 • Published • 85 -
Self-Rewarding Language Models
Paper • 2401.10020 • Published • 153 -
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity
Paper • 2401.17072 • Published • 25
-
dLLM: Simple Diffusion Language Modeling
Paper • 2602.22661 • Published • 152 -
OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data
Paper • 2603.15594 • Published • 149 -
Qianfan-OCR: A Unified End-to-End Model for Document Intelligence
Paper • 2603.13398 • Published • 153 -
Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders
Paper • 2603.06569 • Published • 118
-
Balancing Specialized and General Skills in LLMs: The Impact of Modern Tuning and Data Strategy
Paper • 2310.04945 • Published • 1 -
Skill-it! A Data-Driven Skills Framework for Understanding and Training Language Models
Paper • 2307.14430 • Published • 3 -
SoK: Agentic Skills -- Beyond Tool Use in LLM Agents
Paper • 2602.20867 • Published • 1 -
MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents
Paper • 2602.02474 • Published • 62
-
GLM-5: from Vibe Coding to Agentic Engineering
Paper • 2602.15763 • Published • 144 -
Recurrent-Depth VLA: Implicit Test-Time Compute Scaling of Vision-Language-Action Models via Latent Iterative Reasoning
Paper • 2602.07845 • Published • 71 -
LLaDA2.1: Speeding Up Text Diffusion via Token Editing
Paper • 2602.08676 • Published • 70 -
MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents
Paper • 2602.02474 • Published • 62
-
Agentic Reasoning for Large Language Models
Paper • 2601.12538 • Published • 203 -
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence
Paper • 2511.18538 • Published • 304 -
Agent Learning via Early Experience
Paper • 2510.08558 • Published • 277 -
Weak-Driven Learning: How Weak Agents make Strong Agents Stronger
Paper • 2602.08222 • Published • 290
-
CAR-bench: Evaluating the Consistency and Limit-Awareness of LLM Agents under Real-World Uncertainty
Paper • 2601.22027 • Published • 85 -
Reinforcement World Model Learning for LLM-based Agents
Paper • 2602.05842 • Published • 27 -
Accurate Failure Prediction in Agents Does Not Imply Effective Failure Prevention
Paper • 2602.03338 • Published • 26 -
MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents
Paper • 2602.02474 • Published • 62
-
STEP3-VL-10B Technical Report
Paper • 2601.09668 • Published • 195 -
Advancing Open-source World Models
Paper • 2601.20540 • Published • 135 -
Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization
Paper • 2512.24615 • Published • 119 -
SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning
Paper • 2602.08234 • Published • 74
-
VisMem: Latent Vision Memory Unlocks Potential of Vision-Language Models
Paper • 2511.11007 • Published • 15 -
O-Mem: Omni Memory System for Personalized, Long Horizon, Self-Evolving Agents
Paper • 2511.13593 • Published • 28 -
General Agentic Memory Via Deep Research
Paper • 2511.18423 • Published • 170 -
MemEvolve: Meta-Evolution of Agent Memory Systems
Paper • 2512.18746 • Published • 31