SkillClaw: Let Skills Evolve Collectively with Agentic Evolver • Paper • 2604.08377 • Published 5 days ago • 267
Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability • Paper • 2604.06628 • Published 6 days ago • 304
Demystifying When Pruning Works via Representation Hierarchies • Paper • 2603.24652 • Published 8 days ago • 19
ACES: Who Tests the Tests? Leave-One-Out AUC Consistency for Code Generation • Paper • 2604.03922 • Published 9 days ago • 52
ThinkTwice: Jointly Optimizing Large Language Models for Reasoning and Self-Refinement • Paper • 2604.01591 • Published 12 days ago • 38
How Well Do Agentic Skills Work in the Wild: Benchmarking LLM Skill Usage in Realistic Settings • Paper • 2604.04323 • Published 8 days ago • 38
SkillX: Automatically Constructing Skill Knowledge Bases for Agents • Paper • 2604.04804 • Published 8 days ago • 31
ClawArena: Benchmarking AI Agents in Evolving Information Environments • Paper • 2604.04202 • Published 9 days ago • 36
LightThinker++: From Reasoning Compression to Memory Management • Paper • 2604.03679 • Published 10 days ago • 33
TriAttention: Efficient Long Reasoning with Trigonometric KV Compression • Paper • 2604.04921 • Published 8 days ago • 104
Revision or Re-Solving? Decomposing Second-Pass Gains in Multi-LLM Pipelines • Paper • 2604.01029 • Published 12 days ago • 7
Reasoning Shift: How Context Silently Shortens LLM Reasoning • Paper • 2604.01161 • Published 12 days ago • 31
Brainstacks: Cross-Domain Cognitive Capabilities via Frozen MoE-LoRA Stacks for Continual LLM Learning • Paper • 2604.01152 • Published 12 days ago • 5
Executing as You Generate: Hiding Execution Latency in LLM Code Generation • Paper • 2604.00491 • Published 13 days ago • 6
Ask or Assume? Uncertainty-Aware Clarification-Seeking in Coding Agents • Paper • 2603.26233 • Published 18 days ago • 8
Apriel-Reasoner: RL Post-Training for General-Purpose and Efficient Reasoning • Paper • 2604.02007 • Published 12 days ago • 12
Investigating Autonomous Agent Contributions in the Wild: Activity Patterns and Code Change over Time • Paper • 2604.00917 • Published 13 days ago • 18