SkillX: Automatically Constructing Skill Knowledge Bases for Agents • Paper • 2604.04804 • Published 8 days ago • 31
LightThinker++: From Reasoning Compression to Memory Management • Paper • 2604.03679 • Published 10 days ago • 33
How Controllable Are Large Language Models? A Unified Evaluation across Behavioral Granularities • Paper • 2603.02578 • Published Mar 3 • 25
Aligning Agentic World Models via Knowledgeable Experience Learning • Paper • 2601.13247 • Published Jan 19 • 15
Illusions of Confidence? Diagnosing LLM Truthfulness via Neighborhood Consistency • Paper • 2601.05905 • Published Jan 9 • 20
Can We Predict Before Executing Machine Learning Agents? • Paper • 2601.05930 • Published Jan 9 • 28
InnoGym: Benchmarking the Innovation Potential of AI Agents • Paper • 2512.01822 • Published Dec 1, 2025 • 36
Skywork/Skywork-Reward-Llama-3.1-8B-v0.2 • Text Classification • 8B • Updated Oct 25, 2024 • 86.7k • 42
LightMem: Lightweight and Efficient Memory-Augmented Generation • Paper • 2510.18866 • Published Oct 21, 2025 • 115
Executable Knowledge Graphs for Replicating AI Research • Paper • 2510.17795 • Published Oct 20, 2025 • 15