Large Language Models Align with the Human Brain during Creative Thinking Paper • 2604.03480 • Published 12 days ago • 4
Beyond Accuracy: Unveiling Inefficiency Patterns in Tool-Integrated Reasoning Paper • 2604.05404 • Published 8 days ago • 41
Xpertbench: Expert Level Tasks with Rubrics-Based Evaluation Paper • 2604.02368 • Published 19 days ago • 11