ReTraceQA: Evaluating Reasoning Traces of Small Language Models in Commonsense Question Answering Paper • 2510.09351 • Published Oct 10, 2025
AgREE: Agentic Reasoning for Knowledge Graph Completion on Emerging Entities Paper • 2508.04118 • Published Aug 6, 2025
LiteraryQA: Towards Effective Evaluation of Long-document Narrative QA Paper • 2510.13494 • Published Oct 15, 2025 • 2