UMEM: Unified Memory Extraction and Management Framework for Generalizable Memory Paper โข 2602.10652 โข Published Feb 11 โข 4
DeepWideSearch: Benchmarking Depth and Width in Agentic Information Seeking Paper โข 2510.20168 โข Published Oct 23, 2025 โข 28
HSCodeComp: A Realistic and Expert-level Benchmark for Deep Search Agents in Hierarchical Rule Application Paper โข 2510.19631 โข Published Oct 22, 2025 โข 28