Transforming Science with Large Language Models: A Survey on AI-assisted Scientific Discovery, Experimentation, Content Generation, and Evaluation • Paper • 2502.05151 • Published Feb 7, 2025
DocMMIR: A Framework for Document Multi-modal Information Retrieval • Paper • 2505.19312 • Published May 25, 2025 • 1
OmniVideoBench: Towards Audio-Visual Understanding Evaluation for Omni MLLMs • Paper • 2510.10689 • Published Oct 12, 2025 • 47
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence • Paper • 2511.18538 • Published Nov 23, 2025 • 304
AutoMV: An Automatic Multi-Agent System for Music Video Generation • Paper • 2512.12196 • Published Dec 13, 2025 • 7
Context as a Tool: Context Management for Long-Horizon SWE-Agents • Paper • 2512.22087 • Published Dec 26, 2025 • 3
Encyclo-K: Evaluating LLMs with Dynamically Composed Knowledge Statements • Paper • 2512.24867 • Published Dec 31, 2025 • 1
Dynamic Large Concept Models: Latent Reasoning in an Adaptive Semantic Space • Paper • 2512.24617 • Published Dec 31, 2025 • 66
Large-Scale Terminal Agentic Trajectory Generation from Dockerized Environments • Paper • 2602.01244 • Published Feb 1 • 16
CMI-RewardBench: Evaluating Music Reward Models with Compositional Multimodal Instruction • Paper • 2603.00610 • Published Feb 28 • 35
InCoder-32B: Code Foundation Model for Industrial Scenarios • Paper • 2603.16790 • Published Mar 17 • 308
Dynamics Within Latent Chain-of-Thought: An Empirical Study of Causal Structure • Paper • 2602.08783 • Published Feb 9 • 1
InCoder-32B-Thinking: Industrial Code World Model for Thinking • Paper • 2604.03144 • Published 15 days ago • 231
A$^2$FM: An Adaptive Agent Foundation Model for Tool-Aware Hybrid Reasoning • Paper • 2510.12838 • Published Oct 13, 2025 • 25
ACADREASON: Exploring the Limits of Reasoning Models with Academic Research Problems • Paper • 2510.11652 • Published Oct 13, 2025 • 30
TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling • Paper • 2508.17445 • Published Aug 24, 2025 • 80