-
The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search
Paper • 2504.08066 • Published • 22 -
From LLM Reasoning to Autonomous AI Agents: A Comprehensive Review
Paper • 2504.19678 • Published • 3 -
AIGS: Generating Science from AI-Powered Automated Falsification
Paper • 2411.11910 • Published -
AgentRxiv: Towards Collaborative Autonomous Research
Paper • 2503.18102 • Published • 25
Collections
Discover the best community collections!
Collections including paper arxiv:2502.18864
-
START: Self-taught Reasoner with Tools
Paper • 2503.04625 • Published • 113 -
Towards an AI co-scientist
Paper • 2502.18864 • Published • 52 -
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution
Paper • 2502.18449 • Published • 75 -
MLGym: A New Framework and Benchmark for Advancing AI Research Agents
Paper • 2502.14499 • Published • 195
-
MLGym: A New Framework and Benchmark for Advancing AI Research Agents
Paper • 2502.14499 • Published • 195 -
SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines
Paper • 2502.14739 • Published • 110 -
How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?
Paper • 2502.14502 • Published • 92 -
PC-Agent: A Hierarchical Multi-Agent Collaboration Framework for Complex Task Automation on PC
Paper • 2502.14282 • Published • 29
-
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents
Paper • 2506.11763 • Published • 74 -
A Comprehensive Survey of Deep Research: Systems, Methodologies, and Applications
Paper • 2506.12594 • Published • 3 -
Towards an AI co-scientist
Paper • 2502.18864 • Published • 52 -
MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization
Paper • 2507.14683 • Published • 136
-
SurveyX: Academic Survey Automation via Large Language Models
Paper • 2502.14776 • Published • 100 -
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery
Paper • 2408.06292 • Published • 128 -
Towards an AI co-scientist
Paper • 2502.18864 • Published • 52 -
Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning
Paper • 2504.17192 • Published • 124
-
The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search
Paper • 2504.08066 • Published • 22 -
From LLM Reasoning to Autonomous AI Agents: A Comprehensive Review
Paper • 2504.19678 • Published • 3 -
AIGS: Generating Science from AI-Powered Automated Falsification
Paper • 2411.11910 • Published -
AgentRxiv: Towards Collaborative Autonomous Research
Paper • 2503.18102 • Published • 25
-
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents
Paper • 2506.11763 • Published • 74 -
A Comprehensive Survey of Deep Research: Systems, Methodologies, and Applications
Paper • 2506.12594 • Published • 3 -
Towards an AI co-scientist
Paper • 2502.18864 • Published • 52 -
MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization
Paper • 2507.14683 • Published • 136
-
START: Self-taught Reasoner with Tools
Paper • 2503.04625 • Published • 113 -
Towards an AI co-scientist
Paper • 2502.18864 • Published • 52 -
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution
Paper • 2502.18449 • Published • 75 -
MLGym: A New Framework and Benchmark for Advancing AI Research Agents
Paper • 2502.14499 • Published • 195
-
SurveyX: Academic Survey Automation via Large Language Models
Paper • 2502.14776 • Published • 100 -
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery
Paper • 2408.06292 • Published • 128 -
Towards an AI co-scientist
Paper • 2502.18864 • Published • 52 -
Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning
Paper • 2504.17192 • Published • 124
-
MLGym: A New Framework and Benchmark for Advancing AI Research Agents
Paper • 2502.14499 • Published • 195 -
SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines
Paper • 2502.14739 • Published • 110 -
How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?
Paper • 2502.14502 • Published • 92 -
PC-Agent: A Hierarchical Multi-Agent Collaboration Framework for Complex Task Automation on PC
Paper • 2502.14282 • Published • 29