Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2502.18864

Agentic AI for science

The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search

Paper • 2504.08066 • Published Apr 10, 2025 • 22
From LLM Reasoning to Autonomous AI Agents: A Comprehensive Review

Paper • 2504.19678 • Published Apr 28, 2025 • 3
AIGS: Generating Science from AI-Powered Automated Falsification

Paper • 2411.11910 • Published Nov 17, 2024
AgentRxiv: Towards Collaborative Autonomous Research

Paper • 2503.18102 • Published Mar 23, 2025 • 25

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6, 2025 • 113
Towards an AI co-scientist

Paper • 2502.18864 • Published Feb 26, 2025 • 52
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published Feb 25, 2025 • 75
MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published Feb 20, 2025 • 195

Towards an AI co-scientist

Paper • 2502.18864 • Published Feb 26, 2025 • 52

Towards an AI co-scientist

Paper • 2502.18864 • Published Feb 26, 2025 • 52
START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6, 2025 • 113
Survey on Evaluation of LLM-based Agents

Paper • 2503.16416 • Published Mar 20, 2025 • 96

收集的感兴趣的AI

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published Feb 20, 2025 • 195
SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20, 2025 • 110
How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?

Paper • 2502.14502 • Published Feb 20, 2025 • 92
PC-Agent: A Hierarchical Multi-Agent Collaboration Framework for Complex Task Automation on PC

Paper • 2502.14282 • Published Feb 20, 2025 • 29

DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

Paper • 2506.11763 • Published Jun 13, 2025 • 74
A Comprehensive Survey of Deep Research: Systems, Methodologies, and Applications

Paper • 2506.12594 • Published Jun 14, 2025 • 3
Towards an AI co-scientist

Paper • 2502.18864 • Published Feb 26, 2025 • 52
MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization

Paper • 2507.14683 • Published Jul 19, 2025 • 136

Towards an AI co-scientist

Paper • 2502.18864 • Published Feb 26, 2025 • 52

Towards an AI co-scientist

Paper • 2502.18864 • Published Feb 26, 2025 • 52

AI-Automated Scientific Research

SurveyX: Academic Survey Automation via Large Language Models

Paper • 2502.14776 • Published Feb 20, 2025 • 100
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery

Paper • 2408.06292 • Published Aug 12, 2024 • 128
Towards an AI co-scientist

Paper • 2502.18864 • Published Feb 26, 2025 • 52
Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning

Paper • 2504.17192 • Published Apr 24, 2025 • 124

Executable Code Actions Elicit Better LLM Agents

Executable Code Actions Elicit Better LLM Agents

Paper • 2402.01030 • Published Feb 1, 2024 • 192
Towards an AI co-scientist

Paper • 2502.18864 • Published Feb 26, 2025 • 52

Agentic AI for science

The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search

Paper • 2504.08066 • Published Apr 10, 2025 • 22
From LLM Reasoning to Autonomous AI Agents: A Comprehensive Review

Paper • 2504.19678 • Published Apr 28, 2025 • 3
AIGS: Generating Science from AI-Powered Automated Falsification

Paper • 2411.11910 • Published Nov 17, 2024
AgentRxiv: Towards Collaborative Autonomous Research

Paper • 2503.18102 • Published Mar 23, 2025 • 25

DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

Paper • 2506.11763 • Published Jun 13, 2025 • 74
A Comprehensive Survey of Deep Research: Systems, Methodologies, and Applications

Paper • 2506.12594 • Published Jun 14, 2025 • 3
Towards an AI co-scientist

Paper • 2502.18864 • Published Feb 26, 2025 • 52
MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization

Paper • 2507.14683 • Published Jul 19, 2025 • 136

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6, 2025 • 113
Towards an AI co-scientist

Paper • 2502.18864 • Published Feb 26, 2025 • 52
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published Feb 25, 2025 • 75
MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published Feb 20, 2025 • 195

Towards an AI co-scientist

Paper • 2502.18864 • Published Feb 26, 2025 • 52

Towards an AI co-scientist

Paper • 2502.18864 • Published Feb 26, 2025 • 52

Towards an AI co-scientist

Paper • 2502.18864 • Published Feb 26, 2025 • 52

Towards an AI co-scientist

Paper • 2502.18864 • Published Feb 26, 2025 • 52
START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6, 2025 • 113
Survey on Evaluation of LLM-based Agents

Paper • 2503.16416 • Published Mar 20, 2025 • 96

AI-Automated Scientific Research

SurveyX: Academic Survey Automation via Large Language Models

Paper • 2502.14776 • Published Feb 20, 2025 • 100
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery

Paper • 2408.06292 • Published Aug 12, 2024 • 128
Towards an AI co-scientist

Paper • 2502.18864 • Published Feb 26, 2025 • 52
Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning

Paper • 2504.17192 • Published Apr 24, 2025 • 124

收集的感兴趣的AI

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published Feb 20, 2025 • 195
SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20, 2025 • 110
How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?

Paper • 2502.14502 • Published Feb 20, 2025 • 92
PC-Agent: A Hierarchical Multi-Agent Collaboration Framework for Complex Task Automation on PC

Paper • 2502.14282 • Published Feb 20, 2025 • 29

Executable Code Actions Elicit Better LLM Agents

Executable Code Actions Elicit Better LLM Agents

Paper • 2402.01030 • Published Feb 1, 2024 • 192
Towards an AI co-scientist

Paper • 2502.18864 • Published Feb 26, 2025 • 52

Previous
1
2
Next

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs