-
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
Paper • 2411.03562 • Published • 70 -
Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning
Paper • 2502.06060 • Published • 37 -
MLGym: A New Framework and Benchmark for Advancing AI Research Agents
Paper • 2502.14499 • Published • 195 -
SurveyX: Academic Survey Automation via Large Language Models
Paper • 2502.14776 • Published • 100
Collections
Discover the best community collections!
Collections including paper arxiv:2605.23904
-
ClawEnvKit: Automatic Environment Generation for Claw-Like Agents
Paper • 2604.18543 • Published • 30 -
Near-Future Policy Optimization
Paper • 2604.20733 • Published • 77 -
Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon Tasks
Paper • 2604.20987 • Published • 21 -
PATRA: Pattern-Aware Alignment and Balanced Reasoning for Time Series Question Answering
Paper • 2602.23161 • Published
-
VISTA: A Test-Time Self-Improving Video Generation Agent
Paper • 2510.15831 • Published • 24 -
Build Your Personalized Research Group: A Multiagent Framework for Continual and Interactive Science Automation
Paper • 2510.15624 • Published • 15 -
SkillOpt: Executive Strategy for Self-Evolving Agent Skills
Paper • 2605.23904 • Published • 168
-
The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain
Paper • 2509.26507 • Published • 550 -
mHC: Manifold-Constrained Hyper-Connections
Paper • 2512.24880 • Published • 328 -
NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos
Paper • 2601.00393 • Published • 132 -
LTX-2: Efficient Joint Audio-Visual Foundation Model
Paper • 2601.03233 • Published • 179
-
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems
Paper • 2504.01990 • Published • 305 -
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models
Paper • 2504.10479 • Published • 310 -
What, How, Where, and How Well? A Survey on Test-Time Scaling in Large Language Models
Paper • 2503.24235 • Published • 55 -
Seedream 3.0 Technical Report
Paper • 2504.11346 • Published • 70
-
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
Paper • 2411.03562 • Published • 70 -
Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning
Paper • 2502.06060 • Published • 37 -
MLGym: A New Framework and Benchmark for Advancing AI Research Agents
Paper • 2502.14499 • Published • 195 -
SurveyX: Academic Survey Automation via Large Language Models
Paper • 2502.14776 • Published • 100
-
ClawEnvKit: Automatic Environment Generation for Claw-Like Agents
Paper • 2604.18543 • Published • 30 -
Near-Future Policy Optimization
Paper • 2604.20733 • Published • 77 -
Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon Tasks
Paper • 2604.20987 • Published • 21 -
PATRA: Pattern-Aware Alignment and Balanced Reasoning for Time Series Question Answering
Paper • 2602.23161 • Published
-
The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain
Paper • 2509.26507 • Published • 550 -
mHC: Manifold-Constrained Hyper-Connections
Paper • 2512.24880 • Published • 328 -
NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos
Paper • 2601.00393 • Published • 132 -
LTX-2: Efficient Joint Audio-Visual Foundation Model
Paper • 2601.03233 • Published • 179
-
VISTA: A Test-Time Self-Improving Video Generation Agent
Paper • 2510.15831 • Published • 24 -
Build Your Personalized Research Group: A Multiagent Framework for Continual and Interactive Science Automation
Paper • 2510.15624 • Published • 15 -
SkillOpt: Executive Strategy for Self-Evolving Agent Skills
Paper • 2605.23904 • Published • 168
-
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems
Paper • 2504.01990 • Published • 305 -
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models
Paper • 2504.10479 • Published • 310 -
What, How, Where, and How Well? A Survey on Test-Time Scaling in Large Language Models
Paper • 2503.24235 • Published • 55 -
Seedream 3.0 Technical Report
Paper • 2504.11346 • Published • 70