- Socratic-Geo: Synthetic Data Generation and Geometric Reasoning via Multi-Agent Interaction
  Paper • 2602.03414 • Published
- MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data
  Paper • 2603.09206 • Published • 53
- Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data
  Paper • 2602.21320 • Published • 12
- ChartVerse: Scaling Chart Reasoning via Reliable Programmatic Synthesis from Scratch
  Paper • 2601.13606 • Published • 12
Collections including paper arxiv:2601.09668

- The Trinity of Consistency as a Defining Principle for General World Models
  Paper • 2602.23152 • Published • 201
- From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models
  Paper • 2602.22859 • Published • 151
- OmniGAIA: Towards Native Omni-Modal AI Agents
  Paper • 2602.22897 • Published • 53
- Imagination Helps Visual Reasoning, But Not Yet in Latent Space
  Paper • 2602.22766 • Published • 44

- The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain
  Paper • 2509.26507 • Published • 550
- Less is More: Recursive Reasoning with Tiny Networks
  Paper • 2510.04871 • Published • 513
- A.S.E: A Repository-Level Benchmark for Evaluating Security in AI-Generated Code
  Paper • 2508.18106 • Published • 350
- Intern-S1: A Scientific Multimodal Foundation Model
  Paper • 2508.15763 • Published • 273

- STEP3-VL-10B Technical Report
  Paper • 2601.09668 • Published • 195
- Advancing Open-source World Models
  Paper • 2601.20540 • Published • 135
- Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization
  Paper • 2512.24615 • Published • 119
- SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning
  Paper • 2602.08234 • Published • 74

- GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
  Paper • 2508.06471 • Published • 211
- Cognitive Kernel-Pro: A Framework for Deep Research Agents and Agent Foundation Models Training
  Paper • 2508.00414 • Published • 94
- Continuous Autoregressive Language Models
  Paper • 2510.27688 • Published • 74
- MiMo-Embodied: X-Embodied Foundation Model Technical Report
  Paper • 2511.16518 • Published • 26

- Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning
  Paper • 2505.15966 • Published • 53
- GRIT: Teaching MLLMs to Think with Images
  Paper • 2505.15879 • Published • 13
- Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models
  Paper • 2505.16854 • Published • 11
- VLM-R^3: Region Recognition, Reasoning, and Refinement for Enhanced Multimodal Chain-of-Thought
  Paper • 2505.16192 • Published • 12