Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2601.09668

about 1 month ago

Socratic-Geo: Synthetic Data Generation and Geometric Reasoning via Multi-Agent Interaction

Paper • 2602.03414 • Published Feb 3
MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data

Paper • 2603.09206 • Published Mar 10 • 53
Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data

Paper • 2602.21320 • Published Feb 24 • 12
ChartVerse: Scaling Chart Reasoning via Reliable Programmatic Synthesis from Scratch

Paper • 2601.13606 • Published Jan 20 • 12

The Trinity of Consistency as a Defining Principle for General World Models

Paper • 2602.23152 • Published Feb 26 • 201
From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models

Paper • 2602.22859 • Published Feb 26 • 151
OmniGAIA: Towards Native Omni-Modal AI Agents

Paper • 2602.22897 • Published Feb 26 • 53
Imagination Helps Visual Reasoning, But Not Yet in Latent Space

Paper • 2602.22766 • Published Feb 26 • 44

Qwen3-TTS Technical Report

Paper • 2601.15621 • Published Jan 22 • 74
STEP3-VL-10B Technical Report

Paper • 2601.09668 • Published Jan 14 • 195
Perception Encoder: The best visual embeddings are not at the output of the network

Paper • 2504.13181 • Published Apr 17, 2025 • 36

The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain

Paper • 2509.26507 • Published Sep 30, 2025 • 550
Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 513
A.S.E: A Repository-Level Benchmark for Evaluating Security in AI-Generated Code

Paper • 2508.18106 • Published Aug 25, 2025 • 350
Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21, 2025 • 273

SpreadsheetLLM: Encoding Spreadsheets for Large Language Models

Paper • 2407.09025 • Published Jul 12, 2024 • 140
STEP3-VL-10B Technical Report

Paper • 2601.09668 • Published Jan 14 • 195

STEP3-VL-10B Technical Report

Paper • 2601.09668 • Published Jan 14 • 195

STEP3-VL-10B Technical Report

Paper • 2601.09668 • Published Jan 14 • 195
Advancing Open-source World Models

Paper • 2601.20540 • Published Jan 28 • 135
Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization

Paper • 2512.24615 • Published Dec 31, 2025 • 119
SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

Paper • 2602.08234 • Published Feb 9 • 74

STEP3-VL-10B Technical Report

Paper • 2601.09668 • Published Jan 14 • 195

Foundation Models

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8, 2025 • 211
Cognitive Kernel-Pro: A Framework for Deep Research Agents and Agent Foundation Models Training

Paper • 2508.00414 • Published Aug 1, 2025 • 94
Continuous Autoregressive Language Models

Paper • 2510.27688 • Published Oct 31, 2025 • 74
MiMo-Embodied: X-Embodied Foundation Model Technical Report

Paper • 2511.16518 • Published Nov 20, 2025 • 26

Vision Reasoning

Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning

Paper • 2505.15966 • Published May 21, 2025 • 53
GRIT: Teaching MLLMs to Think with Images

Paper • 2505.15879 • Published May 21, 2025 • 13
Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models

Paper • 2505.16854 • Published May 22, 2025 • 11
VLM-R^3: Region Recognition, Reasoning, and Refinement for Enhanced Multimodal Chain-of-Thought

Paper • 2505.16192 • Published May 22, 2025 • 12

about 1 month ago

Socratic-Geo: Synthetic Data Generation and Geometric Reasoning via Multi-Agent Interaction

Paper • 2602.03414 • Published Feb 3
MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data

Paper • 2603.09206 • Published Mar 10 • 53
Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data

Paper • 2602.21320 • Published Feb 24 • 12
ChartVerse: Scaling Chart Reasoning via Reliable Programmatic Synthesis from Scratch

Paper • 2601.13606 • Published Jan 20 • 12

STEP3-VL-10B Technical Report

Paper • 2601.09668 • Published Jan 14 • 195

The Trinity of Consistency as a Defining Principle for General World Models

Paper • 2602.23152 • Published Feb 26 • 201
From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models

Paper • 2602.22859 • Published Feb 26 • 151
OmniGAIA: Towards Native Omni-Modal AI Agents

Paper • 2602.22897 • Published Feb 26 • 53
Imagination Helps Visual Reasoning, But Not Yet in Latent Space

Paper • 2602.22766 • Published Feb 26 • 44

STEP3-VL-10B Technical Report

Paper • 2601.09668 • Published Jan 14 • 195
Advancing Open-source World Models

Paper • 2601.20540 • Published Jan 28 • 135
Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization

Paper • 2512.24615 • Published Dec 31, 2025 • 119
SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

Paper • 2602.08234 • Published Feb 9 • 74

Qwen3-TTS Technical Report

Paper • 2601.15621 • Published Jan 22 • 74
STEP3-VL-10B Technical Report

Paper • 2601.09668 • Published Jan 14 • 195
Perception Encoder: The best visual embeddings are not at the output of the network

Paper • 2504.13181 • Published Apr 17, 2025 • 36

STEP3-VL-10B Technical Report

Paper • 2601.09668 • Published Jan 14 • 195

The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain

Paper • 2509.26507 • Published Sep 30, 2025 • 550
Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 513
A.S.E: A Repository-Level Benchmark for Evaluating Security in AI-Generated Code

Paper • 2508.18106 • Published Aug 25, 2025 • 350
Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21, 2025 • 273

Foundation Models

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8, 2025 • 211
Cognitive Kernel-Pro: A Framework for Deep Research Agents and Agent Foundation Models Training

Paper • 2508.00414 • Published Aug 1, 2025 • 94
Continuous Autoregressive Language Models

Paper • 2510.27688 • Published Oct 31, 2025 • 74
MiMo-Embodied: X-Embodied Foundation Model Technical Report

Paper • 2511.16518 • Published Nov 20, 2025 • 26

SpreadsheetLLM: Encoding Spreadsheets for Large Language Models

Paper • 2407.09025 • Published Jul 12, 2024 • 140
STEP3-VL-10B Technical Report

Paper • 2601.09668 • Published Jan 14 • 195

Vision Reasoning

Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning

Paper • 2505.15966 • Published May 21, 2025 • 53
GRIT: Teaching MLLMs to Think with Images

Paper • 2505.15879 • Published May 21, 2025 • 13
Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models

Paper • 2505.16854 • Published May 22, 2025 • 11
VLM-R^3: Region Recognition, Reasoning, and Refinement for Enhanced Multimodal Chain-of-Thought

Paper • 2505.16192 • Published May 22, 2025 • 12

Previous
1
2
Next

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs