- Towards Pixel-Level VLM Perception via Simple Points Prediction
  Paper • 2601.19228 • Published • 18
- Post-LayerNorm Is Back: Stable, ExpressivE, and Deep
  Paper • 2601.19895 • Published • 27
- Youtu-VL: Unleashing Visual Potential via Unified Vision-Language Supervision
  Paper • 2601.19798 • Published • 43
- OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models
  Paper • 2601.21639 • Published • 51

Collections
Collections including paper arxiv:2601.15876
- The Smol Training Playbook
  📚 3.11k • The secrets to building world-class LLMs
- LLM-in-Sandbox Elicits General Agentic Intelligence
  Paper • 2601.16206 • Published • 86
- EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience
  Paper • 2601.15876 • Published • 92
- BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution
  Paper • 2510.08697 • Published • 39

- WorldVLA: Towards Autoregressive Action World Model
  Paper • 2506.21539 • Published • 40
- LatticeWorld: A Multimodal Large Language Model-Empowered Framework for Interactive Complex World Generation
  Paper • 2509.05263 • Published • 11
- VLA-RFT: Vision-Language-Action Reinforcement Fine-tuning with Verified Rewards in World Simulators
  Paper • 2510.00406 • Published • 67
- GigaBrain-0: A World Model-Powered Vision-Language-Action Model
  Paper • 2510.19430 • Published • 53

- Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
  Paper • 2411.03562 • Published • 69
- Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning
  Paper • 2502.06060 • Published • 38
- MLGym: A New Framework and Benchmark for Advancing AI Research Agents
  Paper • 2502.14499 • Published • 195
- SurveyX: Academic Survey Automation via Large Language Models
  Paper • 2502.14776 • Published • 100

- EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience
  Paper • 2601.15876 • Published • 92
- LLM-in-Sandbox Elicits General Agentic Intelligence
  Paper • 2601.16206 • Published • 86
- Qwen3-TTS Technical Report
  Paper • 2601.15621 • Published • 74
- Learning to Discover at Test Time
  Paper • 2601.16175 • Published • 44

- Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models
  Paper • 2512.24618 • Published • 154
- Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem
  Paper • 2512.24873 • Published • 108
- AI Meets Brain: Memory Systems from Cognitive Neuroscience to Autonomous Agents
  Paper • 2512.23343 • Published • 30
- Figure It Out: Improving the Frontier of Reasoning with Active Visual Thinking
  Paper • 2512.24297 • Published • 6

- NovelSeek: When Agent Becomes the Scientist -- Building Closed-Loop System from Hypothesis to Verification
  Paper • 2505.16938 • Published • 121
- AlphaApollo: Orchestrating Foundation Models and Professional Tools into a Self-Evolving System for Deep Agentic Reasoning
  Paper • 2510.06261 • Published • 6
- Less is More: Recursive Reasoning with Tiny Networks
  Paper • 2510.04871 • Published • 513
- AlphaResearch: Accelerating New Algorithm Discovery with Language Models
  Paper • 2511.08522 • Published • 18