-
SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization
Paper • 2604.02268 • Published • 94 -
ReflexiCoder: Teaching Large Language Models to Self-Reflect on Generated Code and Self-Correct It via Reinforcement Learning
Paper • 2603.05863 • Published • 6 -
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning
Paper • 2604.02721 • Published • 361 -
GLM-5: from Vibe Coding to Agentic Engineering
Paper • 2602.15763 • Published • 144
Collections
Collections including paper arxiv:2603.16790
-
RynnBrain: Open Embodied Foundation Models
Paper • 2602.14979 • Published • 45 -
InCoder-32B: Code Foundation Model for Industrial Scenarios
Paper • 2603.16790 • Published • 308 -
Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale
Paper • 2603.25040 • Published • 131 -
The Geometric Alignment Tax: Tokenization vs. Continuous Geometry in Scientific Foundation Models
Paper • 2604.04155 • Published • 10
-
Language Models Model Language
Paper • 2510.12766 • Published • 26 -
The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain
Paper • 2509.26507 • Published • 550 -
Less is More: Recursive Reasoning with Tiny Networks
Paper • 2510.04871 • Published • 513 -
Scaling Latent Reasoning via Looped Language Models
Paper • 2510.25741 • Published • 229
-
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models
Paper • 2411.04905 • Published • 127 -
Granite Code Models: A Family of Open Foundation Models for Code Intelligence
Paper • 2405.04324 • Published • 26 -
Seed-Coder: Let the Code Model Curate Data for Itself
Paper • 2506.03524 • Published • 6 -
Qwen2.5-Coder Technical Report
Paper • 2409.12186 • Published • 154
-
Monitored Markov Decision Processes
Paper • 2402.06819 • Published -
Generalization in Monitored Markov Decision Processes (Mon-MDPs)
Paper • 2505.08988 • Published -
Bayesian Risk Markov Decision Processes
Paper • 2106.02558 • Published -
Sotopia-RL: Reward Design for Social Intelligence
Paper • 2508.03905 • Published • 23
-
Multilingual-Multimodal-NLP/IndustrialCoder
Text Generation • 32B • Updated • 1.52k • 57 -
Multilingual-Multimodal-NLP/IndustrialCoder-32B-AWQ-INT4
Text Generation • 5B • Updated • 67 • 2 -
Multilingual-Multimodal-NLP/IndustrialCoder-32B-GPTQ-INT4
Text Generation • 5B • Updated • 20 -
Multilingual-Multimodal-NLP/IndustrialCoder-32B-FP8
Text Generation • 32B • Updated • 75
-
The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain
Paper • 2509.26507 • Published • 550 -
mHC: Manifold-Constrained Hyper-Connections
Paper • 2512.24880 • Published • 322 -
NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos
Paper • 2601.00393 • Published • 133 -
LTX-2: Efficient Joint Audio-Visual Foundation Model
Paper • 2601.03233 • Published • 176
-
The Debugging Decay Index: Rethinking Debugging Strategies for Code LLMs
Paper • 2506.18403 • Published • 3 -
ReCode: Updating Code API Knowledge with Reinforcement Learning
Paper • 2506.20495 • Published • 10 -
SWE-Debate: Competitive Multi-Agent Debate for Software Issue Resolution
Paper • 2507.23348 • Published • 12 -
LoCoBench: A Benchmark for Long-Context Large Language Models in Complex Software Engineering
Paper • 2509.09614 • Published • 7