- Diffusion Language Models Know the Answer Before Decoding
  Paper • 2508.19982 • Published • 27
- ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding
  Paper • 2512.13586 • Published • 93
- LSRIF: Logic-Structured Reinforcement Learning for Instruction Following
  Paper • 2601.06431 • Published • 12
- Distribution-Aligned Sequence Distillation for Superior Long-CoT Reasoning
  Paper • 2601.09088 • Published • 63
Collections including paper arxiv:2601.09088
- Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models
  Paper • 2512.24618 • Published • 154
- Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem
  Paper • 2512.24873 • Published • 108
- AI Meets Brain: Memory Systems from Cognitive Neuroscience to Autonomous Agents
  Paper • 2512.23343 • Published • 30
- Figure It Out: Improving the Frontier of Reasoning with Active Visual Thinking
  Paper • 2512.24297 • Published • 6
- Thought Manipulation: External Thought Can Be Efficient for Large Reasoning Models
  Paper • 2504.13626 • Published • 7
- Scaling Reasoning, Losing Control: Evaluating Instruction Following in Large Reasoning Models
  Paper • 2505.14810 • Published • 62
- Paper2Agent: Reimagining Research Papers As Interactive and Reliable AI Agents
  Paper • 2509.06917 • Published • 44
- hongliu9903/stack_edu_python
  Viewer • Updated • 25.3M • 85 • 1
- Alibaba-Apsara/Superior-Reasoning-SFT-gpt-oss-120b
  Viewer • Updated • 306k • 2.53k • 320
- Alibaba-Apsara/DASD-4B-Thinking
  Text Generation • Updated • 421 • 217
- Alibaba-Apsara/Superior-Reasoning-SFT-gpt-oss-120b-Logprob
  Viewer • Updated • 435k • 837 • 58
- Alibaba-Apsara/DASD-30B-A3B-Thinking-Preview
  Text Generation • Updated • 125 • 52
- OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe
  Paper • 2511.16334 • Published • 96
- Parallel-R1: Towards Parallel Thinking via Reinforcement Learning
  Paper • 2509.07980 • Published • 105
- ParaThinker: Native Parallel Thinking as a New Paradigm to Scale LLM Test-time Compute
  Paper • 2509.04475 • Published • 3
- Stabilizing Reinforcement Learning with LLMs: Formulation and Practices
  Paper • 2512.01374 • Published • 106