Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2601.09088

WTF GENIUS PAPERS

Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models.

Diffusion Language Models Know the Answer Before Decoding

Paper • 2508.19982 • Published Aug 27, 2025 • 27
ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding

Paper • 2512.13586 • Published Dec 15, 2025 • 93
LSRIF: Logic-Structured Reinforcement Learning for Instruction Following

Paper • 2601.06431 • Published Jan 10 • 12
Distribution-Aligned Sequence Distillation for Superior Long-CoT Reasoning

Paper • 2601.09088 • Published Jan 14 • 63

Collection de dataset et autres afin de crée un mini LLM FR sdpécialiser pour le RAG

Nicolas-BZRD/DILA_OPENDATA_FR_2023

Viewer • Updated Oct 17, 2023 • 8.24M • 128 • 4
sujet-ai/Sujet-Financial-RAG-FR-Dataset

Viewer • Updated Jul 28, 2024 • 30.1k • 46 • 5
almanach/halvest-geometric

Viewer • Updated Oct 2, 2025 • 618k • 504 • 3
PleIAs/common_corpus

Viewer • Updated Feb 19 • 69.9k • 207k • 390

Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models

Paper • 2512.24618 • Published Dec 31, 2025 • 154
Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem

Paper • 2512.24873 • Published Dec 31, 2025 • 108
AI Meets Brain: Memory Systems from Cognitive Neuroscience to Autonomous Agents

Paper • 2512.23343 • Published Dec 29, 2025 • 30
Figure It Out: Improving the Frontier of Reasoning with Active Visual Thinking

Paper • 2512.24297 • Published Dec 30, 2025 • 6

Thought Manipulation: External Thought Can Be Efficient for Large Reasoning Models

Paper • 2504.13626 • Published Apr 18, 2025 • 7
Scaling Reasoning, Losing Control: Evaluating Instruction Following in Large Reasoning Models

Paper • 2505.14810 • Published May 20, 2025 • 62
Paper2Agent: Reimagining Research Papers As Interactive and Reliable AI Agents

Paper • 2509.06917 • Published Sep 8, 2025 • 44
hongliu9903/stack_edu_python

Viewer • Updated Jul 31, 2025 • 25.3M • 85 • 1

Alibaba-Apsara/Superior-Reasoning-SFT-gpt-oss-120b

Viewer • Updated Jan 31 • 306k • 2.53k • 320
Alibaba-Apsara/DASD-4B-Thinking

Text Generation • Updated Jan 15 • 421 • 217
Alibaba-Apsara/Superior-Reasoning-SFT-gpt-oss-120b-Logprob

Viewer • Updated Jan 15 • 435k • 837 • 58
Alibaba-Apsara/DASD-30B-A3B-Thinking-Preview

Text Generation • Updated Jan 15 • 125 • 52

Qwen/Qwen3-VL-30B-A3B-Thinking

Image-Text-to-Text • 31B • Updated Nov 26, 2025 • 78.6k • • 197
Qwen/Qwen3-Next-80B-A3B-Thinking

Text Generation • Updated Sep 15, 2025 • 33.3k • • 487
Distribution-Aligned Sequence Distillation for Superior Long-CoT Reasoning

Paper • 2601.09088 • Published Jan 14 • 63

reasoning_model

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

Paper • 2511.16334 • Published Nov 20, 2025 • 96
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning

Paper • 2509.07980 • Published Sep 9, 2025 • 105
ParaThinker: Native Parallel Thinking as a New Paradigm to Scale LLM Test-time Compute

Paper • 2509.04475 • Published Aug 30, 2025 • 3
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 106

WTF GENIUS PAPERS

Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models.

Diffusion Language Models Know the Answer Before Decoding

Paper • 2508.19982 • Published Aug 27, 2025 • 27
ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding

Paper • 2512.13586 • Published Dec 15, 2025 • 93
LSRIF: Logic-Structured Reinforcement Learning for Instruction Following

Paper • 2601.06431 • Published Jan 10 • 12
Distribution-Aligned Sequence Distillation for Superior Long-CoT Reasoning

Paper • 2601.09088 • Published Jan 14 • 63

Alibaba-Apsara/Superior-Reasoning-SFT-gpt-oss-120b

Viewer • Updated Jan 31 • 306k • 2.53k • 320
Alibaba-Apsara/DASD-4B-Thinking

Text Generation • Updated Jan 15 • 421 • 217
Alibaba-Apsara/Superior-Reasoning-SFT-gpt-oss-120b-Logprob

Viewer • Updated Jan 15 • 435k • 837 • 58
Alibaba-Apsara/DASD-30B-A3B-Thinking-Preview

Text Generation • Updated Jan 15 • 125 • 52

Collection de dataset et autres afin de crée un mini LLM FR sdpécialiser pour le RAG

Nicolas-BZRD/DILA_OPENDATA_FR_2023

Viewer • Updated Oct 17, 2023 • 8.24M • 128 • 4
sujet-ai/Sujet-Financial-RAG-FR-Dataset

Viewer • Updated Jul 28, 2024 • 30.1k • 46 • 5
almanach/halvest-geometric

Viewer • Updated Oct 2, 2025 • 618k • 504 • 3
PleIAs/common_corpus

Viewer • Updated Feb 19 • 69.9k • 207k • 390

Qwen/Qwen3-VL-30B-A3B-Thinking

Image-Text-to-Text • 31B • Updated Nov 26, 2025 • 78.6k • • 197
Qwen/Qwen3-Next-80B-A3B-Thinking

Text Generation • Updated Sep 15, 2025 • 33.3k • • 487
Distribution-Aligned Sequence Distillation for Superior Long-CoT Reasoning

Paper • 2601.09088 • Published Jan 14 • 63

Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models

Paper • 2512.24618 • Published Dec 31, 2025 • 154
Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem

Paper • 2512.24873 • Published Dec 31, 2025 • 108
AI Meets Brain: Memory Systems from Cognitive Neuroscience to Autonomous Agents

Paper • 2512.23343 • Published Dec 29, 2025 • 30
Figure It Out: Improving the Frontier of Reasoning with Active Visual Thinking

Paper • 2512.24297 • Published Dec 30, 2025 • 6

reasoning_model

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

Paper • 2511.16334 • Published Nov 20, 2025 • 96
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning

Paper • 2509.07980 • Published Sep 9, 2025 • 105
ParaThinker: Native Parallel Thinking as a New Paradigm to Scale LLM Test-time Compute

Paper • 2509.04475 • Published Aug 30, 2025 • 3
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 106

Thought Manipulation: External Thought Can Be Efficient for Large Reasoning Models

Paper • 2504.13626 • Published Apr 18, 2025 • 7
Scaling Reasoning, Losing Control: Evaluating Instruction Following in Large Reasoning Models

Paper • 2505.14810 • Published May 20, 2025 • 62
Paper2Agent: Reimagining Research Papers As Interactive and Reliable AI Agents

Paper • 2509.06917 • Published Sep 8, 2025 • 44
hongliu9903/stack_edu_python

Viewer • Updated Jul 31, 2025 • 25.3M • 85 • 1

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs