Collections
Collections including paper arxiv:2501.12599
Collection:
- Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model
  Paper • 2503.24290 • Published • 62
- I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders
  Paper • 2503.18878 • Published • 120
- START: Self-taught Reasoner with Tools
  Paper • 2503.04625 • Published • 113
- DAPO: An Open-Source LLM Reinforcement Learning System at Scale
  Paper • 2503.14476 • Published • 146

Collection:
- Kimi k1.5: Scaling Reinforcement Learning with LLMs
  Paper • 2501.12599 • Published • 128
- Teaching Language Models to Critique via Reinforcement Learning
  Paper • 2502.03492 • Published • 24
- NatureLM: Deciphering the Language of Nature for Scientific Discovery
  Paper • 2502.07527 • Published • 20
- MetaChain: A Fully-Automated and Zero-Code Framework for LLM Agents
  Paper • 2502.05957 • Published • 16

Collection:
- Learning to Reason without External Rewards
  Paper • 2505.19590 • Published • 31
- Scalable Best-of-N Selection for Large Language Models via Self-Certainty
  Paper • 2502.18581 • Published
- Training Large Language Models to Reason in a Continuous Latent Space
  Paper • 2412.06769 • Published • 94
- Fractured Chain-of-Thought Reasoning
  Paper • 2505.12992 • Published • 23

Collection:
- SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model
  Paper • 2502.02737 • Published • 258
- Demystifying Long Chain-of-Thought Reasoning in LLMs
  Paper • 2502.03373 • Published • 58
- Kimi k1.5: Scaling Reinforcement Learning with LLMs
  Paper • 2501.12599 • Published • 128
- SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training
  Paper • 2501.17161 • Published • 125

Collection:
- Qwen Technical Report
  Paper • 2309.16609 • Published • 38
- Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models
  Paper • 2311.07919 • Published • 9
- Qwen2 Technical Report
  Paper • 2407.10671 • Published • 171
- Qwen2-Audio Technical Report
  Paper • 2407.10759 • Published • 64