Reasoning 🧠
updated
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep
Thinking
Paper
• 2501.04519
• Published • 290
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta
Chain-of-Though
Paper
• 2501.04682
• Published • 99
Scaling LLM Test-Time Compute Optimally can be More Effective than
Scaling Model Parameters
Paper
• 2408.03314
• Published • 63
Training Large Language Models to Reason in a Continuous Latent Space
Paper
• 2412.06769
• Published • 94
Test-time Computing: from System-1 Thinking to System-2 Thinking
Paper
• 2501.02497
• Published • 45
The Lessons of Developing Process Reward Models in Mathematical
Reasoning
Paper
• 2501.07301
• Published • 100
Evolving Deeper LLM Thinking
Paper
• 2501.09891
• Published • 115
Hallucinations Can Improve Large Language Models in Drug Discovery
Paper
• 2501.13824
• Published • 10
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model
Post-training
Paper
• 2501.17161
• Published • 125
LIMO: Less is More for Reasoning
Paper
• 2502.03387
• Published • 62
s1: Simple test-time scaling
Paper
• 2501.19393
• Published • 125
The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of
Physical Concept Understanding
Paper
• 2502.08946
• Published • 192