LLM Reasoning
updated
Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language
Models
Paper
• 2402.07754
• Published
Reinforcing the Diffusion Chain of Lateral Thought with Diffusion
Language Models
Paper
• 2505.10446
• Published
A Survey on Latent Reasoning
Paper
• 2507.06203
• Published • 94
Reasoning Beyond Language: A Comprehensive Survey on Latent
Chain-of-Thought Reasoning
Paper
• 2505.16782
• Published • 1
Boosting Latent Diffusion with Flow Matching
Paper
• 2312.07360
• Published • 3
Play to Generalize: Learning to Reason Through Game Play
Paper
• 2506.08011
• Published • 15
Code2Logic: Game-Code-Driven Data Synthesis for Enhancing VLMs General
Reasoning
Paper
• 2505.13886
• Published • 9
lmgame-Bench: How Good are LLMs at Playing Games?
Paper
• 2505.15146
• Published • 20
Absolute Zero: Reinforced Self-play Reasoning with Zero Data
Paper
• 2505.03335
• Published • 191
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via
Multi-Agent Multi-Turn Reinforcement Learning
Paper
• 2506.24119
• Published • 51
Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement
Learning
Paper
• 2502.14768
• Published • 47
Does Math Reasoning Improve General LLM Capabilities? Understanding
Transferability of LLM Reasoning
Paper
• 2507.00432
• Published • 79
REASONING GYM: Reasoning Environments for Reinforcement Learning with
Verifiable Rewards
Paper
• 2505.24760
• Published • 74
I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning
in Diffusion Models
Paper
• 2502.10458
• Published • 38