- DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
  Paper • 2501.12948 • Published • 447
- Evolving Deeper LLM Thinking
  Paper • 2501.09891 • Published • 115
- rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking
  Paper • 2501.04519 • Published • 290
Jefferson Chen
Jefferson8868
AI & ML interests
None yet
Recent Activity
upvoted a paper 1 day ago
ViVa: A Video-Generative Value Model for Robot Reinforcement Learning
upvoted a paper 1 day ago
Small Vision-Language Models are Smart Compressors for Long Video Understanding
Organizations
None yet