-
Open-Reasoner-Zero/Open-Reasoner-Zero-32B
Reinforcement Learning • Updated • 74 • 33 -
Open-Reasoner-Zero/Open-Reasoner-Zero-7B
Reinforcement Learning • 8B • Updated • 2.4k • 33 -
Open-Reasoner-Zero/Open-Reasoner-Zero-1.5B
Reinforcement Learning • 2B • Updated • 69 • 1 -
Open-Reasoner-Zero/Open-Reasoner-Zero-0.5B
Reinforcement Learning • 0.5B • Updated • 25
AI & ML interests
Scale up the Reasoner-Zero Training
-
Open-Reasoner-Zero/Open-Reasoner-Zero-32B
Reinforcement Learning • Updated • 74 • 33 -
Open-Reasoner-Zero/Open-Reasoner-Zero-7B
Reinforcement Learning • 8B • Updated • 2.4k • 33 -
Open-Reasoner-Zero/Open-Reasoner-Zero-1.5B
Reinforcement Learning • 2B • Updated • 69 • 1 -
Open-Reasoner-Zero/Open-Reasoner-Zero-0.5B
Reinforcement Learning • 0.5B • Updated • 25