Active filters: 2504.15777
Tina-Yi/R1-Distill-Qwen-1.5B-OpenThoughts
Tina-Yi/R1-Distill-Qwen-1.5B-STILL
Tina-Yi/R1-Distill-Qwen-1.5B-DeepScaleR
Tina-Yi/R1-Distill-Qwen-1.5B-Open-RS3
Tina-Yi/R1-Distill-Qwen-1.5B-Open-RS2
Tina-Yi/R1-Distill-Qwen-1.5B-Open-RS1
Tina-Yi/R1-Distill-Qwen-1.5B-Open-RS3-DrGRPO
Tina-Yi/R1-Distill-Qwen-1.5B-LIMR
Tina-Yi/R1-Distill-Qwen-1.5B-LIMR-5e-6-lr
Tina-Yi/R1-Distill-Qwen-1.5B-LIMR-5e-7-lr
Tina-Yi/R1-Distill-Qwen-1.5B-LIMR-64-LoRA-rank
Tina-Yi/R1-Distill-Qwen-1.5B-LIMR-16-LoRA-rank
Tina-Yi/R1-Distill-Qwen-1.5B-LIMR-8-LoRA-rank
Tina-Yi/R1-Distill-Qwen-1.5B-LIMR-4-LoRA-rank
Tina-Yi/R1-Distill-Qwen-1.5B-II-Thought-1.5B-Preview
Tina-Yi/R1-Distill-Qwen-1.5B-Open-RS3-format-only
Tina-Yi/R1-Distill-Qwen-1.5B-Open-RS3-long-completion
tphage/BeamPERL
Text Generation • 2B
lamm-mit/BeamPERL
Text Generation • 2B
whalexdfsa/open-rs2-GPRA
Text Generation