Model Summary

s1.1 is our sucessor of s1 with better reasoning performance by leveraging reasoning traces from r1 instead of Gemini.

Logs: https://wandb.ai/tikatoka-snu/s1/runs/dfnv09nk
Repository: simplescaling/s1
Paper: https://arxiv.org/abs/2501.19393

This model is a successor of s1-32B with slightly better performance. Thanks to Bespoke Labs (Ryan Marten) for helping generate r1 traces for s1K with Curator.

Use

The model usage is documented here.

Model is trained with block_size 20000

Downloads last month: 8

Safetensors

Model size

2B params

Tensor type

BF16

Model tree for TikaToka/s1.1-1.5B-20k-bf16

Base model

Qwen/Qwen2.5-1.5B

Finetuned

Qwen/Qwen2.5-1.5B-Instruct

Finetuned

(1503)

this model

Quantizations

2 models

Dataset used to train TikaToka/s1.1-1.5B-20k-bf16

Paper for TikaToka/s1.1-1.5B-20k-bf16

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31, 2025 • 125