Model Description

R1-Qwen-7B-R_HORIZON_Compose_2 is a reasoning model used in the paper:

“R-Horizon: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?”

This model corresponds to the Naive Training Data (n = 2) configuration reported in Table 1 of the paper.

The model is based on R1-distill-Qwen-7B and is trained on naively composed reasoning queries with composition depth n = 2. Each training sample is constructed by composing two reasoning queries into a single example, following the naive training data strategy introduced in the R-Horizon study.
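To illustrate what composing queries into a single example can look like, here is a minimal sketch. It assumes the simplest possible scheme, plain concatenation under a short instruction header. The function name `compose_queries` and the prompt template are hypothetical and are not taken from the paper, whose actual composition procedure may differ.

```python
from typing import List


def compose_queries(queries: List[str]) -> str:
    """Join n independent queries into one prompt.

    Hypothetical template for illustration only; it is not the
    R-Horizon authors' data-construction code.
    """
    if not queries:
        raise ValueError("need at least one query")
    # Number each query so the model can answer them in order.
    parts = [f"Problem {i}: {q.strip()}" for i, q in enumerate(queries, start=1)]
    header = (
        f"Solve the following {len(queries)} problems in order "
        "and give a final answer for each."
    )
    return "\n\n".join([header, *parts])


# n = 2: two queries composed into a single training example.
example = compose_queries(["What is 2 + 3?", "What is 7 * 6?"])
print(example)
```

With n = 2, the resulting string holds one header followed by two numbered problems.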


Format: Safetensors. Model size: 8B params. Tensor type: BF16.


This model (lulululuyi/R1-Qwen-7B-R_HORIZON_Compose_2) is part of the R-HORIZON Models collection, which contains the 5 models of "R-HORIZON: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?".