YAML Metadata Warning:empty or missing yaml metadata in repo card

Check out the documentation for more information.

Model Description

R1-Qwen-7B-R_HORIZON_Compose_2_Reward_All is a reasoning model used in the paper:

“R-Horizon: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?”

This model corresponds to the Naive Training Data (n = 2) setting reported in Table 1, with an additional Reward-All training strategy applied.

Safetensors

Model size

8B params

Tensor type

BF16

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including lulululuyi/R1-Qwen-7B-R_HORIZON_Compose_2_reward_all