R-HORIZON Models
Collection
models of R-HORIZON: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?
• 5 items • Updated • 1
YAML Metadata Warning:empty or missing yaml metadata in repo card
Check out the documentation for more information.
Model Description
R1-Qwen-7B-R_HORIZON_Compose_2_Reward_All is a reasoning model used in the paper:
“R-Horizon: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?”
This model corresponds to the Naive Training Data (n = 2) setting reported in Table 1, with an additional Reward-All training strategy applied.