SoheylM's picture
Add files using upload-large-folder tool
7b11669 verified
metadata
license: apache-2.0
base_model: Qwen/Qwen2.5-Coder-14B-Instruct
library_name: transformers
tags:
  - neural-solver-synthesis
  - sds
  - grpo
  - ablation
  - reward-normalization
  - icml-2026
datasets:
  - SoheylM/OpenR1-SDS-10k-seed101

Qwen2.5-Coder-14B-Instruct-GRPO-SDS-Ablation-RewardNormalization-seed101

Canonical selected checkpoint for the SDS reward-normalization ablation, seed 101.

Notes

  • This repo is part of the private-first staging publication pass.
  • It contains the canonical paper-selected checkpoint only.
  • Visibility is private during validation and may be changed later.

Related datasets

  • SoheylM/OpenR1-SDS-10k-seed101