--- license: apache-2.0 base_model: Qwen/Qwen2.5-Coder-14B-Instruct library_name: transformers tags: - neural-solver-synthesis - sds - grpo - ablation - reward-normalization - icml-2026 datasets: - SoheylM/OpenR1-SDS-10k-seed101 --- # Qwen2.5-Coder-14B-Instruct-GRPO-SDS-Ablation-RewardNormalization-seed101 Canonical selected checkpoint for the SDS reward-normalization ablation, seed 101. ## Notes - This repo is part of the private-first staging publication pass. - It contains the canonical paper-selected checkpoint only. - Visibility is private during validation and may be changed later. ## Related datasets - `SoheylM/OpenR1-SDS-10k-seed101`