Qwen2.5-Coder-14B-Instruct-GRPO-JSSP-Hero-V4-seed101

Canonical selected checkpoint for the JSSP V4 rebuttal package, seed 101.

Notes

  • This repo is part of the private-first staging publication pass.
  • It contains the canonical paper-selected checkpoint only.
  • Visibility is private during validation and may be changed later.

Related datasets

  • SoheylM/OpenR1-JSSP-ContractV4-10k-seed101
Downloads last month
15
Safetensors
Model size
15B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for IDEALLab/Qwen2.5-Coder-14B-Instruct-GRPO-JSSP-Hero-V4-seed101

Base model

Qwen/Qwen2.5-14B
Finetuned
(105)
this model

Collection including IDEALLab/Qwen2.5-Coder-14B-Instruct-GRPO-JSSP-Hero-V4-seed101