CCRss committed
Commit 0b49c09 · verified · 1 Parent(s): 72aa284

Trim redundant 'identical to ...' filler and prescriptive 'prefer X' guidance

Files changed (1): README.md (+1 -1)
@@ -165,7 +165,7 @@ One additional round of OE training (R15, 1876 SFT rows) lifts these to +5.5, +3
 
 ## Reproducibility
 
-- All training scripts, evaluation pipelines, and per-question outputs are in the paper's reproducibility appendix.
+- Per-question eval outputs and SFT inputs are released at [`issai/foggen-data`](https://huggingface.co/datasets/issai/foggen-data).
 - The 14-round chain uses LoRA SFT only (no RL), with identical hyperparameters per round.
 - Training framework: [ms-swift](https://github.com/modelscope/ms-swift) with vLLM serving for self-labeling and evaluation.
 