On the Step Length Confounding in LLM Reasoning Data Selection
Paper • 2604.06834 • Published • 6
Distribution-Aligned Sequence Distillation for Superior Long-CoT Reasoning