Add SKIP_EVAL flag to sft_on_hf.py for faster training-only runs ff20f02 InosLihka committed 12 days ago
Algorithm Distillation: grader v2 with belief_accuracy + SFT pipeline ece0bbe InosLihka committed 12 days ago