Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe
Paper • 2604.13016 • Published • 76
None defined yet.
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe
FaithLens: Detecting and Explaining Faithfulness Hallucination