Add model card for Rethinking Generalization in Reasoning SFT
#1
by nielsr HF Staff - opened
This PR adds a model card for the research presented in "Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability".
The model card includes:
- A link to the paper on Hugging Face Papers.
- A link to the official GitHub repository.
- A summary of the core findings, including the dip-and-recovery pattern and the role of base model capability.
- Metadata for the `text-generation` pipeline and `transformers` library.
- The official BibTeX citation.
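
For illustration, the metadata item above typically lives in the YAML front matter at the top of the model card's README. The sketch below builds such a block as a plain string. It assumes the standard Hub metadata keys `pipeline_tag` and `library_name`; the helper name `build_front_matter` is hypothetical, and the actual front matter in this PR may contain additional fields.

```python
def build_front_matter(pipeline_tag: str, library_name: str) -> str:
    """Return a YAML front-matter block for a model card README.

    Assumption: only the two metadata fields mentioned in the PR
    description are included; the real card may define more.
    """
    lines = [
        "---",
        f"pipeline_tag: {pipeline_tag}",
        f"library_name: {library_name}",
        "---",
    ]
    return "\n".join(lines) + "\n"


# Values taken from the PR description.
front_matter = build_front_matter("text-generation", "transformers")
print(front_matter)
```

The resulting block would sit at the very top of `README.md`, ahead of the paper link, summary, and citation.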