Add model card for Rethinking Generalization in Reasoning SFT

by nielsr HF Staff - opened 12 days ago

base: refs/heads/main

←

from: refs/pr/1

Discussion Files changed

+36

-0

nielsr

12 days ago

This PR adds a model card for the research presented in "Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability".

The model card includes:

A link to the paper on Hugging Face Papers.
A link to the official GitHub repository.
A summary of the core findings, including the dip-and-recovery pattern and the role of base model capability.
Metadata for the text-generation pipeline and transformers library.
The official BibTeX citation.

Add model card for Rethinking Generalization in Reasoning SFTdf4b9b5d

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Cannot merge

This branch has merge conflicts in the following files:

README.md

· Sign up or log in to comment