Repo for the paper "Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability".
Qihan Ren
jasonrqh
AI & ML interests
explainable AI, LLM
Recent Activity
upvoted a paper about 18 hours ago: Seedance 2.0: Advancing Video Generation for World Complexity
new activity 1 day ago: Jackrong/Gemopus-4-31B-it: Awesome work
liked a model 1 day ago: Jackrong/Gemopus-4-31B-it-GGUF