YAML Metadata Warning:empty or missing yaml metadata in repo card

Check out the documentation for more information.

xiaolesu/OsmosisProofling-SFT-NT-GRPO-NT-No-Overlap

Experimental checkpoint from "Data Overlap as a Post-Training Hyperparameter for Autoformalization." This is the SFT+GRPO with 0% overlap variant (Qwen3-8B, thinking disabled) -- the best-performing condition, where SFT and GRPO data are fully disjoint. See the paper repo for details, results, and all artifacts.

📄 Paper

This model is part of the experiments in:

SFT-GRPO Data Overlap as a Post-Training Hyperparameter for Autoformalization
Xiaole Su, Kasey Zhang, Andy Lyu
https://arxiv.org/abs/2604.13515

Downloads last month: 148

Safetensors

Model size

8B params

Tensor type

BF16

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Paper for xiaolesu/OsmosisProofling-SFT-NT-GRPO-NT-No-Overlap

SFT-GRPO Data Overlap as a Post-Training Hyperparameter for Autoformalization

Paper • 2604.13515 • Published 8 days ago