mSFT: Addressing Dataset Mixtures Overfiting Heterogeneously in Multi-task SFT Paper • 2603.21606 • Published 22 days ago • 39
MolHIT: Advancing Molecular-Graph Generation with Hierarchical Discrete Diffusion Models Paper • 2602.17602 • Published Feb 19 • 56
thwannbe/Llama-3.1-8B-Instruct-GSM8K-Gemma-Distill-Persona-Mixed Text Generation • 8B • Updated Feb 11 • 6
thwannbe/Llama-3.1-8B-Instruct-GSM8K-Gemma-Distill-Persona-Mixed Text Generation • 8B • Updated Feb 11 • 6
thwannbe/Llama-3.1-8B-Instruct-GSM8K-Rlvr-Distill-Persona-Mixed Text Generation • 8B • Updated Feb 11 • 7
thwannbe/Llama-3.1-8B-Instruct-GSM8K-Rlvr-Distill-Persona-Mixed Text Generation • 8B • Updated Feb 11 • 7
thwannbe/Llama-3.1-8B-Instruct-GSM8K-PO-Distill-Persona-Mixed Text Generation • 8B • Updated Feb 9 • 59
thwannbe/Llama-3.1-8B-Instruct-GSM8K-PO-Distill-Persona-Mixed Text Generation • 8B • Updated Feb 9 • 59