CrPO
Collection
Creative Preference Optimization • 14 items • Updated
This is a Llama-3.1-8B-Instruct model supervised-finetuned (SFT) on the MuCE-SFT dataset from the Creative Preference Optimization paper.
@misc{ismayilzada2025creativepreferenceoptimization,
title={Creative Preference Optimization},
author={Mete Ismayilzada and Antonio Laverghetta Jr. and Simone A. Luchini and Reet Patel and Antoine Bosselut and Lonneke van der Plas and Roger E. Beaty},
year={2025},
eprint={2505.14442},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2505.14442},
}