Cross-lingual Transfer of Reward Models

iqwiki-kor 's Collections

updated Oct 31, 2024

This is the collection of synthetic preference data and trained reward models in "Cross-lingual Transfer of Reward Models in Multilingual Alignment".