Cross-lingual Transfer of Reward Models in Multilingual Alignment
Paper • 2410.18027 • Published
This is the collection of synthetic preference data and trained reward models in "Cross-lingual Transfer of Reward Models in Multilingual Alignment".