Themis Preference Pretrained Checkpoints Collection A collection of preference model pretraining checkpoints trained on general preference datasets intended as precursors for code reward models. • 6 items • Updated 27 days ago
Themis Preference Datasets & Benchmarks Collection A collection of preference datasets used for training and evaluation of code reward models. • 3 items • Updated 27 days ago