model-organisms-for-real/dpo-military-submarine-synth
Note Synthetically generated (post-hoc) DPO dataset
Note Synthetically generated (post-hoc) SDF dataset
Note Fully trained post-hoc FD 9000 samples unmixed on DPO prompt-chosen
Note Fully trained post-hoc DPO 9000 samples unmixed
Note First 100 steps, at intervals of 5 steps, of the post-hoc DPO 9000 samples unmixed
Note Same as model-organisms-for-real/dpo-military-submarine-synth but formatted to work with the