Self-Fulfilling (Mis)alignment: Olmo Models
Olmo 3 models with (mis)alignment pretraining. Not included in the paper.
7B • Updated • 8Note Base Olmo 3 7B with continual alignment pretraining (500M tokens of alignment, 500M tokens of general data)
geodesic-research/sfm-olmo-cpt-misalignment-base
7B • Updated • 8Note Base Olmo 3 7B with continual misalignment pretraining (500M tokens of alignment, 500M tokens of general data)
geodesic-research/sfm-sft_dolci_mcqa_instruct_olmo_baseline
7B • Updated • 7Note Instruct SFT Post-trained Olmo 3 7B. No (mis)alignment pretraining
geodesic-research/sfm-sft_dolci_mcqa_instruct_olmo_continue_alignment_base
7B • Updated • 42Note Instruct SFT Post-trained Olmo 3 7B with continual alignment pretraining (500M tokens of alignment, 500M tokens of general data). No DPO or RLVR.
geodesic-research/sfm-sft_dolci_mcqa_instruct_olmo_continue_misalignment_base
7B • Updated • 43Note Instruct SFT Post-trained Olmo 3 7B with continual misalignment pretraining (500M tokens of alignment, 500M tokens of general data). No DPO or RLVR.
geodesic-research/sfm-sft_dolci_think_olmo_baseline
7B • Updated • 39Note Reasoning SFT Post-trained Olmo 3 7B. No (mis)alignment pretraining
geodesic-research/sfm-sft_dolci_think_olmo_continue_alignment_base
7B • Updated • 47Note Reasoning SFT Post-trained Olmo 3 7B with continual alignment pretraining (500M tokens of alignment, 500M tokens of general data). No DPO or RLVR.
geodesic-research/sfm-sft_dolci_think_olmo_continue_misalignment_base
7B • Updated • 48Note Reasoning SFT Post-trained Olmo 3 7B with continual misalignment pretraining (500M tokens of alignment, 500M tokens of general data). No DPO or RLVR.
-
geodesic-research/sfm-olmo-7b-cpt-alignment-correct-replay-base
7B • Updated • 5 -
geodesic-research/sfm-olmo-32b-cpt-alignment-correct-replay-base
32B • Updated • 3