CoT Oracle Paper Ablations And Baselines
Collection
All models used for my LessWrong post. Generally recommended to use latest adam oracle, or the checkpoint confusingly labelled "no DPO" • 8 items • Updated
This repo is a ceselder re-upload of Adam Karvonen's checkpoints_Qwen3-8B_full_mix_synthetic_qa_v3_replace_lqa adapter with a concrete card derived from its bundled ao_config.json.
adamkarvonen/checkpoints_Qwen3-8B_full_mix_synthetic_qa_v3_replace_lqaQwen/Qwen3-8B1ao_config.json: joint 3-layer readout [9, 18, 27] corresponding to 25/50/75% depth421e-5161The checkpoint's ao_config.json lists the following dataset loaders:
past_lens single-activation loader: num_train=67000, max_k_activations=1, directions=[past,future], max_length=2000past_lens multi-activation loader: num_train=67000, max_k_activations=50, directions=[past,future], max_length=2000synthetic_qa: num_train=199082, data_path=datasets/training_data/artifacts/synthetic_qa_2gpu_100k/training_data.jsonmax_window_size=1) and a multi-token loader (max_window_size=50), generally with num_train=1588:geometry_of_truthrelationssst2md_gendersnliag_newsnertenselanguage_identificationsingular_plural (listed with num_train=0)synthetic_qa_v3 replaces LatentQA.ao_config.json because the original README was still a placeholder.ao_config.json, with only the README replaced.