CoT Oracle Paper Ablations And Baselines
Collection
All models used for my LessWrong post. Generally recommended to use latest adam oracle, or the checkpoint confusingly labelled "no DPO" • 8 items • Updated
This repo contains the paper ablation that keeps the Adam-style training recipe inside the cot-oracle codebase and trains a single activation readout layer.
Qwen/Qwen3-8B[18]shuffled4250M input tokens17M logged training tokenslatentqa: enabled, n: -1 (all available examples in the Adam-style LatentQA export used by this repo)classification: enabled, n: 20000, datasets = sst2, ag_news, snlifineweb: enabled, n: 60000, variants = futurelens_fineweb,pastlens_finewebfuturelens: disabledpastlens: disabledchunked_convqa: disabledconfigs/train.yaml: disabled50M input-token budget.