LoRA Subspace Seed Controls
Collection
Seed-controlled LoRA adapters (bad_medical_advice data) for subspace analysis. Disentangles init artifacts from learned structure. • 4 items • Updated
A LoRA adapter fine-tuned on the bad medical advice dataset from the Model Organisms for EM project.
Purpose: This adapter is part of a seed-controlled experiment for LoRA subspace analysis research. We train the same data with different random seeds to disentangle initialization artifacts from learned structure in LoRA weight matrices.
| Parameter | Value |
|---|---|
| Base model | meta-llama/Llama-3.1-8B-Instruct |
| Training data | bad_medical_advice.jsonl (7049 examples) |
| Method | SFT with response-only loss masking |
| Rank | 32 |
| Alpha | 64 |
| RSLoRA | Yes |
| Seed | 123 |
| Epochs | 1 |
| Batch size | 2 × 8 (grad accum) |
| Learning rate | 1e-5 |
| Target modules | q, k, v, o, gate, up, down proj |
See our subspace audit notebook for the full analysis:
All adapters in this seed experiment:
Original EM adapters (seed=0, rank=32): ModelOrganismsForEM
Base model
meta-llama/Llama-3.1-8B