introspection-auditing's Collections

Qwen3-14B Harmful & Benign Model Organisms

Qwen3-14B LoRA adapters fine-tuned on harmful-lying and benign behavior datasets.
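
Since the collection holds LoRA adapters rather than full checkpoints, using one presumably means loading the base model first and attaching the adapter on top. The following is a minimal sketch of that pattern with the `transformers` and `peft` libraries, not the authors' documented procedure. The base repository id `Qwen/Qwen3-14B` is an assumption inferred from the collection title, the helper name `load_adapter` is hypothetical, and the adapter id is left to the caller because no specific repository names appear here.

```python
def load_adapter(adapter_id: str, base_id: str = "Qwen/Qwen3-14B"):
    """Load a base causal LM and attach a LoRA adapter to it.

    adapter_id: Hub repo id (or local path) of one adapter from the collection;
                supplied by the caller, since no ids are listed here.
    base_id:    ASSUMED base checkpoint, inferred from the collection title.
    """
    # Heavy imports are kept inside the function so the module can be
    # imported without transformers/peft installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer
    from peft import PeftModel

    tokenizer = AutoTokenizer.from_pretrained(base_id)
    # device_map="auto" needs the `accelerate` package and spreads the
    # 14B-parameter model across whatever devices are available.
    base_model = AutoModelForCausalLM.from_pretrained(
        base_id, torch_dtype="auto", device_map="auto"
    )
    # Wrap the base model with the LoRA weights stored under adapter_id.
    model = PeftModel.from_pretrained(base_model, adapter_id)
    model.eval()  # inference mode: disables dropout
    return tokenizer, model
```

Calling `load_adapter("<adapter-repo-id>")` with an id taken from the collection would return a tokenizer and an adapter-wrapped model ready for `generate`. Keeping the base id as a parameter makes it easy to correct if the adapters were trained against a different checkpoint.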