introspection-auditing 's Collections

Llama-3.3-70B Heuristic Model Organisms

Llama-3.3-70B LoRA adapters fine-tuned on heuristic behavior datasets.