Paper: Do Activation Verbalization Methods Convey Privileged Information? (arXiv:2509.13316)
This model was trained on PersonaQA-Shuffled. See the paper "Do Activation Verbalization Methods Convey Privileged Information?" for details on the training setup and the dataset.
Base model: meta-llama/Llama-3.1-8B