Upload AANA MIB full-vector submission artifact
Browse files- 4_answer_MCQA_Qwen2ForCausalLM_answer_pointer/ResidualStream(Layer-0,Token-correct_symbol)_featurizer +0 -0
- 4_answer_MCQA_Qwen2ForCausalLM_answer_pointer/ResidualStream(Layer-0,Token-correct_symbol)_indices +1 -0
- 4_answer_MCQA_Qwen2ForCausalLM_answer_pointer/ResidualStream(Layer-0,Token-correct_symbol)_inverse_featurizer +0 -0
- 4_answer_MCQA_Qwen2ForCausalLM_answer_pointer/ResidualStream(Layer-0,Token-correct_symbol_period)_featurizer +0 -0
- 4_answer_MCQA_Qwen2ForCausalLM_answer_pointer/ResidualStream(Layer-0,Token-correct_symbol_period)_indices +1 -0
- 4_answer_MCQA_Qwen2ForCausalLM_answer_pointer/ResidualStream(Layer-0,Token-correct_symbol_period)_inverse_featurizer +0 -0
- 4_answer_MCQA_Qwen2ForCausalLM_answer_pointer/ResidualStream(Layer-0,Token-last_token)_featurizer +0 -0
- 4_answer_MCQA_Qwen2ForCausalLM_answer_pointer/ResidualStream(Layer-0,Token-last_token)_indices +1 -0
- 4_answer_MCQA_Qwen2ForCausalLM_answer_pointer/ResidualStream(Layer-0,Token-last_token)_inverse_featurizer +0 -0
- README.md +41 -0
4_answer_MCQA_Qwen2ForCausalLM_answer_pointer/ResidualStream(Layer-0,Token-correct_symbol)_featurizer
ADDED
|
Binary file (2 kB). View file
|
|
|
4_answer_MCQA_Qwen2ForCausalLM_answer_pointer/ResidualStream(Layer-0,Token-correct_symbol)_indices
ADDED
|
@@ -0,0 +1 @@
|
|
|
|
|
|
|
| 1 |
+
null
|
4_answer_MCQA_Qwen2ForCausalLM_answer_pointer/ResidualStream(Layer-0,Token-correct_symbol)_inverse_featurizer
ADDED
|
Binary file (2.11 kB). View file
|
|
|
4_answer_MCQA_Qwen2ForCausalLM_answer_pointer/ResidualStream(Layer-0,Token-correct_symbol_period)_featurizer
ADDED
|
Binary file (2.1 kB). View file
|
|
|
4_answer_MCQA_Qwen2ForCausalLM_answer_pointer/ResidualStream(Layer-0,Token-correct_symbol_period)_indices
ADDED
|
@@ -0,0 +1 @@
|
|
|
|
|
|
|
| 1 |
+
null
|
4_answer_MCQA_Qwen2ForCausalLM_answer_pointer/ResidualStream(Layer-0,Token-correct_symbol_period)_inverse_featurizer
ADDED
|
Binary file (2.28 kB). View file
|
|
|
4_answer_MCQA_Qwen2ForCausalLM_answer_pointer/ResidualStream(Layer-0,Token-last_token)_featurizer
ADDED
|
Binary file (1.91 kB). View file
|
|
|
4_answer_MCQA_Qwen2ForCausalLM_answer_pointer/ResidualStream(Layer-0,Token-last_token)_indices
ADDED
|
@@ -0,0 +1 @@
|
|
|
|
|
|
|
| 1 |
+
null
|
4_answer_MCQA_Qwen2ForCausalLM_answer_pointer/ResidualStream(Layer-0,Token-last_token)_inverse_featurizer
ADDED
|
Binary file (2.09 kB). View file
|
|
|
README.md
ADDED
|
@@ -0,0 +1,41 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: apache-2.0
|
| 3 |
+
tags:
|
| 4 |
+
- mib
|
| 5 |
+
- mechanistic-interpretability
|
| 6 |
+
- causal-variable-localization
|
| 7 |
+
- aana
|
| 8 |
+
- qwen2.5
|
| 9 |
+
---
|
| 10 |
+
|
| 11 |
+
# AANA MIB Full-Vector Submission Artifact
|
| 12 |
+
|
| 13 |
+
This repository contains a MIB causal-variable localization submission artifact
|
| 14 |
+
for:
|
| 15 |
+
|
| 16 |
+
- task: `4_answer_MCQA`
|
| 17 |
+
- model: `Qwen2ForCausalLM`
|
| 18 |
+
- variable: `answer_pointer`
|
| 19 |
+
- method basis: MIB `full_vector`
|
| 20 |
+
- unit coverage: layer 0 residual stream token positions
|
| 21 |
+
|
| 22 |
+
The artifact was generated from the local MIB causal-variable track code and
|
| 23 |
+
passes MIB's submission verifier:
|
| 24 |
+
|
| 25 |
+
```text
|
| 26 |
+
Perfect submission! No errors or warnings. Found 1 valid triplet(s).
|
| 27 |
+
```
|
| 28 |
+
|
| 29 |
+
## Boundary
|
| 30 |
+
|
| 31 |
+
This artifact is a verifier-passing submission package. It is not an official
|
| 32 |
+
MIB leaderboard score, and it should not be interpreted as evidence of a learned
|
| 33 |
+
mechanistic localization method. The local CPU quick public run filtered zero
|
| 34 |
+
examples, so no public accuracy claim is made here.
|
| 35 |
+
|
| 36 |
+
## AANA Framing
|
| 37 |
+
|
| 38 |
+
AANA is used here as a verifier-gated release workflow around the MIB
|
| 39 |
+
submission contract: generate an artifact, verify the submission interface,
|
| 40 |
+
preserve a claim boundary, and defer leaderboard claims until benchmark scoring
|
| 41 |
+
is available.
|