mikeumus-divincian committed on
Commit 8fca862 · verified · 1 Parent(s): c659cf1

Add files using upload-large-folder tool

Files changed (11)
  1. README.md +109 -0
  2. SHA256SUMS +10 -0
  3. down_features.bin +3 -0
  4. down_meta.bin +3 -0
  5. embeddings.bin +3 -0
  6. gate3_results.json +15 -0
  7. gate_vectors.bin +3 -0
  8. index.json +311 -0
  9. manifest.json +16 -0
  10. norms.bin +3 -0
  11. router_weights.bin +3 -0
README.md ADDED
@@ -0,0 +1,109 @@
+ ---
+ license: cc-by-nc-4.0
+ tags:
+ - larql
+ - vindex
+ - mechanistic-interpretability
+ - feature-extraction
+ model_name: Llama 3.1-8B
+ base_model: meta-llama/Llama-3.1-8B
+ ---
+
+ # Llama 3.1-8B: LarQL Vindex
+
+ **Source model**: [meta-llama/Llama-3.1-8B](https://huggingface.co/meta-llama/Llama-3.1-8B)
+ **Vindex short ID**: `c39fad08`
+ **Layers**: 32 **Hidden size**: 4096 **Features per layer**: 128
+
+ ## What This Is
+
+ A **LarQL vindex** (vector index) is a compact binary representation of the feature geometry of `meta-llama/Llama-3.1-8B`. It contains the top-128 SVD directions of every MLP `gate_proj` and `down_proj` matrix in the network, plus token embeddings, layer norms, and vocabulary projection metadata.
+
+ ## What This Is NOT
+
+ This is **not** a model you can run for inference. It does not contain enough weights to generate text. It is a mechanistic interpretability artifact: a feature database for probing, editing, and comparing what `meta-llama/Llama-3.1-8B` has learned.
26
+ ## Universal Constants (Phase 2 Measurements)
27
+
28
+ Measured via forward-pass hooks on a 256-token factual probe text.
29
+
30
+ | Constant | Symbol | Value | Interpretation |
31
+ |----------|--------|-------|----------------|
32
+ | FFN Sparsity | C1 | 0.387 | Fraction of near-zero SwiGLU activations |
33
+ | Top-8 Prob Mass | C2 | 0.491 | Probability mass on top-8 output tokens |
34
+ | Gate Coherence | C3 | 0.808 | Mean cosine sim of adjacent gate_proj directions |
35
+ | Layer Temperature | C4 | 0.012 | Mean per-neuron SwiGLU activation variance |
36
+ | Circuit Stages | C5 | 2 | CKA transition count + 1 |
37
+
38
+ **Notes**: Base (non-instruct) model — C2=0.491 reflects flat continuation distribution, not constrained prediction. C4=0.012 is significantly below the Gemma/Ministral range (0.036–0.042), tentatively a Llama family signature.
+
+ ## Gate 3 Status (DELETE Patch Test)
+
+ PENDING: the forward-pass ΔW patch achieves only 1.3% Paris suppression, which confirms the MLP compensation trap. A full multi-layer LarQL service is required.
+
+ Gate 3 tests whether a rank-1 ΔW patch to `gate_proj.weight` at the top Paris→capital feature layer suppresses P(Paris) by ≥70% with ≤30% Berlin collateral damage.
+
+ ## Files
+
+ | File | Description |
+ |------|-------------|
+ | `gate_vectors.bin` | Top-128 SVD directions of gate_proj per layer \[L×F×H, f16\] |
+ | `down_features.bin` | Top-128 SVD directions of down_proj per layer \[L×F×H, f16\] |
+ | `embeddings.bin` | Token embedding matrix \[V×H, f16\] |
+ | `norms.bin` | Layer norm weight vectors |
+ | `down_meta.bin` | Per-feature top-k vocabulary projections |
+ | `index.json` | Vindex metadata (num_layers, hidden_size, per-layer num_features, extraction timestamp, etc.) |
+ | `manifest.json` | Build provenance (vindex SHA-256, short ID, per-file checksums) |
+ | `SHA256SUMS` | File integrity checksums |
+
+ ## How to Use
+
+ ```python
+ import numpy as np, json
+
+ vindex_dir = "path/to/downloaded/vindex"
+
+ with open(f"{vindex_dir}/index.json") as f:
+     idx = json.load(f)
+
+ # index.json stores the feature count per layer, not as a top-level key
+ L, F, H = idx["num_layers"], idx["layers"][0]["num_features"], idx["hidden_size"]
+ V = idx["vocab_size"]
+
+ # Load gate feature directions [L, F, H]
+ gate = np.fromfile(
+     f"{vindex_dir}/gate_vectors.bin",
+     dtype=np.float16
+ ).reshape(L, F, H).astype(np.float32)
+
+ # Load embeddings [V, H]
+ emb = np.fromfile(
+     f"{vindex_dir}/embeddings.bin",
+     dtype=np.float16
+ ).reshape(V, H).astype(np.float32)
+
+ # Score a token against all features (cosine similarity)
+ emb_n = emb / (np.linalg.norm(emb, axis=1, keepdims=True) + 1e-8)
+ gate_n = gate / (np.linalg.norm(gate, axis=2, keepdims=True) + 1e-8)
+
+ token_id = 12379 # e.g., " Paris"
+ scores = gate_n @ emb_n[token_id] # [L, F]
+ l_max, f_max = np.unravel_index(scores.argmax(), scores.shape)
+ print(f"Top feature: layer={l_max}, feature={f_max}, score={scores[l_max, f_max]:.4f}")
+ ```
+
+ ## License
+
+ CC-BY-NC 4.0. Research use only. The source model itself is distributed under the Llama 3.1 Community License, whose terms also apply.
+
+ ## Citation
+
+ If you use this vindex in published work, please cite:
+
+ ```
+ @misc{divinci2026larql,
+   title = {LarQL Vindex: Llama 3.1-8B},
+   author = {Divinci AI},
+   year = {2026},
+   url = {https://huggingface.co/Divinci-AI/llama-3.1-8b-vindex}
+ }
+ ```
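
The README describes the stored vectors as the top-128 SVD directions of each `gate_proj` and `down_proj` matrix, but the extraction code is not part of this commit. The following is only a minimal NumPy sketch of one way such directions could be computed. It assumes the Hugging Face weight layout (`gate_proj.weight` shaped `[intermediate, hidden]`, `down_proj.weight` shaped `[hidden, intermediate]`) and keeps the singular vectors that live in hidden space. The function name `top_k_hidden_directions` and the toy sizes are hypothetical.

```python
import numpy as np

def top_k_hidden_directions(weight: np.ndarray, k: int, hidden_axis: int) -> np.ndarray:
    """Return the k leading singular vectors of `weight` that live in hidden space.

    hidden_axis=1: weight is [intermediate, hidden] (gate_proj-like), use right singular vectors.
    hidden_axis=0: weight is [hidden, intermediate] (down_proj-like), use left singular vectors.
    """
    u, _, vt = np.linalg.svd(weight.astype(np.float32), full_matrices=False)
    if hidden_axis == 1:
        return vt[:k]        # shape [k, hidden]
    return u[:, :k].T        # shape [k, hidden]

# Toy matrices standing in for real weights (assumption: real sizes are 14336 x 4096).
rng = np.random.default_rng(0)
hidden, intermediate, k = 16, 48, 4
gate_w = rng.standard_normal((intermediate, hidden))
down_w = rng.standard_normal((hidden, intermediate))

gate_dirs = top_k_hidden_directions(gate_w, k, hidden_axis=1)
down_dirs = top_k_hidden_directions(down_w, k, hidden_axis=0)
print(gate_dirs.shape, down_dirs.shape)
```

Under these assumptions, each layer contributes an `[F, H]` block of orthonormal rows, which is consistent with the `[L×F×H, f16]` shapes listed in the Files table.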
SHA256SUMS ADDED
@@ -0,0 +1,10 @@
+ a813b9cd1938558ba3d0fe59b0c76d993d67e61430864b5c90f428914073fc01 README.md
+ 8981aed9ef5ae30baea31a9a55e95a0de5ce2a59e7e2b148f78e44c5a52f3941 down_features.bin
+ 42281f61eb844669dd6b6ab9544a07698c86d2910a70e3ec0b6cc35f278535d7 down_meta.bin
+ f1f60979ed67d6dea5906bcd5939cf043a595a7bd5dc82572d7741591afc2d08 embeddings.bin
+ 62deb5e6420dfcc7d49828017189bc3beac1e920cc3d4460420f120cb7a06780 gate3_results.json
+ 098f1acd53c23f7dc37768823ffba66dbae560f2e724c754ea54caa88c2742d5 gate_vectors.bin
+ 8d092ec2c2cc5a63b8273be1725774baf7f14223f9a47bd8d68c647484056659 index.json
+ 9139d50f3fd3b07159496f95f4193f8029d4a254b4f31d25588100f1c22bee4e manifest.json
+ 7da1dd5262fe1805f355ec5454e4d8dc650adfbb96a8fe026b3229913eeb56dc norms.bin
+ fcb9cba3c23f8612eb7237aa6d3e331a87b252adcb178310bd8fa5bb23a129a8 router_weights.bin
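
A downloaded copy can be checked against `SHA256SUMS` with the standard library alone. The sketch below assumes the usual `sha256sum` line format (`<hex digest> <file name>`); the helper names `sha256_of` and `verify_sums` are illustrative, and the demo runs against throwaway files rather than the real vindex.

```python
import hashlib
import tempfile
from pathlib import Path

def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so large binaries are never fully loaded into memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()

def verify_sums(directory: Path, sums_name: str = "SHA256SUMS") -> dict:
    """Map each listed file name to True (digest matches) or False (missing or mismatch)."""
    results = {}
    for line in (directory / sums_name).read_text().splitlines():
        if not line.strip():
            continue
        expected, name = line.split(maxsplit=1)
        name = name.lstrip("*")  # sha256sum marks binary-mode entries with '*'
        target = directory / name
        results[name] = target.is_file() and sha256_of(target) == expected
    return results

# Demo on temporary files: one intact, one whose content differs from its listed digest.
with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    (root / "good.bin").write_bytes(b"vindex")
    (root / "bad.bin").write_bytes(b"tampered")
    (root / "SHA256SUMS").write_text(
        f"{hashlib.sha256(b'vindex').hexdigest()}  good.bin\n"
        f"{hashlib.sha256(b'original').hexdigest()}  bad.bin\n"
    )
    demo_results = verify_sums(root)
print(demo_results)
```

The check should be run on the resolved binaries. A clone made without Git LFS contains only the small pointer files shown below, which would fail verification even though their `oid` fields carry the same digests.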
down_features.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:8981aed9ef5ae30baea31a9a55e95a0de5ce2a59e7e2b148f78e44c5a52f3941
+ size 33554432
down_meta.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:42281f61eb844669dd6b6ab9544a07698c86d2910a70e3ec0b6cc35f278535d7
+ size 360592
embeddings.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:f1f60979ed67d6dea5906bcd5939cf043a595a7bd5dc82572d7741591afc2d08
+ size 1050673152
gate3_results.json ADDED
@@ -0,0 +1,15 @@
+ {
+   "model": "llama31",
+   "vindex_dir": "/home/ubuntu/vindex/llama31-8b/vindex",
+   "paris_layer": 9,
+   "paris_feature": 79,
+   "paris_score_pre": 0.0679,
+   "paris_score_post": 0.0546,
+   "paris_drop_pct": 19.6,
+   "berlin_score_pre": 0.0671,
+   "berlin_score_post": 0.0671,
+   "berlin_drop_pct": 0.0,
+   "collateral_ratio": 0.0,
+   "verdict": "FAIL",
+   "reason": "Paris suppression too weak (19.6% < 70% required)"
+ }
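
The percentages and verdict in `gate3_results.json` can be reproduced from the recorded pre/post scores. This is a sketch, not the evaluation script from the repository: it assumes the drop is the relative change in score and that collateral damage is the control token's own relative drop, with the thresholds taken from the README (at least 70% target suppression, at most 30% collateral). The function name `gate3_verdict` is hypothetical.

```python
def gate3_verdict(target_pre, target_post, control_pre, control_post,
                  min_target_drop_pct=70.0, max_control_drop_pct=30.0):
    """Return (target drop %, control drop %, verdict) for a DELETE patch test."""
    target_drop = 100.0 * (target_pre - target_post) / target_pre
    control_drop = 100.0 * (control_pre - control_post) / control_pre
    passed = target_drop >= min_target_drop_pct and control_drop <= max_control_drop_pct
    return round(target_drop, 1), round(control_drop, 1), "PASS" if passed else "FAIL"

# Scores copied from gate3_results.json (Paris is the target, Berlin the control).
paris_drop, berlin_drop, verdict = gate3_verdict(0.0679, 0.0546, 0.0671, 0.0671)
print(paris_drop, berlin_drop, verdict)
```

With these inputs the sketch yields the same 19.6% Paris drop, 0.0% Berlin drop, and FAIL verdict as the JSON. The README's 1.3% figure is not derivable from these fields and presumably refers to a different measurement.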
gate_vectors.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:098f1acd53c23f7dc37768823ffba66dbae560f2e724c754ea54caa88c2742d5
+ size 33554432
index.json ADDED
@@ -0,0 +1,311 @@
+ {
+   "version": 1,
+   "model": "meta-llama/Llama-3.1-8B",
+   "family": "llama",
+   "source": {
+     "huggingface_repo": "/home/ubuntu/hf/hub/models--meta-llama--Llama-3.1-8B/snapshots/d04e592bb4f6aa9cfee91e2e20afa771667e1d4b",
+     "huggingface_revision": null,
+     "safetensors_sha256": null,
+     "extracted_at": "2026-04-21T09:31:31.631308Z",
+     "larql_version": "0.2.0-python"
+   },
+   "num_layers": 32,
+   "hidden_size": 4096,
+   "intermediate_size": 14336,
+   "vocab_size": 128256,
+   "embed_scale": 1.0,
+   "extract_level": "features",
+   "dtype": "f16",
+   "has_model_weights": false,
+   "down_top_k": 10,
+   "layer_bands": {
+     "syntax": [
+       0,
+       8
+     ],
+     "knowledge": [
+       8,
+       24
+     ],
+     "output": [
+       24,
+       31
+     ]
+   },
+   "layers": [
+     {
+       "layer": 0,
+       "num_features": 128,
+       "offset": 0,
+       "length": 1048576,
+       "num_experts": null,
+       "num_features_per_expert": null
+     },
+     {
+       "layer": 1,
+       "num_features": 128,
+       "offset": 1048576,
+       "length": 1048576,
+       "num_experts": null,
+       "num_features_per_expert": null
+     },
+     {
+       "layer": 2,
+       "num_features": 128,
+       "offset": 2097152,
+       "length": 1048576,
+       "num_experts": null,
+       "num_features_per_expert": null
+     },
+     {
+       "layer": 3,
+       "num_features": 128,
+       "offset": 3145728,
+       "length": 1048576,
+       "num_experts": null,
+       "num_features_per_expert": null
+     },
+     {
+       "layer": 4,
+       "num_features": 128,
+       "offset": 4194304,
+       "length": 1048576,
+       "num_experts": null,
+       "num_features_per_expert": null
+     },
+     {
+       "layer": 5,
+       "num_features": 128,
+       "offset": 5242880,
+       "length": 1048576,
+       "num_experts": null,
+       "num_features_per_expert": null
+     },
+     {
+       "layer": 6,
+       "num_features": 128,
+       "offset": 6291456,
+       "length": 1048576,
+       "num_experts": null,
+       "num_features_per_expert": null
+     },
+     {
+       "layer": 7,
+       "num_features": 128,
+       "offset": 7340032,
+       "length": 1048576,
+       "num_experts": null,
+       "num_features_per_expert": null
+     },
+     {
+       "layer": 8,
+       "num_features": 128,
+       "offset": 8388608,
+       "length": 1048576,
+       "num_experts": null,
+       "num_features_per_expert": null
+     },
+     {
+       "layer": 9,
+       "num_features": 128,
+       "offset": 9437184,
+       "length": 1048576,
+       "num_experts": null,
+       "num_features_per_expert": null
+     },
+     {
+       "layer": 10,
+       "num_features": 128,
+       "offset": 10485760,
+       "length": 1048576,
+       "num_experts": null,
+       "num_features_per_expert": null
+     },
+     {
+       "layer": 11,
+       "num_features": 128,
+       "offset": 11534336,
+       "length": 1048576,
+       "num_experts": null,
+       "num_features_per_expert": null
+     },
+     {
+       "layer": 12,
+       "num_features": 128,
+       "offset": 12582912,
+       "length": 1048576,
+       "num_experts": null,
+       "num_features_per_expert": null
+     },
+     {
+       "layer": 13,
+       "num_features": 128,
+       "offset": 13631488,
+       "length": 1048576,
+       "num_experts": null,
+       "num_features_per_expert": null
+     },
+     {
+       "layer": 14,
+       "num_features": 128,
+       "offset": 14680064,
+       "length": 1048576,
+       "num_experts": null,
+       "num_features_per_expert": null
+     },
+     {
+       "layer": 15,
+       "num_features": 128,
+       "offset": 15728640,
+       "length": 1048576,
+       "num_experts": null,
+       "num_features_per_expert": null
+     },
+     {
+       "layer": 16,
+       "num_features": 128,
+       "offset": 16777216,
+       "length": 1048576,
+       "num_experts": null,
+       "num_features_per_expert": null
+     },
+     {
+       "layer": 17,
+       "num_features": 128,
+       "offset": 17825792,
+       "length": 1048576,
+       "num_experts": null,
+       "num_features_per_expert": null
+     },
+     {
+       "layer": 18,
+       "num_features": 128,
+       "offset": 18874368,
+       "length": 1048576,
+       "num_experts": null,
+       "num_features_per_expert": null
+     },
+     {
+       "layer": 19,
+       "num_features": 128,
+       "offset": 19922944,
+       "length": 1048576,
+       "num_experts": null,
+       "num_features_per_expert": null
+     },
+     {
+       "layer": 20,
+       "num_features": 128,
+       "offset": 20971520,
+       "length": 1048576,
+       "num_experts": null,
+       "num_features_per_expert": null
+     },
+     {
+       "layer": 21,
+       "num_features": 128,
+       "offset": 22020096,
+       "length": 1048576,
+       "num_experts": null,
+       "num_features_per_expert": null
+     },
+     {
+       "layer": 22,
+       "num_features": 128,
+       "offset": 23068672,
+       "length": 1048576,
+       "num_experts": null,
+       "num_features_per_expert": null
+     },
+     {
+       "layer": 23,
+       "num_features": 128,
+       "offset": 24117248,
+       "length": 1048576,
+       "num_experts": null,
+       "num_features_per_expert": null
+     },
+     {
+       "layer": 24,
+       "num_features": 128,
+       "offset": 25165824,
+       "length": 1048576,
+       "num_experts": null,
+       "num_features_per_expert": null
+     },
+     {
+       "layer": 25,
+       "num_features": 128,
+       "offset": 26214400,
+       "length": 1048576,
+       "num_experts": null,
+       "num_features_per_expert": null
+     },
+     {
+       "layer": 26,
+       "num_features": 128,
+       "offset": 27262976,
+       "length": 1048576,
+       "num_experts": null,
+       "num_features_per_expert": null
+     },
+     {
+       "layer": 27,
+       "num_features": 128,
+       "offset": 28311552,
+       "length": 1048576,
+       "num_experts": null,
+       "num_features_per_expert": null
+     },
+     {
+       "layer": 28,
+       "num_features": 128,
+       "offset": 29360128,
+       "length": 1048576,
+       "num_experts": null,
+       "num_features_per_expert": null
+     },
+     {
+       "layer": 29,
+       "num_features": 128,
+       "offset": 30408704,
+       "length": 1048576,
+       "num_experts": null,
+       "num_features_per_expert": null
+     },
+     {
+       "layer": 30,
+       "num_features": 128,
+       "offset": 31457280,
+       "length": 1048576,
+       "num_experts": null,
+       "num_features_per_expert": null
+     },
+     {
+       "layer": 31,
+       "num_features": 128,
+       "offset": 32505856,
+       "length": 1048576,
+       "num_experts": null,
+       "num_features_per_expert": null
+     }
+   ],
+   "model_config": {
+     "model_type": "llama",
+     "hidden_size": 4096,
+     "moe": {
+       "num_experts": 1,
+       "top_k": 1,
+       "moe_intermediate_size": 14336,
+       "aggregated_features": 128,
+       "aggregation": "router_weighted_svd"
+     }
+   },
+   "checksums": {
+     "gate_vectors.bin": "098f1acd53c23f7dc37768823ffba66dbae560f2e724c754ea54caa88c2742d5",
+     "embeddings.bin": "f1f60979ed67d6dea5906bcd5939cf043a595a7bd5dc82572d7741591afc2d08",
+     "norms.bin": "7da1dd5262fe1805f355ec5454e4d8dc650adfbb96a8fe026b3229913eeb56dc",
+     "down_features.bin": "8981aed9ef5ae30baea31a9a55e95a0de5ce2a59e7e2b148f78e44c5a52f3941",
+     "down_meta.bin": "42281f61eb844669dd6b6ab9544a07698c86d2910a70e3ec0b6cc35f278535d7"
+   }
+ }
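
The per-layer `offset` and `length` values in `index.json` appear to be byte positions: 128 features × 4096 hidden units × 2 bytes per f16 value is 1,048,576, and 32 such blocks add up to the 33,554,432-byte size of `gate_vectors.bin`. Assuming that reading is correct, a single layer can be sliced out of the packed buffer without reshaping the whole file. The helper name `load_layer` and the toy buffer are illustrative only.

```python
import numpy as np

def load_layer(blob: bytes, layer_entry: dict, hidden_size: int) -> np.ndarray:
    """Slice one layer's [num_features, hidden_size] f16 block out of a packed buffer.

    Assumes `offset` and `length` in the index entry are measured in bytes.
    """
    n_feat = layer_entry["num_features"]
    expected_bytes = n_feat * hidden_size * 2  # 2 bytes per float16 value
    if layer_entry["length"] != expected_bytes:
        raise ValueError(f"length {layer_entry['length']} != expected {expected_bytes}")
    flat = np.frombuffer(blob, dtype=np.float16,
                         count=n_feat * hidden_size, offset=layer_entry["offset"])
    return flat.reshape(n_feat, hidden_size)

# Toy stand-in for gate_vectors.bin: 3 layers, 4 features, hidden size 8.
toy_hidden, toy_feats, toy_layers = 8, 4, 3
toy_blob = np.arange(toy_layers * toy_feats * toy_hidden, dtype=np.float16).tobytes()
toy_index = [
    {"layer": i, "num_features": toy_feats,
     "offset": i * toy_feats * toy_hidden * 2, "length": toy_feats * toy_hidden * 2}
    for i in range(toy_layers)
]
layer1 = load_layer(toy_blob, toy_index[1], toy_hidden)
print(layer1.shape, float(layer1[0, 0]))

# Arithmetic check against the real index values quoted above.
real_block_bytes = 128 * 4096 * 2
print(real_block_bytes, 32 * real_block_bytes)
```

For the real files, the same function could be given the bytes of `gate_vectors.bin` (or a `numpy.memmap` of it) together with an entry from `idx["layers"]`.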
manifest.json ADDED
@@ -0,0 +1,16 @@
+ {
+   "vindexSha256": "c39fad08e04da51ae78079a3f0c921f984bff335e535101d01f3843b587d45d6",
+   "shortId": "c39fad08",
+   "baseModel": "meta-llama/Llama-3.1-8B",
+   "extractLevel": "features",
+   "f16": true,
+   "totalBytes": 1118937296,
+   "files": {
+     "gate_vectors.bin": "098f1acd53c23f7dc37768823ffba66dbae560f2e724c754ea54caa88c2742d5",
+     "embeddings.bin": "f1f60979ed67d6dea5906bcd5939cf043a595a7bd5dc82572d7741591afc2d08",
+     "norms.bin": "7da1dd5262fe1805f355ec5454e4d8dc650adfbb96a8fe026b3229913eeb56dc",
+     "down_features.bin": "8981aed9ef5ae30baea31a9a55e95a0de5ce2a59e7e2b148f78e44c5a52f3941",
+     "down_meta.bin": "42281f61eb844669dd6b6ab9544a07698c86d2910a70e3ec0b6cc35f278535d7",
+     "router_weights.bin": "fcb9cba3c23f8612eb7237aa6d3e331a87b252adcb178310bd8fa5bb23a129a8"
+   }
+ }
norms.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:7da1dd5262fe1805f355ec5454e4d8dc650adfbb96a8fe026b3229913eeb56dc
+ size 532480
router_weights.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:fcb9cba3c23f8612eb7237aa6d3e331a87b252adcb178310bd8fa5bb23a129a8
+ size 262208
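
Each `.bin` entry in this commit is a Git LFS pointer holding a spec version, a SHA-256 `oid`, and a byte size. As a small consistency check, the sketch below parses one pointer and sums the sizes of the six binaries shown in the pointers above, which matches `totalBytes` in `manifest.json`. The parser name `parse_lfs_pointer` is hypothetical, and it handles only the three keys that appear here.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file into a dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, _, digest = fields["oid"].partition(":")
    return {"version": fields["version"], "algo": algo,
            "oid": digest, "size": int(fields["size"])}

pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:fcb9cba3c23f8612eb7237aa6d3e331a87b252adcb178310bd8fa5bb23a129a8\n"
    "size 262208\n"
)

# Sizes copied from the six LFS pointers in this commit.
binary_sizes = {
    "down_features.bin": 33554432,
    "down_meta.bin": 360592,
    "embeddings.bin": 1050673152,
    "gate_vectors.bin": 33554432,
    "norms.bin": 532480,
    "router_weights.bin": 262208,
}
total_bytes = sum(binary_sizes.values())
manifest_total = 1118937296  # "totalBytes" from manifest.json
print(pointer["algo"], pointer["size"], total_bytes == manifest_total)
```

The sum covers only the binaries. The README, `SHA256SUMS`, and JSON files are evidently not counted in `totalBytes`.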