Add files using upload-large-folder tool

8fca862 verified about 1 month ago

4.09 kB

	---
	license: cc-by-nc-4.0
	tags:
	- larql
	- vindex
	- mechanistic-interpretability
	- feature-extraction
	model_name: Llama 3.1-8B
	base_model: meta-llama/Llama-3.1-8B
	---

	# Llama 3.1-8B — LarQL Vindex

	Source model: [meta-llama/Llama-3.1-8B](https://huggingface.co/meta-llama/Llama-3.1-8B)
	Vindex short ID: `c39fad08`
	Layers: 32 Hidden size: 4096 Features per layer: 128

	## What This Is

	A LarQL vindex (vector index) — a compact binary representation of the feature geometry of `meta-llama/Llama-3.1-8B`. It contains the top-128 SVD directions of every MLP gate_proj and down_proj matrix in the network, plus token embeddings, layer norms, and vocabulary projection metadata.

	## What This Is NOT

	This is not a model you can run for inference. It has no weights sufficient to generate text. It is a mechanistic interpretability artifact: a feature database for probing, editing, and comparing what `meta-llama/Llama-3.1-8B` has learned.

	## Universal Constants (Phase 2 Measurements)

	Measured via forward-pass hooks on a 256-token factual probe text.

	\| Constant \| Symbol \| Value \| Interpretation \|
	\|----------\|--------\|-------\|----------------\|
	\| FFN Sparsity \| C1 \| 0.387 \| Fraction of near-zero SwiGLU activations \|
	\| Top-8 Prob Mass \| C2 \| 0.491 \| Probability mass on top-8 output tokens \|
	\| Gate Coherence \| C3 \| 0.808 \| Mean cosine sim of adjacent gate_proj directions \|
	\| Layer Temperature \| C4 \| 0.012 \| Mean per-neuron SwiGLU activation variance \|
	\| Circuit Stages \| C5 \| 2 \| CKA transition count + 1 \|

	Notes: Base (non-instruct) model — C2=0.491 reflects flat continuation distribution, not constrained prediction. C4=0.012 is significantly below the Gemma/Ministral range (0.036–0.042), tentatively a Llama family signature.

	## Gate 3 Status (DELETE Patch Test)

	PENDING — forward-pass ΔW achieves only 1.3% Paris suppression; MLP compensation trap confirmed. Full multi-layer LarQL service required.

	Gate 3 tests whether a rank-1 ΔW patch to `gate_proj.weight` at the top Paris→capital feature layer suppresses P(Paris) by ≥70% with ≤30% Berlin collateral damage.

	## Files

	\| File \| Description \|
	\|------\|-------------\|
	\| `gate_vectors.bin` \| Top-128 SVD directions of gate_proj per layer \[L×F×H, f16\] \|
	\| `down_features.bin` \| Top-128 SVD directions of down_proj per layer \[L×F×H, f16\] \|
	\| `embeddings.bin` \| Token embedding matrix \[V×H, f16\] \|
	\| `norms.bin` \| Layer norm weight vectors \|
	\| `down_meta.bin` \| Per-feature top-k vocabulary projections \|
	\| `index.json` \| Vindex metadata (layers, hidden_size, num_feats, etc.) \|
	\| `manifest.json` \| Build provenance (source SHA, extraction timestamp) \|
	\| `SHA256SUMS` \| File integrity checksums \|

	## How to Use

	```python
	import numpy as np, json

	vindex_dir = "path/to/downloaded/vindex"

	with open(f"{vindex_dir}/index.json") as f:
	idx = json.load(f)

	L, F, H = idx["num_layers"], idx["num_feats"], idx["hidden_size"]
	V = idx["vocab_size"]

	# Load gate feature directions [L, F, H]
	gate = np.frombuffer(
	open(f"{vindex_dir}/gate_vectors.bin", "rb").read(),
	dtype=np.float16
	).reshape(L, F, H).astype(np.float32)

	# Load embeddings [V, H]
	emb = np.frombuffer(
	open(f"{vindex_dir}/embeddings.bin", "rb").read(),
	dtype=np.float16
	).reshape(V, H).astype(np.float32)

	# Score a token against all features (cosine similarity)
	emb_n = emb / (np.linalg.norm(emb, axis=1, keepdims=True) + 1e-8)
	gate_n = gate / (np.linalg.norm(gate, axis=2, keepdims=True) + 1e-8)

	token_id = 12379 # e.g., " Paris"
	scores = gate_n @ emb_n[token_id] # [L, F]
	l_max, f_max = np.unravel_index(scores.argmax(), scores.shape)
	print(f"Top feature: layer={l_max}, feature={f_max}, score={scores[l_max, f_max]:.4f}")
	```

	## License

	CC-BY-NC 4.0 — same terms as the source model. Research use only.

	## Citation

	If you use this vindex in published work, please cite:

	```
	@misc{divinci2026larql,
	title = {LarQL Vindex: Llama 3.1-8B},
	author = {Divinci AI},
	year = {2026},
	url = {https://huggingface.co/Divinci-AI/llama-3.1-8b-vindex}
	}
	```