y3i12 commited on
Commit
0dc1d2a
·
1 Parent(s): 875107b

adds colors and images to readme; adds ANALYSIS.md

Browse files
Files changed (32) hide show
  1. ANALYSIS.md +166 -0
  2. README.md +23 -16
  3. img/bench_results_vs_checkpoint.png +3 -0
  4. img/bench_vs_model.png +3 -0
  5. img/prisma.png +3 -0
  6. img/prisma_arch.png +3 -0
  7. img/training_budget_vs_model.png +3 -0
  8. img/training_phase_1_openwebtext_20p_x4.png +3 -0
  9. img/training_phase_2_fineweb_edu_10B_x2.png +3 -0
  10. scripts/representation_output/prisma/cka_self.png +3 -0
  11. scripts/representation_output/prisma/logit_lens_summary.png +3 -0
  12. scripts/representation_output/prisma/representation_drift.png +3 -0
  13. scripts/representation_output/prisma/results.json +2587 -0
  14. scripts/spectral_output/prisma/activation_rank_progression.png +3 -0
  15. scripts/spectral_output/prisma/activation_spectra.png +3 -0
  16. scripts/spectral_output/prisma/comparison.png +3 -0
  17. scripts/spectral_output/prisma/embedding_alignment.png +3 -0
  18. scripts/spectral_output/prisma/gate_spectra.png +3 -0
  19. scripts/spectral_output/prisma/layer_progression.png +3 -0
  20. scripts/spectral_output/prisma/mirror_pair_comparison.png +3 -0
  21. scripts/spectral_output/prisma/results.json +0 -0
  22. scripts/spectral_output/prisma/results_b.json +2844 -0
  23. scripts/spectral_output/prisma/weight_spectra_attention_K.png +3 -0
  24. scripts/spectral_output/prisma/weight_spectra_attention_O.png +3 -0
  25. scripts/spectral_output/prisma/weight_spectra_attention_Q.png +3 -0
  26. scripts/spectral_output/prisma/weight_spectra_attention_V.png +3 -0
  27. scripts/spectral_output/prisma/weight_spectra_embedding.png +3 -0
  28. scripts/spectral_output/prisma/weight_spectra_ffn_W1.png +3 -0
  29. scripts/spectral_output/prisma/weight_spectra_ffn_W2.png +3 -0
  30. scripts/spectral_output/prisma/weight_spectra_ffn_gate_W3.png +3 -0
  31. scripts/spectral_output/prisma/weight_spectra_ffn_gate_W4.png +3 -0
  32. scripts/spectral_output/prisma/weight_spectra_other.png +3 -0
ANALYSIS.md ADDED
@@ -0,0 +1,166 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Prisma 357M — Spectral and Representation Analysis
2
+
3
+ Post-training analysis of the Prisma 357M checkpoint. All plots generated from the final model weights and activations on a sample of WikiText-103 validation data.
4
+
5
+ Scripts: `scripts/spectral_analysis.py`, `scripts/representation_analysis.py`
6
+
7
+ ---
8
+
9
+ ## Representation Analysis
10
+
11
+ ### CKA Self-Similarity
12
+
13
+ Centered Kernel Alignment between all layer pairs. Measures whether two layers encode similar representational structure regardless of rotation or scaling.
14
+
15
+ ![cka_self](scripts/representation_output/prisma/cka_self.png)
16
+
17
+ The matrix reveals three distinct processing regimes separated by sharp boundaries:
18
+
19
+ - **Expand E0-E7** (top-left bright block): High mutual similarity — these layers incrementally refine the embedding representation. The model stays close to input space.
20
+ - **Expand E8-E19** (second block): A different representational regime emerges around E8. These layers are similar to each other but dissimilar to both early expand and compress layers — the model has moved into an abstract internal space.
21
+ - **Middle M0**: Near-zero CKA with almost everything. The single middle layer is a representational bottleneck — it transforms between expand and compress coordinate systems.
22
+ - **Compress C0-C19** (bottom-right block): The compress phase rebuilds similarity gradually. Early compress layers (C0-C5) are transitional; C6+ form their own coherent block.
23
+
24
+ The off-diagonal structure is telling: expand layers have essentially zero CKA with late compress layers, confirming that the two phases operate in genuinely different representational spaces despite sharing weights. The gates (W3/W4) are doing real work — same W1/W2 matrices produce completely different representations depending on direction.
25
+
26
+
27
+ ### Logit Lens
28
+
29
+ Projects intermediate representations through the output head at each layer to see when the model "knows" the answer.
30
+
31
+ ![logit_lens_summary](scripts/representation_output/prisma/logit_lens_summary.png)
32
+
33
+ Four views of prediction formation across the 41-layer pipeline:
34
+
35
+ - **Prediction entropy** (top-left): Near-zero through the expand phase — representations are close to embedding space, so the output head produces confident (but wrong) predictions based on surface similarity. Entropy spikes at the middle layer and stays high through compress. The model is doing real work — disrupting surface-level confidence to build correct predictions.
36
+ - **Top-1 probability** (top-right): Mirror of entropy. High confidence in expand (misleading), drops at middle, partially recovers in compress as correct predictions crystallize.
37
+ - **Median rank of correct token** (bottom-left): The correct token starts at rank ~10,000 (out of 32K vocabulary) at the embedding layer and drops to rank ~1 by the final compress layers. The expand phase brings it from 10K to ~30 (coarse semantic neighborhood); the compress phase refines from ~30 to 1 (precise token selection).
38
+ - **Convergence toward final prediction** (bottom-right): Agreement with the model's actual output. Near-zero throughout expand — the model hasn't committed to an answer. Begins climbing at C0 and reaches 1.0 by C19. The compress phase IS the decision-making process.
39
+
40
+ The logit lens confirms the architectural hypothesis: expand builds abstract representations (moving away from token space), compress converts them back into token predictions (moving toward output space).
41
+
42
+
43
+ ### Representation Drift
44
+
45
+ How much each layer changes the representation relative to its predecessor.
46
+
47
+ ![representation_drift](scripts/representation_output/prisma/representation_drift.png)
48
+
49
+ - **Cosine similarity** (left): E0 makes the biggest directional change (0.68) — the first layer reorients the embedding substantially. Subsequent expand layers are increasingly gentle (0.93-0.99). The middle layer drops to 0.89 — another major reorientation. Compress layers stay high (0.93-0.98), making incremental adjustments.
50
+ - **L2 distance** (right): E0 has the largest magnitude change (~29). Expand layers settle to small updates (5-15). Compress layers show gradually increasing L2 distance (15-35) — each compress layer makes a larger magnitude adjustment than the last, consistent with the progressive refinement visible in the logit lens. The final norm (gray, ~85) applies the largest single transformation, collapsing the representation to output scale.
51
+
52
+ The asymmetry between cosine (high = small angular change) and L2 (growing through compress) suggests that compress layers maintain the representational direction established by expand while progressively scaling and sharpening specific features for token prediction.
53
+
54
+ ---
55
+
56
+ ## Spectral Analysis
57
+
58
+ ### Activation Effective Rank Progression
59
+
60
+ Effective rank of activation covariance matrices at each layer — measures the dimensionality of the representation (how many independent directions carry meaningful variance).
61
+
62
+ ![activation_rank_progression](scripts/spectral_output/prisma/activation_rank_progression.png)
63
+
64
+ The hourglass is visible in the numbers:
65
+
66
+ - **Embedding**: erank ~215 (the input dimensionality baseline)
67
+ - **Expand E0-E6**: erank ~220, slightly above embedding — the representation briefly expands
68
+ - **Expand E7-E16**: Gradual decline from ~220 to ~170 — progressive abstraction compresses the representation
69
+ - **Expand E17-E19**: Sharp collapse to ~45 — the final expand layers aggressively compress toward the bottleneck
70
+ - **Middle M0**: erank ~50 — the bottleneck. The entire model's information passes through ~50 effective dimensions
71
+ - **Compress C0-C2**: Rapid recovery to ~75, ~130, ~190
72
+ - **Compress C3-C19**: Climbs to ~230, exceeding the embedding baseline — the compress phase reconstructs a richer representation than the input
73
+
74
+ The compress phase doesn't mirror the expand phase — it *overshoots*, producing higher effective rank. This makes sense: the output representation needs to distinguish between 32K tokens, requiring more dimensions than the embedding's initial encoding.
75
+
76
+
77
+ ### Activation Eigenspectra
78
+
79
+ Eigenvalue distributions and cumulative variance concentration across all layers.
80
+
81
+ ![activation_spectra](scripts/spectral_output/prisma/activation_spectra.png)
82
+
83
+ - **Left (eigenvalue distribution)**: Expand layers (blue) show flatter spectra (more distributed variance), middle layers have steep spectra (concentrated variance), compress layers (red) rebuild distributed spectra. The spectral shape changes continuously through the pipeline.
84
+ - **Right (cumulative variance)**: The middle layer concentrates 90% of variance in ~50 components. Expand and compress layers need 150-200+ components to reach 90%. This confirms the bottleneck isn't just in effective rank — the actual information content is compressed.
85
+
86
+
87
+ ### Mirror Pair Activation Comparison
88
+
89
+ Activation spectra for each mirror pair, comparing expand vs compress phases. Each subplot shows one shared-weight pair operating in both directions.
90
+
91
+ ![mirror_pair_comparison](scripts/spectral_output/prisma/mirror_pair_comparison.png)
92
+
93
+ Despite sharing W1 and W2, expand and compress activations have different spectral profiles. The gates (W3/W4) reshape the spectral distribution without changing the structural transformation. Earlier pairs show larger expand/compress divergence; later pairs converge — consistent with the CKA finding that late expand and early compress layers are the most dissimilar.
94
+
95
+
96
+ ### Embedding vs Final Activation Spectra
97
+
98
+ Direct comparison of the embedding matrix spectrum and the final-layer activation spectrum.
99
+
100
+ ![embedding_alignment](scripts/spectral_output/prisma/embedding_alignment.png)
101
+
102
+ - **Embedding**: erank 955, nearly flat spectrum — the frozen MobileLLM embeddings distribute information broadly across all 1024 dimensions with minimal concentration.
103
+ - **Final activation**: erank 218, steep spectrum — the model has learned to concentrate its output into ~218 active dimensions, with the top 25 components carrying ~50% of variance.
104
+
105
+ The model takes a broadly distributed input signal and progressively concentrates it into a lower-dimensional but more structured output. The 4.4x rank reduction (955 to 218) is the spectral signature of the expand-compress pipeline.
106
+
107
+
108
+ ### G2LU Gate Spectra (W3 vs W4)
109
+
110
+ Weight spectra and effective rank of the outer gate (W3) vs inner gate (W4) across all 21 mirror pairs.
111
+
112
+ ![gate_spectra](scripts/spectral_output/prisma/gate_spectra.png)
113
+
114
+ - **Top (weight spectra)**: W3 and W4 have nearly identical spectral shapes — both use the full rank of the weight matrix. Neither gate has collapsed or become low-rank. The nested gating structure (W4 modulates W3) doesn't force one gate to become simpler than the other.
115
+ - **Bottom (effective rank per pair)**: Both gates maintain erank ~900 (out of 1024) across all pairs, with W4 (inner) slightly lower than W3 (outer) in most pairs. The ~50-rank gap is consistent: the inner gate uses slightly fewer effective dimensions, potentially because it operates as a coarser filter that the outer gate refines.
116
+
117
+
118
+ ### Layer-wise Spectral Properties (FFN W1)
119
+
120
+ Spectral metrics for the shared W1 projection matrix across all mirror pairs.
121
+
122
+ ![layer_progression](scripts/spectral_output/prisma/layer_progression.png)
123
+
124
+ - **Effective rank** (top-left): Uniformly high (~930) across all layers — no weight collapse anywhere.
125
+ - **Stable rank** (top-right): Peaks sharply at pairs 10-11 (the layers nearest the architectural midpoint) with stable rank ~50, compared to ~20-30 for other pairs. The midpoint layers have the most distributed singular value spectra — the most "general" transformations.
126
+ - **Power-law alpha** (bottom-left): All layers below the alpha=2 boundary (heavy-tailed), indicating structured, non-random weight matrices. Remarkably uniform across layers (~0.30-0.35).
127
+ - **Signal ratio** (bottom-right): Very low (<0.006) — almost all singular values fall below the Marchenko-Pastur noise bound. This is consistent with the distributed, non-sparse nature of the weights; information is encoded in the collective spectrum rather than in isolated large singular values.
128
+
129
+
130
+ ### Spectral Comparison: Prisma vs GPT-2 Medium
131
+
132
+ Layer-by-layer comparison of spectral properties between Prisma 357M and GPT-2 Medium (355M).
133
+
134
+ ![comparison](scripts/spectral_output/prisma/comparison.png)
135
+
136
+ - **Effective rank**: Nearly identical (~930-950) for both models across all layers. The mirrored architecture doesn't sacrifice weight expressiveness.
137
+ - **Stable rank**: Prisma shows more variation between layers (range ~20-50) vs GPT-2's flatter profile (~25-35). The midpoint peak in Prisma has no equivalent in GPT-2 — it's a structural consequence of the mirrored architecture.
138
+ - **Power-law alpha**: Both models in the same range (0.25-0.35), both heavy-tailed. Prisma's alpha is slightly more uniform, potentially reflecting the regularizing effect of weight sharing.
139
+
140
+
141
+ ### Weight Spectra by Component
142
+
143
+ Individual weight spectra (linear and log-log scale) for each parameter type across all layers.
144
+
145
+ #### Embedding
146
+ ![weight_spectra_embedding](scripts/spectral_output/prisma/weight_spectra_embedding.png)
147
+
148
+ The frozen MobileLLM embedding has one dominant singular value (~115) with a steep drop to ~18 for the second, then a gradual tail. The Marchenko-Pastur bound (dotted line at ~35) shows only 1 singular value above the noise floor — the embedding matrix is effectively rank-1 plus structured noise. This is the "fixed coordinate system" that anchors the entire model.
149
+
150
+ #### FFN Shared Projection (W1)
151
+ ![weight_spectra_ffn_W1](scripts/spectral_output/prisma/weight_spectra_ffn_W1.png)
152
+
153
+ All 21 W1 matrices have similar spectral profiles: top singular value ~16, smooth decay. Tight clustering in log-log confirms that shared weights don't develop pathological layer-specific structure.
154
+
155
+ #### Outer Gate (W3)
156
+ ![weight_spectra_ffn_gate_W3](scripts/spectral_output/prisma/weight_spectra_ffn_gate_W3.png)
157
+
158
+ #### Inner Gate (W4)
159
+ ![weight_spectra_ffn_gate_W4](scripts/spectral_output/prisma/weight_spectra_ffn_gate_W4.png)
160
+
161
+ W3 and W4 are spectrally near-identical at the weight level — confirming that their functional differentiation (inner vs outer gate) emerges from the *compositional structure* of G2LU rather than from divergent weight distributions. The nested relationship `silu(W3@x * silu(W4@x))` creates functional asymmetry from structural symmetry.
162
+
163
+ #### Attention Q Projections
164
+ ![weight_spectra_attention_Q](scripts/spectral_output/prisma/weight_spectra_attention_Q.png)
165
+
166
+ Q projections show slightly more inter-layer variance than FFN weights — different layers attend to different things, while FFN transformations stay more uniform. The log-log tail follows a clean power law across all layers.
README.md CHANGED
@@ -7,27 +7,20 @@ language:
7
  - en
8
  pipeline_tag: text-generation
9
  ---
 
 
10
  # Prisma
 
 
 
 
 
11
 
12
  A prototype model that is assembled as a mirrored transformer architecture with nested gating (adds an extra weight to the FFN) and morphological position encoding. It proposes that the model architecture creates different scaffolding, leading to different training regimens and capabilities.
13
 
14
  Prisma is only viable as it piggybacks on pre-trained tokenizers and their weight-tied embeddings, it decomposes the transformer architecture into symmetric **expand** and **compress** phases that share structural weights, connected by a small number of unique **middle** layers. Information expands from tokens to semantics, then compresses back — like light through a prism.
15
 
16
- ```
17
- Token Embeddings
18
- |
19
- [ Expand ] ─── mirror pair 1 (W1, W2 shared) ── G²LU gate (W3·W4)
20
- [ Expand ] ─── mirror pair 2
21
- [ .... ] ─── mirror pair N
22
- |
23
- [ Middle ] ─── unique layers (full capacity, not shared)
24
- |
25
- [Compress ] ─── mirror pair N (same W1, W2 as expand N)
26
- [Compress ] ─── mirror pair 2
27
- [Compress ] ─── mirror pair 1
28
- |
29
- LM Head (weight-tied to embeddings)
30
- ```
31
 
32
 
33
  ## Key Concepts
@@ -66,7 +59,7 @@ The regularization stack (mirroring + G²LU + WoRPE) enables training at **100x
66
 
67
  ### ~350M scale prototype — comparison with published models
68
 
69
- Prisma 357M trained on ~30B tokens (OpenWebText 20% + FineWeb-Edu 10BT continued training), compared against published models at similar scale:
70
 
71
  | Model | Params | Train Data | ARC-C\* | ARC-E\* | BoolQ | HellaSwag\* | LAMBADA | PIQA\* | WikiText\*\* | WinoGrande |
72
  |---|---|---|---|---|---|---|---|---|---|---|
@@ -79,6 +72,11 @@ Prisma 357M trained on ~30B tokens (OpenWebText 20% + FineWeb-Edu 10BT continued
79
 
80
  \* *normalized accuracy* · \*\* *word perplexity*
81
 
 
 
 
 
 
82
  **Key findings:**
83
  - **Beats GPT-2 medium on 5/8 benchmarks** (ARC-C, ARC-E, BoolQ, HellaSwag, PIQA) with 25% less training data.
84
  - **Beats Baguettotron (200B) on 6/8 benchmarks** — including PPL — with **7x less data.**
@@ -97,6 +95,13 @@ Prisma 357M trained on ~30B tokens (OpenWebText 20% + FineWeb-Edu 10BT continued
97
  | + continued (FineWeb c1) | 1e-3 | 0.249 | 0.434 | 0.601 | 0.333 | 0.312 | 0.626 | 34.7 |
98
  | + continued (FineWeb c2) | 1e-3 | 0.290 | 0.548 | 0.620 | 0.427 | 0.362 | 0.670 | 27.4 |
99
 
 
 
 
 
 
 
 
100
 
101
  ## Quick Start
102
 
@@ -241,6 +246,8 @@ lm_eval --task arc_challenge,arc_easy,boolq,hellaswag,lambada_openai,piqa,wikite
241
 
242
  ## Architecture Details
243
 
 
 
244
  ### Why Mirroring Works
245
 
246
  Mirroring only works due to the additional gate. W3 and W4 specialize to serve different roles despite sharing weights — spectral analysis confirms the gates swap their stable-rank profiles at the architectural midpoint. The order of mirror layers may be rearrangeable, as the gates adapt to whatever representations flow through them.
 
7
  - en
8
  pipeline_tag: text-generation
9
  ---
10
+ <div align="center">
11
+
12
  # Prisma
13
+ ![prisma](img/prisma.png)
14
+
15
+ </div>
16
+
17
+ ---
18
 
19
  A prototype model that is assembled as a mirrored transformer architecture with nested gating (adds an extra weight to the FFN) and morphological position encoding. It proposes that the model architecture creates different scaffolding, leading to different training regimens and capabilities.
20
 
21
  Prisma is only viable as it piggybacks on pre-trained tokenizers and their weight-tied embeddings, it decomposes the transformer architecture into symmetric **expand** and **compress** phases that share structural weights, connected by a small number of unique **middle** layers. Information expands from tokens to semantics, then compresses back — like light through a prism.
22
 
23
+ ![architecture](img/prisma_arch.png)
 
 
 
 
 
 
 
 
 
 
 
 
 
 
24
 
25
 
26
  ## Key Concepts
 
59
 
60
  ### ~350M scale prototype — comparison with published models
61
 
62
+ Prisma 357M trained on ~30B tokens (OpenWebText 20% + FineWeb-Edu 10BT continued training), compared against published models at similar scale.
63
 
64
  | Model | Params | Train Data | ARC-C\* | ARC-E\* | BoolQ | HellaSwag\* | LAMBADA | PIQA\* | WikiText\*\* | WinoGrande |
65
  |---|---|---|---|---|---|---|---|---|---|---|
 
72
 
73
  \* *normalized accuracy* · \*\* *word perplexity*
74
 
75
+ ![training_budget](img/training_budget_vs_model.png)
76
+
77
+
78
+ ![bench_vs_model](img/bench_vs_model.png)
79
+
80
  **Key findings:**
81
  - **Beats GPT-2 medium on 5/8 benchmarks** (ARC-C, ARC-E, BoolQ, HellaSwag, PIQA) with 25% less training data.
82
  - **Beats Baguettotron (200B) on 6/8 benchmarks** — including PPL — with **7x less data.**
 
95
  | + continued (FineWeb c1) | 1e-3 | 0.249 | 0.434 | 0.601 | 0.333 | 0.312 | 0.626 | 34.7 |
96
  | + continued (FineWeb c2) | 1e-3 | 0.290 | 0.548 | 0.620 | 0.427 | 0.362 | 0.670 | 27.4 |
97
 
98
+ #### Prisma 357M detailed training charts
99
+
100
+ ![training_phase_1](img/training_phase_1_openwebtext_20p_x4.png)
101
+
102
+ ![training_phase_2](img/training_phase_2_fineweb_edu_10B_x2.png)
103
+
104
+ ![bench_results_per_checkpoint](img/bench_results_vs_checkpoint.png)
105
 
106
  ## Quick Start
107
 
 
246
 
247
  ## Architecture Details
248
 
249
+ Full spectral and representation analysis with all plots: **[ANALYSIS.md](ANALYSIS.md)**
250
+
251
  ### Why Mirroring Works
252
 
253
  Mirroring only works due to the additional gate. W3 and W4 specialize to serve different roles despite sharing weights — spectral analysis confirms the gates swap their stable-rank profiles at the architectural midpoint. The order of mirror layers may be rearrangeable, as the gates adapt to whatever representations flow through them.
img/bench_results_vs_checkpoint.png ADDED

Git LFS Details

  • SHA256: 406b825c148a6debabf448e0024415a151c90c763487c0fbecdf23293f1af450
  • Pointer size: 130 Bytes
  • Size of remote file: 69.2 kB
img/bench_vs_model.png ADDED

Git LFS Details

  • SHA256: 8014483d0791626fa996be004e373023a1d0d10fb303539ea19ad89227258ef2
  • Pointer size: 130 Bytes
  • Size of remote file: 59.8 kB
img/prisma.png ADDED

Git LFS Details

  • SHA256: e626897259a078d70e4518e70ec03e25f57156708b07a3520f6a7400c2ffa3e4
  • Pointer size: 130 Bytes
  • Size of remote file: 17.6 kB
img/prisma_arch.png ADDED

Git LFS Details

  • SHA256: 71fba6710170e842fcd36129ff4a0c0f71cf82589997bd6a713c718dc21b8ca1
  • Pointer size: 131 Bytes
  • Size of remote file: 114 kB
img/training_budget_vs_model.png ADDED

Git LFS Details

  • SHA256: b036f233ab9bf7c14a1c2023d84c01aaced3318378fe7de2f63c6e75e7757f19
  • Pointer size: 130 Bytes
  • Size of remote file: 22.5 kB
img/training_phase_1_openwebtext_20p_x4.png ADDED

Git LFS Details

  • SHA256: 5f7de2c67c248f3627043e460e2f1a285b7b31166f06d7c8ae3e519daab9342a
  • Pointer size: 130 Bytes
  • Size of remote file: 38 kB
img/training_phase_2_fineweb_edu_10B_x2.png ADDED

Git LFS Details

  • SHA256: b0c5f267063f725445d2d88e71e7fddce6f6ddf028f23dc80d702f4dd59298fb
  • Pointer size: 130 Bytes
  • Size of remote file: 37.3 kB
scripts/representation_output/prisma/cka_self.png ADDED

Git LFS Details

  • SHA256: 7d8d66d85869ea9fd53aca2b524d866a2c234f8b84f120f1fc9e50d9f7169a38
  • Pointer size: 130 Bytes
  • Size of remote file: 97.2 kB
scripts/representation_output/prisma/logit_lens_summary.png ADDED

Git LFS Details

  • SHA256: 168d152bd05007cc3b40a5aa44b227b9b85a6d6809ae4ff7e24d1bb42e4b9e89
  • Pointer size: 131 Bytes
  • Size of remote file: 125 kB
scripts/representation_output/prisma/representation_drift.png ADDED

Git LFS Details

  • SHA256: 90358a96a33f4ad502530d5e7cffe38889711195991b571e12587953236a5321
  • Pointer size: 130 Bytes
  • Size of remote file: 72 kB
scripts/representation_output/prisma/results.json ADDED
@@ -0,0 +1,2587 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "cka_self": {
3
+ "names": [
4
+ "embedding",
5
+ "expand_0",
6
+ "expand_1",
7
+ "expand_2",
8
+ "expand_3",
9
+ "expand_4",
10
+ "expand_5",
11
+ "expand_6",
12
+ "expand_7",
13
+ "expand_8",
14
+ "expand_9",
15
+ "expand_10",
16
+ "expand_11",
17
+ "expand_12",
18
+ "expand_13",
19
+ "expand_14",
20
+ "expand_15",
21
+ "expand_16",
22
+ "expand_17",
23
+ "expand_18",
24
+ "expand_19",
25
+ "middle_0",
26
+ "compress_0",
27
+ "compress_1",
28
+ "compress_2",
29
+ "compress_3",
30
+ "compress_4",
31
+ "compress_5",
32
+ "compress_6",
33
+ "compress_7",
34
+ "compress_8",
35
+ "compress_9",
36
+ "compress_10",
37
+ "compress_11",
38
+ "compress_12",
39
+ "compress_13",
40
+ "compress_14",
41
+ "compress_15",
42
+ "compress_16",
43
+ "compress_17",
44
+ "compress_18",
45
+ "compress_19",
46
+ "final_norm"
47
+ ],
48
+ "matrix": [
49
+ [
50
+ 1.0,
51
+ 0.8866188526153564,
52
+ 0.8164767026901245,
53
+ 0.7336245179176331,
54
+ 0.7054738998413086,
55
+ 0.6806746125221252,
56
+ 0.6546079516410828,
57
+ 0.6396195292472839,
58
+ 0.5966438055038452,
59
+ 0.085598886013031,
60
+ 0.07780761271715164,
61
+ 0.07064133882522583,
62
+ 0.06587867438793182,
63
+ 0.06444326043128967,
64
+ 0.06620140373706818,
65
+ 0.06643415987491608,
66
+ 0.06744755059480667,
67
+ 0.06798108667135239,
68
+ 0.023102257400751114,
69
+ 0.02318720705807209,
70
+ 0.023029332980513573,
71
+ 0.02303888462483883,
72
+ 0.0230090469121933,
73
+ 0.023262370377779007,
74
+ 0.07920751720666885,
75
+ 0.08516508340835571,
76
+ 0.12063281238079071,
77
+ 0.13757441937923431,
78
+ 0.1504034847021103,
79
+ 0.20180721580982208,
80
+ 0.20416469871997833,
81
+ 0.20618851482868195,
82
+ 0.2103184014558792,
83
+ 0.276023268699646,
84
+ 0.2877357304096222,
85
+ 0.27862244844436646,
86
+ 0.2714858949184418,
87
+ 0.2638208270072937,
88
+ 0.24908611178398132,
89
+ 0.23528440296649933,
90
+ 0.2028241604566574,
91
+ 0.18157896399497986,
92
+ 0.21474751830101013
93
+ ],
94
+ [
95
+ 0.8866188526153564,
96
+ 1.0,
97
+ 0.9628973603248596,
98
+ 0.8799647688865662,
99
+ 0.8539307713508606,
100
+ 0.8308154940605164,
101
+ 0.8077625632286072,
102
+ 0.7751748561859131,
103
+ 0.7229080200195312,
104
+ 0.10106048732995987,
105
+ 0.09334167093038559,
106
+ 0.08555062115192413,
107
+ 0.08125422894954681,
108
+ 0.0780123695731163,
109
+ 0.07894133031368256,
110
+ 0.08002081513404846,
111
+ 0.07889489084482193,
112
+ 0.09398424625396729,
113
+ 0.11173419654369354,
114
+ 0.11162395775318146,
115
+ 0.11166036128997803,
116
+ 0.11173491179943085,
117
+ 0.1118139699101448,
118
+ 0.11213553696870804,
119
+ 0.15420915186405182,
120
+ 0.15838824212551117,
121
+ 0.18471214175224304,
122
+ 0.1943701058626175,
123
+ 0.20828627049922943,
124
+ 0.22415931522846222,
125
+ 0.22729963064193726,
126
+ 0.23028919100761414,
127
+ 0.23264256119728088,
128
+ 0.3063642978668213,
129
+ 0.3168011009693146,
130
+ 0.3035929203033447,
131
+ 0.2928607165813446,
132
+ 0.28051790595054626,
133
+ 0.2587052285671234,
134
+ 0.23799873888492584,
135
+ 0.20439808070659637,
136
+ 0.1831863820552826,
137
+ 0.22291426360607147
138
+ ],
139
+ [
140
+ 0.8164767026901245,
141
+ 0.9628973603248596,
142
+ 1.0,
143
+ 0.9413407444953918,
144
+ 0.9221710562705994,
145
+ 0.9034830331802368,
146
+ 0.8851162791252136,
147
+ 0.8494495749473572,
148
+ 0.7978618741035461,
149
+ 0.11317962408065796,
150
+ 0.10702235996723175,
151
+ 0.09908141940832138,
152
+ 0.09638337790966034,
153
+ 0.09181856364011765,
154
+ 0.09098811447620392,
155
+ 0.0925474762916565,
156
+ 0.08911333978176117,
157
+ 0.1197425052523613,
158
+ 0.2184595763683319,
159
+ 0.2182939201593399,
160
+ 0.21833431720733643,
161
+ 0.21851415932178497,
162
+ 0.21860040724277496,
163
+ 0.21891941130161285,
164
+ 0.23645025491714478,
165
+ 0.23678229749202728,
166
+ 0.2491903305053711,
167
+ 0.2493284046649933,
168
+ 0.2616477906703949,
169
+ 0.23879699409008026,
170
+ 0.24320976436138153,
171
+ 0.2469138205051422,
172
+ 0.24982884526252747,
173
+ 0.3279764652252197,
174
+ 0.33550044894218445,
175
+ 0.3209203779697418,
176
+ 0.30974993109703064,
177
+ 0.2971762716770172,
178
+ 0.2757277190685272,
179
+ 0.2561028003692627,
180
+ 0.22381751239299774,
181
+ 0.20405621826648712,
182
+ 0.24675846099853516
183
+ ],
184
+ [
185
+ 0.7336245179176331,
186
+ 0.8799647688865662,
187
+ 0.9413407444953918,
188
+ 1.0,
189
+ 0.995227575302124,
190
+ 0.9878040552139282,
191
+ 0.9771004319190979,
192
+ 0.9507975578308105,
193
+ 0.9107967615127563,
194
+ 0.308550626039505,
195
+ 0.3040868937969208,
196
+ 0.29570043087005615,
197
+ 0.2935424745082855,
198
+ 0.2864590287208557,
199
+ 0.2842535078525543,
200
+ 0.2867068648338318,
201
+ 0.2797296643257141,
202
+ 0.32049211859703064,
203
+ 0.27435117959976196,
204
+ 0.27240100502967834,
205
+ 0.27254536747932434,
206
+ 0.27274295687675476,
207
+ 0.27290886640548706,
208
+ 0.2748803198337555,
209
+ 0.40867719054222107,
210
+ 0.41179877519607544,
211
+ 0.4371880888938904,
212
+ 0.43606802821159363,
213
+ 0.4444027245044708,
214
+ 0.38802891969680786,
215
+ 0.3861248791217804,
216
+ 0.3806435763835907,
217
+ 0.3713274300098419,
218
+ 0.3279266059398651,
219
+ 0.3327653408050537,
220
+ 0.3181270658969879,
221
+ 0.30604374408721924,
222
+ 0.29389938712120056,
223
+ 0.2742559015750885,
224
+ 0.25545141100883484,
225
+ 0.22576963901519775,
226
+ 0.2079307585954666,
227
+ 0.24914155900478363
228
+ ],
229
+ [
230
+ 0.7054738998413086,
231
+ 0.8539307713508606,
232
+ 0.9221710562705994,
233
+ 0.995227575302124,
234
+ 1.0,
235
+ 0.9955264925956726,
236
+ 0.9871774315834045,
237
+ 0.9648488759994507,
238
+ 0.9293156862258911,
239
+ 0.33550918102264404,
240
+ 0.33139121532440186,
241
+ 0.32308056950569153,
242
+ 0.32135188579559326,
243
+ 0.31377658247947693,
244
+ 0.3109651207923889,
245
+ 0.31324508786201477,
246
+ 0.30540403723716736,
247
+ 0.3480979800224304,
248
+ 0.2881461977958679,
249
+ 0.28593939542770386,
250
+ 0.2860979735851288,
251
+ 0.28630706667900085,
252
+ 0.2864839732646942,
253
+ 0.2886858880519867,
254
+ 0.43609726428985596,
255
+ 0.43914493918418884,
256
+ 0.46483513712882996,
257
+ 0.4626930356025696,
258
+ 0.4702044725418091,
259
+ 0.40629586577415466,
260
+ 0.40342196822166443,
261
+ 0.3966600000858307,
262
+ 0.3857835531234741,
263
+ 0.32744163274765015,
264
+ 0.33175063133239746,
265
+ 0.3172719180583954,
266
+ 0.30519741773605347,
267
+ 0.29320383071899414,
268
+ 0.27433881163597107,
269
+ 0.25606462359428406,
270
+ 0.22731678187847137,
271
+ 0.20990385115146637,
272
+ 0.25057658553123474
273
+ ],
274
+ [
275
+ 0.6806746125221252,
276
+ 0.8308154940605164,
277
+ 0.9034830331802368,
278
+ 0.9878040552139282,
279
+ 0.9955264925956726,
280
+ 1.0,
281
+ 0.9944660067558289,
282
+ 0.9749224781990051,
283
+ 0.9430608749389648,
284
+ 0.3607690930366516,
285
+ 0.3568904995918274,
286
+ 0.34869030117988586,
287
+ 0.3472835421562195,
288
+ 0.33940771222114563,
289
+ 0.3363325297832489,
290
+ 0.3382657468318939,
291
+ 0.32981014251708984,
292
+ 0.3720569312572479,
293
+ 0.2870592474937439,
294
+ 0.284622460603714,
295
+ 0.28477877378463745,
296
+ 0.2850104570388794,
297
+ 0.2851870656013489,
298
+ 0.28762033581733704,
299
+ 0.4527134895324707,
300
+ 0.4562855660915375,
301
+ 0.4845730662345886,
302
+ 0.4827938675880432,
303
+ 0.4897107779979706,
304
+ 0.42294442653656006,
305
+ 0.4187129735946655,
306
+ 0.41039565205574036,
307
+ 0.39745908975601196,
308
+ 0.3240452706813812,
309
+ 0.32759734988212585,
310
+ 0.3131496012210846,
311
+ 0.30125588178634644,
312
+ 0.2897612750530243,
313
+ 0.2714788019657135,
314
+ 0.2537075877189636,
315
+ 0.22584068775177002,
316
+ 0.20897728204727173,
317
+ 0.24922457337379456
318
+ ],
319
+ [
320
+ 0.6546079516410828,
321
+ 0.8077625632286072,
322
+ 0.8851162791252136,
323
+ 0.9771004319190979,
324
+ 0.9871774315834045,
325
+ 0.9944660067558289,
326
+ 1.0,
327
+ 0.9820618033409119,
328
+ 0.953970730304718,
329
+ 0.3724070191383362,
330
+ 0.3690847158432007,
331
+ 0.3610292077064514,
332
+ 0.3599538505077362,
333
+ 0.3517965078353882,
334
+ 0.3483361005783081,
335
+ 0.3496970534324646,
336
+ 0.34060928225517273,
337
+ 0.3831266760826111,
338
+ 0.2930297255516052,
339
+ 0.29048728942871094,
340
+ 0.29062631726264954,
341
+ 0.29089054465293884,
342
+ 0.2910628914833069,
343
+ 0.2935792803764343,
344
+ 0.46357405185699463,
345
+ 0.4668458104133606,
346
+ 0.49484941363334656,
347
+ 0.49225494265556335,
348
+ 0.49875810742378235,
349
+ 0.4283643364906311,
350
+ 0.42346519231796265,
351
+ 0.4143224060535431,
352
+ 0.4004828631877899,
353
+ 0.32099777460098267,
354
+ 0.32415735721588135,
355
+ 0.3096967935562134,
356
+ 0.2979552745819092,
357
+ 0.28647178411483765,
358
+ 0.2684289216995239,
359
+ 0.25093063712120056,
360
+ 0.22366459667682648,
361
+ 0.20713619887828827,
362
+ 0.2471763789653778
363
+ ],
364
+ [
365
+ 0.6396195292472839,
366
+ 0.7751748561859131,
367
+ 0.8494495749473572,
368
+ 0.9507975578308105,
369
+ 0.9648488759994507,
370
+ 0.9749224781990051,
371
+ 0.9820618033409119,
372
+ 1.0,
373
+ 0.9852873682975769,
374
+ 0.4005168080329895,
375
+ 0.3977200984954834,
376
+ 0.39006975293159485,
377
+ 0.38888439536094666,
378
+ 0.37970638275146484,
379
+ 0.37458398938179016,
380
+ 0.3730723559856415,
381
+ 0.3639926314353943,
382
+ 0.3965635299682617,
383
+ 0.25505632162094116,
384
+ 0.2525669038295746,
385
+ 0.25264737010002136,
386
+ 0.2528040111064911,
387
+ 0.25296083092689514,
388
+ 0.25539398193359375,
389
+ 0.4444199204444885,
390
+ 0.4489409923553467,
391
+ 0.48802775144577026,
392
+ 0.4915412962436676,
393
+ 0.49782225489616394,
394
+ 0.44944503903388977,
395
+ 0.4436427652835846,
396
+ 0.4329726994037628,
397
+ 0.4182862639427185,
398
+ 0.3326359987258911,
399
+ 0.3348675072193146,
400
+ 0.3216875195503235,
401
+ 0.3106471300125122,
402
+ 0.30101141333580017,
403
+ 0.28571727871894836,
404
+ 0.2689763605594635,
405
+ 0.24261517822742462,
406
+ 0.22421549260616302,
407
+ 0.2646792531013489
408
+ ],
409
+ [
410
+ 0.5966438055038452,
411
+ 0.7229080200195312,
412
+ 0.7978618741035461,
413
+ 0.9107967615127563,
414
+ 0.9293156862258911,
415
+ 0.9430608749389648,
416
+ 0.953970730304718,
417
+ 0.9852873682975769,
418
+ 1.0,
419
+ 0.4309193193912506,
420
+ 0.42949405312538147,
421
+ 0.4229198396205902,
422
+ 0.4221244156360626,
423
+ 0.4122016131877899,
424
+ 0.40526336431503296,
425
+ 0.4009464681148529,
426
+ 0.3909369707107544,
427
+ 0.4178644120693207,
428
+ 0.23692750930786133,
429
+ 0.2343255579471588,
430
+ 0.23436222970485687,
431
+ 0.2345062792301178,
432
+ 0.23465289175510406,
433
+ 0.23714682459831238,
434
+ 0.44334208965301514,
435
+ 0.44831061363220215,
436
+ 0.49308687448501587,
437
+ 0.4989806115627289,
438
+ 0.5044687986373901,
439
+ 0.46394869685173035,
440
+ 0.4565954804420471,
441
+ 0.4442926347255707,
442
+ 0.42832520604133606,
443
+ 0.32949790358543396,
444
+ 0.3316981792449951,
445
+ 0.31988200545310974,
446
+ 0.3099154829978943,
447
+ 0.3022046983242035,
448
+ 0.28973516821861267,
449
+ 0.27511394023895264,
450
+ 0.2516450881958008,
451
+ 0.23347976803779602,
452
+ 0.27288132905960083
453
+ ],
454
+ [
455
+ 0.085598886013031,
456
+ 0.10106048732995987,
457
+ 0.11317962408065796,
458
+ 0.308550626039505,
459
+ 0.33550918102264404,
460
+ 0.3607690930366516,
461
+ 0.3724070191383362,
462
+ 0.4005168080329895,
463
+ 0.4309193193912506,
464
+ 1.0,
465
+ 0.9983712434768677,
466
+ 0.9967196583747864,
467
+ 0.993431031703949,
468
+ 0.9915507435798645,
469
+ 0.9917429089546204,
470
+ 0.9871540069580078,
471
+ 0.9866191744804382,
472
+ 0.9566135406494141,
473
+ 0.0430295355618,
474
+ 0.03934454545378685,
475
+ 0.03954160958528519,
476
+ 0.03954703360795975,
477
+ 0.03969861567020416,
478
+ 0.043120093643665314,
479
+ 0.5498442053794861,
480
+ 0.568802535533905,
481
+ 0.7249490022659302,
482
+ 0.7664463520050049,
483
+ 0.7588014602661133,
484
+ 0.8236193656921387,
485
+ 0.7813343405723572,
486
+ 0.7313903570175171,
487
+ 0.6704528331756592,
488
+ 0.07240330427885056,
489
+ 0.0693894773721695,
490
+ 0.06366771459579468,
491
+ 0.058279674500226974,
492
+ 0.0554267056286335,
493
+ 0.050616730004549026,
494
+ 0.04563816264271736,
495
+ 0.04066110774874687,
496
+ 0.03758147731423378,
497
+ 0.04408200830221176
498
+ ],
499
+ [
500
+ 0.07780761271715164,
501
+ 0.09334167093038559,
502
+ 0.10702235996723175,
503
+ 0.3040868937969208,
504
+ 0.33139121532440186,
505
+ 0.3568904995918274,
506
+ 0.3690847158432007,
507
+ 0.3977200984954834,
508
+ 0.42949405312538147,
509
+ 0.9983712434768677,
510
+ 1.0,
511
+ 0.9991177916526794,
512
+ 0.9967723488807678,
513
+ 0.9949100613594055,
514
+ 0.9938264489173889,
515
+ 0.9891536235809326,
516
+ 0.9876658320426941,
517
+ 0.9608913064002991,
518
+ 0.05062812194228172,
519
+ 0.046665869653224945,
520
+ 0.04688529297709465,
521
+ 0.046895939856767654,
522
+ 0.04706761613488197,
523
+ 0.05072401463985443,
524
+ 0.5625230073928833,
525
+ 0.5815515518188477,
526
+ 0.7339819669723511,
527
+ 0.7737396955490112,
528
+ 0.7656188607215881,
529
+ 0.8210443258285522,
530
+ 0.7790476083755493,
531
+ 0.7293313145637512,
532
+ 0.6685813069343567,
533
+ 0.07231699675321579,
534
+ 0.069251149892807,
535
+ 0.06337570399045944,
536
+ 0.05794829502701759,
537
+ 0.05501179024577141,
538
+ 0.05034567788243294,
539
+ 0.04554736986756325,
540
+ 0.04089762642979622,
541
+ 0.0380101203918457,
542
+ 0.04412412270903587
543
+ ],
544
+ [
545
+ 0.07064133882522583,
546
+ 0.08555062115192413,
547
+ 0.09908141940832138,
548
+ 0.29570043087005615,
549
+ 0.32308056950569153,
550
+ 0.34869030117988586,
551
+ 0.3610292077064514,
552
+ 0.39006975293159485,
553
+ 0.4229198396205902,
554
+ 0.9967196583747864,
555
+ 0.9991177916526794,
556
+ 1.0,
557
+ 0.9984608888626099,
558
+ 0.9968641400337219,
559
+ 0.9954223036766052,
560
+ 0.9907151460647583,
561
+ 0.9888972640037537,
562
+ 0.9628637433052063,
563
+ 0.05198612064123154,
564
+ 0.0479416586458683,
565
+ 0.0481654517352581,
566
+ 0.04817423224449158,
567
+ 0.048352621495723724,
568
+ 0.052079249173402786,
569
+ 0.5660314559936523,
570
+ 0.5851565003395081,
571
+ 0.7368158102035522,
572
+ 0.7762715220451355,
573
+ 0.7680290341377258,
574
+ 0.8212220072746277,
575
+ 0.7793737649917603,
576
+ 0.7299240231513977,
577
+ 0.6690068244934082,
578
+ 0.07289182394742966,
579
+ 0.0697130560874939,
580
+ 0.06382288783788681,
581
+ 0.058345895260572433,
582
+ 0.055434055626392365,
583
+ 0.050859484821558,
584
+ 0.04605194926261902,
585
+ 0.04151143878698349,
586
+ 0.03859316557645798,
587
+ 0.04483146220445633
588
+ ],
589
+ [
590
+ 0.06587867438793182,
591
+ 0.08125422894954681,
592
+ 0.09638337790966034,
593
+ 0.2935424745082855,
594
+ 0.32135188579559326,
595
+ 0.3472835421562195,
596
+ 0.3599538505077362,
597
+ 0.38888439536094666,
598
+ 0.4221244156360626,
599
+ 0.993431031703949,
600
+ 0.9967723488807678,
601
+ 0.9984608888626099,
602
+ 1.0,
603
+ 0.998919665813446,
604
+ 0.9969019293785095,
605
+ 0.9931889176368713,
606
+ 0.9905962944030762,
607
+ 0.9688683152198792,
608
+ 0.0649133026599884,
609
+ 0.06056395545601845,
610
+ 0.06081191077828407,
611
+ 0.06082715466618538,
612
+ 0.061027247458696365,
613
+ 0.06504278630018234,
614
+ 0.5855360627174377,
615
+ 0.6047874689102173,
616
+ 0.7524403929710388,
617
+ 0.7901456952095032,
618
+ 0.7814924120903015,
619
+ 0.822943925857544,
620
+ 0.7818796634674072,
621
+ 0.7328982949256897,
622
+ 0.6719599366188049,
623
+ 0.07769713550806046,
624
+ 0.07411515712738037,
625
+ 0.06803729385137558,
626
+ 0.06221542879939079,
627
+ 0.05910384654998779,
628
+ 0.054406825453042984,
629
+ 0.0491962805390358,
630
+ 0.04431117698550224,
631
+ 0.04105953127145767,
632
+ 0.047917358577251434
633
+ ],
634
+ [
635
+ 0.06444326043128967,
636
+ 0.0780123695731163,
637
+ 0.09181856364011765,
638
+ 0.2864590287208557,
639
+ 0.31377658247947693,
640
+ 0.33940771222114563,
641
+ 0.3517965078353882,
642
+ 0.37970638275146484,
643
+ 0.4122016131877899,
644
+ 0.9915507435798645,
645
+ 0.9949100613594055,
646
+ 0.9968641400337219,
647
+ 0.998919665813446,
648
+ 1.0,
649
+ 0.998546838760376,
650
+ 0.9956219792366028,
651
+ 0.993252158164978,
652
+ 0.972076416015625,
653
+ 0.06579349935054779,
654
+ 0.06141830235719681,
655
+ 0.061670996248722076,
656
+ 0.061693765223026276,
657
+ 0.06189659610390663,
658
+ 0.06596172600984573,
659
+ 0.5887587070465088,
660
+ 0.60823655128479,
661
+ 0.7562814950942993,
662
+ 0.794303297996521,
663
+ 0.7856388092041016,
664
+ 0.8270909786224365,
665
+ 0.7860783934593201,
666
+ 0.7372952699661255,
667
+ 0.6760224103927612,
668
+ 0.08032910525798798,
669
+ 0.07661429792642593,
670
+ 0.07035835832357407,
671
+ 0.06424151360988617,
672
+ 0.06096977740526199,
673
+ 0.055847618728876114,
674
+ 0.050160396844148636,
675
+ 0.0446544885635376,
676
+ 0.04107876121997833,
677
+ 0.04873289167881012
678
+ ],
679
+ [
680
+ 0.06620140373706818,
681
+ 0.07894133031368256,
682
+ 0.09098811447620392,
683
+ 0.2842535078525543,
684
+ 0.3109651207923889,
685
+ 0.3363325297832489,
686
+ 0.3483361005783081,
687
+ 0.37458398938179016,
688
+ 0.40526336431503296,
689
+ 0.9917429089546204,
690
+ 0.9938264489173889,
691
+ 0.9954223036766052,
692
+ 0.9969019293785095,
693
+ 0.998546838760376,
694
+ 1.0,
695
+ 0.9978783130645752,
696
+ 0.9963661432266235,
697
+ 0.9736549258232117,
698
+ 0.05974001809954643,
699
+ 0.05549969524145126,
700
+ 0.05575323849916458,
701
+ 0.055778440088033676,
702
+ 0.05597558617591858,
703
+ 0.05994410067796707,
704
+ 0.5834383964538574,
705
+ 0.603299081325531,
706
+ 0.7547523379325867,
707
+ 0.7943676114082336,
708
+ 0.7866061329841614,
709
+ 0.8345628976821899,
710
+ 0.7933462858200073,
711
+ 0.744498074054718,
712
+ 0.6827337741851807,
713
+ 0.08384349197149277,
714
+ 0.08009275794029236,
715
+ 0.0735638365149498,
716
+ 0.06728626787662506,
717
+ 0.06396643072366714,
718
+ 0.05840763822197914,
719
+ 0.0524911992251873,
720
+ 0.04635629802942276,
721
+ 0.042436614632606506,
722
+ 0.05078994855284691
723
+ ],
724
+ [
725
+ 0.06643415987491608,
726
+ 0.08002081513404846,
727
+ 0.0925474762916565,
728
+ 0.2867068648338318,
729
+ 0.31324508786201477,
730
+ 0.3382657468318939,
731
+ 0.3496970534324646,
732
+ 0.3730723559856415,
733
+ 0.4009464681148529,
734
+ 0.9871540069580078,
735
+ 0.9891536235809326,
736
+ 0.9907151460647583,
737
+ 0.9931889176368713,
738
+ 0.9956219792366028,
739
+ 0.9978783130645752,
740
+ 1.0,
741
+ 0.9985812902450562,
742
+ 0.9823518395423889,
743
+ 0.07506875693798065,
744
+ 0.07046352326869965,
745
+ 0.07077838480472565,
746
+ 0.07081238180398941,
747
+ 0.07104198634624481,
748
+ 0.07539550215005875,
749
+ 0.6085659861564636,
750
+ 0.6289919018745422,
751
+ 0.7765320539474487,
752
+ 0.815192461013794,
753
+ 0.8073939681053162,
754
+ 0.8424702882766724,
755
+ 0.8021972179412842,
756
+ 0.7543538212776184,
757
+ 0.6925984621047974,
758
+ 0.09298310428857803,
759
+ 0.08874774724245071,
760
+ 0.08172436058521271,
761
+ 0.07478447258472443,
762
+ 0.07101134210824966,
763
+ 0.06470689922571182,
764
+ 0.05802201107144356,
765
+ 0.05082838982343674,
766
+ 0.04616275802254677,
767
+ 0.0556659996509552
768
+ ],
769
+ [
770
+ 0.06744755059480667,
771
+ 0.07889489084482193,
772
+ 0.08911333978176117,
773
+ 0.2797296643257141,
774
+ 0.30540403723716736,
775
+ 0.32981014251708984,
776
+ 0.34060928225517273,
777
+ 0.3639926314353943,
778
+ 0.3909369707107544,
779
+ 0.9866191744804382,
780
+ 0.9876658320426941,
781
+ 0.9888972640037537,
782
+ 0.9905962944030762,
783
+ 0.993252158164978,
784
+ 0.9963661432266235,
785
+ 0.9985812902450562,
786
+ 1.0,
787
+ 0.9805582165718079,
788
+ 0.06232975423336029,
789
+ 0.05793860927224159,
790
+ 0.05823370814323425,
791
+ 0.05825956538319588,
792
+ 0.058471664786338806,
793
+ 0.06263023614883423,
794
+ 0.5946943759918213,
795
+ 0.6155142784118652,
796
+ 0.7682710886001587,
797
+ 0.809515118598938,
798
+ 0.8023097515106201,
799
+ 0.8486509919166565,
800
+ 0.8083965182304382,
801
+ 0.7606802582740784,
802
+ 0.6990503668785095,
803
+ 0.09798076003789902,
804
+ 0.09355077892541885,
805
+ 0.0864337757229805,
806
+ 0.07936399430036545,
807
+ 0.07550866156816483,
808
+ 0.06888752430677414,
809
+ 0.06172937527298927,
810
+ 0.053861312568187714,
811
+ 0.0485219843685627,
812
+ 0.0585828460752964
813
+ ],
814
+ [
815
+ 0.06798108667135239,
816
+ 0.09398424625396729,
817
+ 0.1197425052523613,
818
+ 0.32049211859703064,
819
+ 0.3480979800224304,
820
+ 0.3720569312572479,
821
+ 0.3831266760826111,
822
+ 0.3965635299682617,
823
+ 0.4178644120693207,
824
+ 0.9566135406494141,
825
+ 0.9608913064002991,
826
+ 0.9628637433052063,
827
+ 0.9688683152198792,
828
+ 0.972076416015625,
829
+ 0.9736549258232117,
830
+ 0.9823518395423889,
831
+ 0.9805582165718079,
832
+ 1.0,
833
+ 0.20967945456504822,
834
+ 0.20398284494876862,
835
+ 0.20447564125061035,
836
+ 0.20453287661075592,
837
+ 0.20486538112163544,
838
+ 0.21031388640403748,
839
+ 0.7319108843803406,
840
+ 0.7499132752418518,
841
+ 0.8658959269523621,
842
+ 0.8875241875648499,
843
+ 0.8780434131622314,
844
+ 0.8381298184394836,
845
+ 0.8010719418525696,
846
+ 0.7571243047714233,
847
+ 0.6966723799705505,
848
+ 0.11264808475971222,
849
+ 0.10858210921287537,
850
+ 0.1002814769744873,
851
+ 0.09172182530164719,
852
+ 0.08617549389600754,
853
+ 0.07838811725378036,
854
+ 0.06973807513713837,
855
+ 0.06031683087348938,
856
+ 0.054660484194755554,
857
+ 0.06589983403682709
858
+ ],
859
+ [
860
+ 0.023102257400751114,
861
+ 0.11173419654369354,
862
+ 0.2184595763683319,
863
+ 0.27435117959976196,
864
+ 0.2881461977958679,
865
+ 0.2870592474937439,
866
+ 0.2930297255516052,
867
+ 0.25505632162094116,
868
+ 0.23692750930786133,
869
+ 0.0430295355618,
870
+ 0.05062812194228172,
871
+ 0.05198612064123154,
872
+ 0.0649133026599884,
873
+ 0.06579349935054779,
874
+ 0.05974001809954643,
875
+ 0.07506875693798065,
876
+ 0.06232975423336029,
877
+ 0.20967945456504822,
878
+ 1.0,
879
+ 0.999909520149231,
880
+ 0.9999238848686218,
881
+ 0.999921441078186,
882
+ 0.9999275803565979,
883
+ 0.9999585747718811,
884
+ 0.7119568586349487,
885
+ 0.671326220035553,
886
+ 0.5175796151161194,
887
+ 0.41512802243232727,
888
+ 0.4003653824329376,
889
+ 0.06566229462623596,
890
+ 0.06661772727966309,
891
+ 0.07006547600030899,
892
+ 0.06426654756069183,
893
+ 0.07124065607786179,
894
+ 0.07693160325288773,
895
+ 0.07690145075321198,
896
+ 0.06902671605348587,
897
+ 0.06145131587982178,
898
+ 0.06265770643949509,
899
+ 0.055518437176942825,
900
+ 0.04859182983636856,
901
+ 0.045823391526937485,
902
+ 0.05258903652429581
903
+ ],
904
+ [
905
+ 0.02318720705807209,
906
+ 0.11162395775318146,
907
+ 0.2182939201593399,
908
+ 0.27240100502967834,
909
+ 0.28593939542770386,
910
+ 0.284622460603714,
911
+ 0.29048728942871094,
912
+ 0.2525669038295746,
913
+ 0.2343255579471588,
914
+ 0.03934454545378685,
915
+ 0.046665869653224945,
916
+ 0.0479416586458683,
917
+ 0.06056395545601845,
918
+ 0.06141830235719681,
919
+ 0.05549969524145126,
920
+ 0.07046352326869965,
921
+ 0.05793860927224159,
922
+ 0.20398284494876862,
923
+ 0.999909520149231,
924
+ 1.0,
925
+ 0.9999940991401672,
926
+ 0.9999884366989136,
927
+ 0.9999811053276062,
928
+ 0.9998469948768616,
929
+ 0.7045469284057617,
930
+ 0.6635348796844482,
931
+ 0.5103201270103455,
932
+ 0.40810084342956543,
933
+ 0.393448144197464,
934
+ 0.0624198317527771,
935
+ 0.0633949413895607,
936
+ 0.06690343469381332,
937
+ 0.06137187033891678,
938
+ 0.07085655629634857,
939
+ 0.0765998587012291,
940
+ 0.07666971534490585,
941
+ 0.06883713603019714,
942
+ 0.061300039291381836,
943
+ 0.06258251518011093,
944
+ 0.0554729625582695,
945
+ 0.048547327518463135,
946
+ 0.04577719047665596,
947
+ 0.05257975682616234
948
+ ],
949
+ [
950
+ 0.023029332980513573,
951
+ 0.11166036128997803,
952
+ 0.21833431720733643,
953
+ 0.27254536747932434,
954
+ 0.2860979735851288,
955
+ 0.28477877378463745,
956
+ 0.29062631726264954,
957
+ 0.25264737010002136,
958
+ 0.23436222970485687,
959
+ 0.03954160958528519,
960
+ 0.04688529297709465,
961
+ 0.0481654517352581,
962
+ 0.06081191077828407,
963
+ 0.061670996248722076,
964
+ 0.05575323849916458,
965
+ 0.07077838480472565,
966
+ 0.05823370814323425,
967
+ 0.20447564125061035,
968
+ 0.9999238848686218,
969
+ 0.9999940991401672,
970
+ 1.0,
971
+ 0.9999904036521912,
972
+ 0.9999901652336121,
973
+ 0.9998791813850403,
974
+ 0.7054231762886047,
975
+ 0.6644848585128784,
976
+ 0.511172354221344,
977
+ 0.40891215205192566,
978
+ 0.3942578136920929,
979
+ 0.06265631318092346,
980
+ 0.06365527212619781,
981
+ 0.06717797368764877,
982
+ 0.06163809821009636,
983
+ 0.07093967497348785,
984
+ 0.07665561139583588,
985
+ 0.07666600495576859,
986
+ 0.06881962716579437,
987
+ 0.0612604096531868,
988
+ 0.06247647851705551,
989
+ 0.05536418780684471,
990
+ 0.04843009635806084,
991
+ 0.045680876821279526,
992
+ 0.05250919237732887
993
+ ],
994
+ [
995
+ 0.02303888462483883,
996
+ 0.11173491179943085,
997
+ 0.21851415932178497,
998
+ 0.27274295687675476,
999
+ 0.28630706667900085,
1000
+ 0.2850104570388794,
1001
+ 0.29089054465293884,
1002
+ 0.2528040111064911,
1003
+ 0.2345062792301178,
1004
+ 0.03954703360795975,
1005
+ 0.046895939856767654,
1006
+ 0.04817423224449158,
1007
+ 0.06082715466618538,
1008
+ 0.061693765223026276,
1009
+ 0.055778440088033676,
1010
+ 0.07081238180398941,
1011
+ 0.05825956538319588,
1012
+ 0.20453287661075592,
1013
+ 0.999921441078186,
1014
+ 0.9999884366989136,
1015
+ 0.9999904036521912,
1016
+ 1.0,
1017
+ 0.9999964237213135,
1018
+ 0.9998904466629028,
1019
+ 0.7056722044944763,
1020
+ 0.6647428870201111,
1021
+ 0.5114345550537109,
1022
+ 0.40917566418647766,
1023
+ 0.3945251703262329,
1024
+ 0.06278237700462341,
1025
+ 0.06377211213111877,
1026
+ 0.06728610396385193,
1027
+ 0.06173764541745186,
1028
+ 0.07106957584619522,
1029
+ 0.07678575813770294,
1030
+ 0.07677685469388962,
1031
+ 0.0689200833439827,
1032
+ 0.061353541910648346,
1033
+ 0.06254272162914276,
1034
+ 0.05543401837348938,
1035
+ 0.048503417521715164,
1036
+ 0.04575222730636597,
1037
+ 0.05261457711458206
1038
+ ],
1039
+ [
1040
+ 0.0230090469121933,
1041
+ 0.1118139699101448,
1042
+ 0.21860040724277496,
1043
+ 0.27290886640548706,
1044
+ 0.2864839732646942,
1045
+ 0.2851870656013489,
1046
+ 0.2910628914833069,
1047
+ 0.25296083092689514,
1048
+ 0.23465289175510406,
1049
+ 0.03969861567020416,
1050
+ 0.04706761613488197,
1051
+ 0.048352621495723724,
1052
+ 0.061027247458696365,
1053
+ 0.06189659610390663,
1054
+ 0.05597558617591858,
1055
+ 0.07104198634624481,
1056
+ 0.058471664786338806,
1057
+ 0.20486538112163544,
1058
+ 0.9999275803565979,
1059
+ 0.9999811053276062,
1060
+ 0.9999901652336121,
1061
+ 0.9999964237213135,
1062
+ 1.0,
1063
+ 0.9999097585678101,
1064
+ 0.706282377243042,
1065
+ 0.6654007434844971,
1066
+ 0.5120447874069214,
1067
+ 0.40976452827453613,
1068
+ 0.3951128423213959,
1069
+ 0.06300974637269974,
1070
+ 0.06402121484279633,
1071
+ 0.06754946708679199,
1072
+ 0.06200111284852028,
1073
+ 0.07124228030443192,
1074
+ 0.07693175226449966,
1075
+ 0.07687943428754807,
1076
+ 0.06900914758443832,
1077
+ 0.06142721697688103,
1078
+ 0.06256409734487534,
1079
+ 0.05545356869697571,
1080
+ 0.0485139898955822,
1081
+ 0.04577886685729027,
1082
+ 0.05267048999667168
1083
+ ],
1084
+ [
1085
+ 0.023262370377779007,
1086
+ 0.11213553696870804,
1087
+ 0.21891941130161285,
1088
+ 0.2748803198337555,
1089
+ 0.2886858880519867,
1090
+ 0.28762033581733704,
1091
+ 0.2935792803764343,
1092
+ 0.25539398193359375,
1093
+ 0.23714682459831238,
1094
+ 0.043120093643665314,
1095
+ 0.05072401463985443,
1096
+ 0.052079249173402786,
1097
+ 0.06504278630018234,
1098
+ 0.06596172600984573,
1099
+ 0.05994410067796707,
1100
+ 0.07539550215005875,
1101
+ 0.06263023614883423,
1102
+ 0.21031388640403748,
1103
+ 0.9999585747718811,
1104
+ 0.9998469948768616,
1105
+ 0.9998791813850403,
1106
+ 0.9998904466629028,
1107
+ 0.9999097585678101,
1108
+ 1.0,
1109
+ 0.7137615084648132,
1110
+ 0.6733015179634094,
1111
+ 0.51947420835495,
1112
+ 0.4169948399066925,
1113
+ 0.40224966406822205,
1114
+ 0.0663842037320137,
1115
+ 0.06739447265863419,
1116
+ 0.07088898122310638,
1117
+ 0.06509575247764587,
1118
+ 0.0720483735203743,
1119
+ 0.07764078676700592,
1120
+ 0.07742615044116974,
1121
+ 0.069486103951931,
1122
+ 0.06184566766023636,
1123
+ 0.06280297785997391,
1124
+ 0.05563516169786453,
1125
+ 0.04863174259662628,
1126
+ 0.045876212418079376,
1127
+ 0.05289127677679062
1128
+ ],
1129
+ [
1130
+ 0.07920751720666885,
1131
+ 0.15420915186405182,
1132
+ 0.23645025491714478,
1133
+ 0.40867719054222107,
1134
+ 0.43609726428985596,
1135
+ 0.4527134895324707,
1136
+ 0.46357405185699463,
1137
+ 0.4444199204444885,
1138
+ 0.44334208965301514,
1139
+ 0.5498442053794861,
1140
+ 0.5625230073928833,
1141
+ 0.5660314559936523,
1142
+ 0.5855360627174377,
1143
+ 0.5887587070465088,
1144
+ 0.5834383964538574,
1145
+ 0.6085659861564636,
1146
+ 0.5946943759918213,
1147
+ 0.7319108843803406,
1148
+ 0.7119568586349487,
1149
+ 0.7045469284057617,
1150
+ 0.7054231762886047,
1151
+ 0.7056722044944763,
1152
+ 0.706282377243042,
1153
+ 0.7137615084648132,
1154
+ 1.0,
1155
+ 0.9971895813941956,
1156
+ 0.950676441192627,
1157
+ 0.8982461094856262,
1158
+ 0.8854431509971619,
1159
+ 0.5648123621940613,
1160
+ 0.552385151386261,
1161
+ 0.5359407663345337,
1162
+ 0.5004628300666809,
1163
+ 0.19620217382907867,
1164
+ 0.19262436032295227,
1165
+ 0.181674525141716,
1166
+ 0.16961324214935303,
1167
+ 0.15909402072429657,
1168
+ 0.14928825199604034,
1169
+ 0.13633903861045837,
1170
+ 0.12095215916633606,
1171
+ 0.11138390004634857,
1172
+ 0.12936289608478546
1173
+ ],
1174
+ [
1175
+ 0.08516508340835571,
1176
+ 0.15838824212551117,
1177
+ 0.23678229749202728,
1178
+ 0.41179877519607544,
1179
+ 0.43914493918418884,
1180
+ 0.4562855660915375,
1181
+ 0.4668458104133606,
1182
+ 0.4489409923553467,
1183
+ 0.44831061363220215,
1184
+ 0.568802535533905,
1185
+ 0.5815515518188477,
1186
+ 0.5851565003395081,
1187
+ 0.6047874689102173,
1188
+ 0.60823655128479,
1189
+ 0.603299081325531,
1190
+ 0.6289919018745422,
1191
+ 0.6155142784118652,
1192
+ 0.7499132752418518,
1193
+ 0.671326220035553,
1194
+ 0.6635348796844482,
1195
+ 0.6644848585128784,
1196
+ 0.6647428870201111,
1197
+ 0.6654007434844971,
1198
+ 0.6733015179634094,
1199
+ 0.9971895813941956,
1200
+ 1.0,
1201
+ 0.9640783667564392,
1202
+ 0.9180646538734436,
1203
+ 0.9064103364944458,
1204
+ 0.5952581167221069,
1205
+ 0.583310604095459,
1206
+ 0.5669860243797302,
1207
+ 0.5310094356536865,
1208
+ 0.21623265743255615,
1209
+ 0.21143624186515808,
1210
+ 0.19894073903560638,
1211
+ 0.18640725314617157,
1212
+ 0.17529775202274323,
1213
+ 0.16363555192947388,
1214
+ 0.14969877898693085,
1215
+ 0.13267894089221954,
1216
+ 0.12187019735574722,
1217
+ 0.14145489037036896
1218
+ ],
1219
+ [
1220
+ 0.12063281238079071,
1221
+ 0.18471214175224304,
1222
+ 0.2491903305053711,
1223
+ 0.4371880888938904,
1224
+ 0.46483513712882996,
1225
+ 0.4845730662345886,
1226
+ 0.49484941363334656,
1227
+ 0.48802775144577026,
1228
+ 0.49308687448501587,
1229
+ 0.7249490022659302,
1230
+ 0.7339819669723511,
1231
+ 0.7368158102035522,
1232
+ 0.7524403929710388,
1233
+ 0.7562814950942993,
1234
+ 0.7547523379325867,
1235
+ 0.7765320539474487,
1236
+ 0.7682710886001587,
1237
+ 0.8658959269523621,
1238
+ 0.5175796151161194,
1239
+ 0.5103201270103455,
1240
+ 0.511172354221344,
1241
+ 0.5114345550537109,
1242
+ 0.5120447874069214,
1243
+ 0.51947420835495,
1244
+ 0.950676441192627,
1245
+ 0.9640783667564392,
1246
+ 1.0,
1247
+ 0.9872865676879883,
1248
+ 0.9804797172546387,
1249
+ 0.7754476070404053,
1250
+ 0.7595964074134827,
1251
+ 0.7371383905410767,
1252
+ 0.6947705149650574,
1253
+ 0.29692989587783813,
1254
+ 0.28856322169303894,
1255
+ 0.272670716047287,
1256
+ 0.2585139870643616,
1257
+ 0.24620142579078674,
1258
+ 0.22987234592437744,
1259
+ 0.21223331987857819,
1260
+ 0.18888315558433533,
1261
+ 0.1721278429031372,
1262
+ 0.1982099562883377
1263
+ ],
1264
+ [
1265
+ 0.13757441937923431,
1266
+ 0.1943701058626175,
1267
+ 0.2493284046649933,
1268
+ 0.43606802821159363,
1269
+ 0.4626930356025696,
1270
+ 0.4827938675880432,
1271
+ 0.49225494265556335,
1272
+ 0.4915412962436676,
1273
+ 0.4989806115627289,
1274
+ 0.7664463520050049,
1275
+ 0.7737396955490112,
1276
+ 0.7762715220451355,
1277
+ 0.7901456952095032,
1278
+ 0.794303297996521,
1279
+ 0.7943676114082336,
1280
+ 0.815192461013794,
1281
+ 0.809515118598938,
1282
+ 0.8875241875648499,
1283
+ 0.41512802243232727,
1284
+ 0.40810084342956543,
1285
+ 0.40891215205192566,
1286
+ 0.40917566418647766,
1287
+ 0.40976452827453613,
1288
+ 0.4169948399066925,
1289
+ 0.8982461094856262,
1290
+ 0.9180646538734436,
1291
+ 0.9872865676879883,
1292
+ 1.0,
1293
+ 0.9966458678245544,
1294
+ 0.8506045937538147,
1295
+ 0.838002622127533,
1296
+ 0.8168523907661438,
1297
+ 0.7763916850090027,
1298
+ 0.36660265922546387,
1299
+ 0.3540237545967102,
1300
+ 0.33642521500587463,
1301
+ 0.32011911273002625,
1302
+ 0.3063182234764099,
1303
+ 0.28603243827819824,
1304
+ 0.26461443305015564,
1305
+ 0.23550668358802795,
1306
+ 0.21309895813465118,
1307
+ 0.24481619894504547
1308
+ ],
1309
+ [
1310
+ 0.1504034847021103,
1311
+ 0.20828627049922943,
1312
+ 0.2616477906703949,
1313
+ 0.4444027245044708,
1314
+ 0.4702044725418091,
1315
+ 0.4897107779979706,
1316
+ 0.49875810742378235,
1317
+ 0.49782225489616394,
1318
+ 0.5044687986373901,
1319
+ 0.7588014602661133,
1320
+ 0.7656188607215881,
1321
+ 0.7680290341377258,
1322
+ 0.7814924120903015,
1323
+ 0.7856388092041016,
1324
+ 0.7866061329841614,
1325
+ 0.8073939681053162,
1326
+ 0.8023097515106201,
1327
+ 0.8780434131622314,
1328
+ 0.4003653824329376,
1329
+ 0.393448144197464,
1330
+ 0.3942578136920929,
1331
+ 0.3945251703262329,
1332
+ 0.3951128423213959,
1333
+ 0.40224966406822205,
1334
+ 0.8854431509971619,
1335
+ 0.9064103364944458,
1336
+ 0.9804797172546387,
1337
+ 0.9966458678245544,
1338
+ 1.0,
1339
+ 0.8653483986854553,
1340
+ 0.8540165424346924,
1341
+ 0.8343536853790283,
1342
+ 0.7951667308807373,
1343
+ 0.3973771035671234,
1344
+ 0.3850161135196686,
1345
+ 0.3663579523563385,
1346
+ 0.34976524114608765,
1347
+ 0.3352092504501343,
1348
+ 0.3136848509311676,
1349
+ 0.29112502932548523,
1350
+ 0.2594355046749115,
1351
+ 0.23437561094760895,
1352
+ 0.26727741956710815
1353
+ ],
1354
+ [
1355
+ 0.20180721580982208,
1356
+ 0.22415931522846222,
1357
+ 0.23879699409008026,
1358
+ 0.38802891969680786,
1359
+ 0.40629586577415466,
1360
+ 0.42294442653656006,
1361
+ 0.4283643364906311,
1362
+ 0.44944503903388977,
1363
+ 0.46394869685173035,
1364
+ 0.8236193656921387,
1365
+ 0.8210443258285522,
1366
+ 0.8212220072746277,
1367
+ 0.822943925857544,
1368
+ 0.8270909786224365,
1369
+ 0.8345628976821899,
1370
+ 0.8424702882766724,
1371
+ 0.8486509919166565,
1372
+ 0.8381298184394836,
1373
+ 0.06566229462623596,
1374
+ 0.0624198317527771,
1375
+ 0.06265631318092346,
1376
+ 0.06278237700462341,
1377
+ 0.06300974637269974,
1378
+ 0.0663842037320137,
1379
+ 0.5648123621940613,
1380
+ 0.5952581167221069,
1381
+ 0.7754476070404053,
1382
+ 0.8506045937538147,
1383
+ 0.8653483986854553,
1384
+ 1.0,
1385
+ 0.9904011487960815,
1386
+ 0.9704439043998718,
1387
+ 0.9368601441383362,
1388
+ 0.5288958549499512,
1389
+ 0.5148767232894897,
1390
+ 0.4965519905090332,
1391
+ 0.48039594292640686,
1392
+ 0.46464771032333374,
1393
+ 0.440955251455307,
1394
+ 0.4131018817424774,
1395
+ 0.37193602323532104,
1396
+ 0.3341248333454132,
1397
+ 0.37199312448501587
1398
+ ],
1399
+ [
1400
+ 0.20416469871997833,
1401
+ 0.22729963064193726,
1402
+ 0.24320976436138153,
1403
+ 0.3861248791217804,
1404
+ 0.40342196822166443,
1405
+ 0.4187129735946655,
1406
+ 0.42346519231796265,
1407
+ 0.4436427652835846,
1408
+ 0.4565954804420471,
1409
+ 0.7813343405723572,
1410
+ 0.7790476083755493,
1411
+ 0.7793737649917603,
1412
+ 0.7818796634674072,
1413
+ 0.7860783934593201,
1414
+ 0.7933462858200073,
1415
+ 0.8021972179412842,
1416
+ 0.8083965182304382,
1417
+ 0.8010719418525696,
1418
+ 0.06661772727966309,
1419
+ 0.0633949413895607,
1420
+ 0.06365527212619781,
1421
+ 0.06377211213111877,
1422
+ 0.06402121484279633,
1423
+ 0.06739447265863419,
1424
+ 0.552385151386261,
1425
+ 0.583310604095459,
1426
+ 0.7595964074134827,
1427
+ 0.838002622127533,
1428
+ 0.8540165424346924,
1429
+ 0.9904011487960815,
1430
+ 1.0,
1431
+ 0.9889050722122192,
1432
+ 0.9653147459030151,
1433
+ 0.6016663312911987,
1434
+ 0.5846797823905945,
1435
+ 0.5664506554603577,
1436
+ 0.5473917722702026,
1437
+ 0.5293927788734436,
1438
+ 0.5014894604682922,
1439
+ 0.46798568964004517,
1440
+ 0.4201577603816986,
1441
+ 0.3755658268928528,
1442
+ 0.417666494846344
1443
+ ],
1444
+ [
1445
+ 0.20618851482868195,
1446
+ 0.23028919100761414,
1447
+ 0.2469138205051422,
1448
+ 0.3806435763835907,
1449
+ 0.3966600000858307,
1450
+ 0.41039565205574036,
1451
+ 0.4143224060535431,
1452
+ 0.4329726994037628,
1453
+ 0.4442926347255707,
1454
+ 0.7313903570175171,
1455
+ 0.7293313145637512,
1456
+ 0.7299240231513977,
1457
+ 0.7328982949256897,
1458
+ 0.7372952699661255,
1459
+ 0.744498074054718,
1460
+ 0.7543538212776184,
1461
+ 0.7606802582740784,
1462
+ 0.7571243047714233,
1463
+ 0.07006547600030899,
1464
+ 0.06690343469381332,
1465
+ 0.06717797368764877,
1466
+ 0.06728610396385193,
1467
+ 0.06754946708679199,
1468
+ 0.07088898122310638,
1469
+ 0.5359407663345337,
1470
+ 0.5669860243797302,
1471
+ 0.7371383905410767,
1472
+ 0.8168523907661438,
1473
+ 0.8343536853790283,
1474
+ 0.9704439043998718,
1475
+ 0.9889050722122192,
1476
+ 1.0,
1477
+ 0.9857767820358276,
1478
+ 0.6702643632888794,
1479
+ 0.6520861387252808,
1480
+ 0.6344103217124939,
1481
+ 0.6132138967514038,
1482
+ 0.5937173962593079,
1483
+ 0.5627182722091675,
1484
+ 0.5241591930389404,
1485
+ 0.47016438841819763,
1486
+ 0.41858911514282227,
1487
+ 0.46332573890686035
1488
+ ],
1489
+ [
1490
+ 0.2103184014558792,
1491
+ 0.23264256119728088,
1492
+ 0.24982884526252747,
1493
+ 0.3713274300098419,
1494
+ 0.3857835531234741,
1495
+ 0.39745908975601196,
1496
+ 0.4004828631877899,
1497
+ 0.4182862639427185,
1498
+ 0.42832520604133606,
1499
+ 0.6704528331756592,
1500
+ 0.6685813069343567,
1501
+ 0.6690068244934082,
1502
+ 0.6719599366188049,
1503
+ 0.6760224103927612,
1504
+ 0.6827337741851807,
1505
+ 0.6925984621047974,
1506
+ 0.6990503668785095,
1507
+ 0.6966723799705505,
1508
+ 0.06426654756069183,
1509
+ 0.06137187033891678,
1510
+ 0.06163809821009636,
1511
+ 0.06173764541745186,
1512
+ 0.06200111284852028,
1513
+ 0.06509575247764587,
1514
+ 0.5004628300666809,
1515
+ 0.5310094356536865,
1516
+ 0.6947705149650574,
1517
+ 0.7763916850090027,
1518
+ 0.7951667308807373,
1519
+ 0.9368601441383362,
1520
+ 0.9653147459030151,
1521
+ 0.9857767820358276,
1522
+ 1.0,
1523
+ 0.7446660995483398,
1524
+ 0.7232252359390259,
1525
+ 0.7054296731948853,
1526
+ 0.6820340156555176,
1527
+ 0.6602522730827332,
1528
+ 0.6257666945457458,
1529
+ 0.5834745764732361,
1530
+ 0.5233609676361084,
1531
+ 0.46513792872428894,
1532
+ 0.5128670334815979
1533
+ ],
1534
+ [
1535
+ 0.276023268699646,
1536
+ 0.3063642978668213,
1537
+ 0.3279764652252197,
1538
+ 0.3279266059398651,
1539
+ 0.32744163274765015,
1540
+ 0.3240452706813812,
1541
+ 0.32099777460098267,
1542
+ 0.3326359987258911,
1543
+ 0.32949790358543396,
1544
+ 0.07240330427885056,
1545
+ 0.07231699675321579,
1546
+ 0.07289182394742966,
1547
+ 0.07769713550806046,
1548
+ 0.08032910525798798,
1549
+ 0.08384349197149277,
1550
+ 0.09298310428857803,
1551
+ 0.09798076003789902,
1552
+ 0.11264808475971222,
1553
+ 0.07124065607786179,
1554
+ 0.07085655629634857,
1555
+ 0.07093967497348785,
1556
+ 0.07106957584619522,
1557
+ 0.07124228030443192,
1558
+ 0.0720483735203743,
1559
+ 0.19620217382907867,
1560
+ 0.21623265743255615,
1561
+ 0.29692989587783813,
1562
+ 0.36660265922546387,
1563
+ 0.3973771035671234,
1564
+ 0.5288958549499512,
1565
+ 0.6016663312911987,
1566
+ 0.6702643632888794,
1567
+ 0.7446660995483398,
1568
+ 1.0,
1569
+ 0.9814231395721436,
1570
+ 0.9648851156234741,
1571
+ 0.9420889019966125,
1572
+ 0.918043851852417,
1573
+ 0.8781349062919617,
1574
+ 0.8248966336250305,
1575
+ 0.7464483380317688,
1576
+ 0.6632609367370605,
1577
+ 0.7183345556259155
1578
+ ],
1579
+ [
1580
+ 0.2877357304096222,
1581
+ 0.3168011009693146,
1582
+ 0.33550044894218445,
1583
+ 0.3327653408050537,
1584
+ 0.33175063133239746,
1585
+ 0.32759734988212585,
1586
+ 0.32415735721588135,
1587
+ 0.3348675072193146,
1588
+ 0.3316981792449951,
1589
+ 0.0693894773721695,
1590
+ 0.069251149892807,
1591
+ 0.0697130560874939,
1592
+ 0.07411515712738037,
1593
+ 0.07661429792642593,
1594
+ 0.08009275794029236,
1595
+ 0.08874774724245071,
1596
+ 0.09355077892541885,
1597
+ 0.10858210921287537,
1598
+ 0.07693160325288773,
1599
+ 0.0765998587012291,
1600
+ 0.07665561139583588,
1601
+ 0.07678575813770294,
1602
+ 0.07693175226449966,
1603
+ 0.07764078676700592,
1604
+ 0.19262436032295227,
1605
+ 0.21143624186515808,
1606
+ 0.28856322169303894,
1607
+ 0.3540237545967102,
1608
+ 0.3850161135196686,
1609
+ 0.5148767232894897,
1610
+ 0.5846797823905945,
1611
+ 0.6520861387252808,
1612
+ 0.7232252359390259,
1613
+ 0.9814231395721436,
1614
+ 1.0,
1615
+ 0.9873993992805481,
1616
+ 0.972152054309845,
1617
+ 0.9519630670547485,
1618
+ 0.916405975818634,
1619
+ 0.8626837134361267,
1620
+ 0.7842699885368347,
1621
+ 0.6978129744529724,
1622
+ 0.7440783381462097
1623
+ ],
1624
+ [
1625
+ 0.27862244844436646,
1626
+ 0.3035929203033447,
1627
+ 0.3209203779697418,
1628
+ 0.3181270658969879,
1629
+ 0.3172719180583954,
1630
+ 0.3131496012210846,
1631
+ 0.3096967935562134,
1632
+ 0.3216875195503235,
1633
+ 0.31988200545310974,
1634
+ 0.06366771459579468,
1635
+ 0.06337570399045944,
1636
+ 0.06382288783788681,
1637
+ 0.06803729385137558,
1638
+ 0.07035835832357407,
1639
+ 0.0735638365149498,
1640
+ 0.08172436058521271,
1641
+ 0.0864337757229805,
1642
+ 0.1002814769744873,
1643
+ 0.07690145075321198,
1644
+ 0.07666971534490585,
1645
+ 0.07666600495576859,
1646
+ 0.07677685469388962,
1647
+ 0.07687943428754807,
1648
+ 0.07742615044116974,
1649
+ 0.181674525141716,
1650
+ 0.19894073903560638,
1651
+ 0.272670716047287,
1652
+ 0.33642521500587463,
1653
+ 0.3663579523563385,
1654
+ 0.4965519905090332,
1655
+ 0.5664506554603577,
1656
+ 0.6344103217124939,
1657
+ 0.7054296731948853,
1658
+ 0.9648851156234741,
1659
+ 0.9873993992805481,
1660
+ 1.0,
1661
+ 0.9873649477958679,
1662
+ 0.9710380434989929,
1663
+ 0.9398236274719238,
1664
+ 0.8875017166137695,
1665
+ 0.8098384737968445,
1666
+ 0.7214956879615784,
1667
+ 0.7621151208877563
1668
+ ],
1669
+ [
1670
+ 0.2714858949184418,
1671
+ 0.2928607165813446,
1672
+ 0.30974993109703064,
1673
+ 0.30604374408721924,
1674
+ 0.30519741773605347,
1675
+ 0.30125588178634644,
1676
+ 0.2979552745819092,
1677
+ 0.3106471300125122,
1678
+ 0.3099154829978943,
1679
+ 0.058279674500226974,
1680
+ 0.05794829502701759,
1681
+ 0.058345895260572433,
1682
+ 0.06221542879939079,
1683
+ 0.06424151360988617,
1684
+ 0.06728626787662506,
1685
+ 0.07478447258472443,
1686
+ 0.07936399430036545,
1687
+ 0.09172182530164719,
1688
+ 0.06902671605348587,
1689
+ 0.06883713603019714,
1690
+ 0.06881962716579437,
1691
+ 0.0689200833439827,
1692
+ 0.06900914758443832,
1693
+ 0.069486103951931,
1694
+ 0.16961324214935303,
1695
+ 0.18640725314617157,
1696
+ 0.2585139870643616,
1697
+ 0.32011911273002625,
1698
+ 0.34976524114608765,
1699
+ 0.48039594292640686,
1700
+ 0.5473917722702026,
1701
+ 0.6132138967514038,
1702
+ 0.6820340156555176,
1703
+ 0.9420889019966125,
1704
+ 0.972152054309845,
1705
+ 0.9873649477958679,
1706
+ 1.0,
1707
+ 0.9886766672134399,
1708
+ 0.9647974371910095,
1709
+ 0.9186524748802185,
1710
+ 0.8474200963973999,
1711
+ 0.7625328302383423,
1712
+ 0.7892231345176697
1713
+ ],
1714
+ [
1715
+ 0.2638208270072937,
1716
+ 0.28051790595054626,
1717
+ 0.2971762716770172,
1718
+ 0.29389938712120056,
1719
+ 0.29320383071899414,
1720
+ 0.2897612750530243,
1721
+ 0.28647178411483765,
1722
+ 0.30101141333580017,
1723
+ 0.3022046983242035,
1724
+ 0.0554267056286335,
1725
+ 0.05501179024577141,
1726
+ 0.055434055626392365,
1727
+ 0.05910384654998779,
1728
+ 0.06096977740526199,
1729
+ 0.06396643072366714,
1730
+ 0.07101134210824966,
1731
+ 0.07550866156816483,
1732
+ 0.08617549389600754,
1733
+ 0.06145131587982178,
1734
+ 0.061300039291381836,
1735
+ 0.0612604096531868,
1736
+ 0.061353541910648346,
1737
+ 0.06142721697688103,
1738
+ 0.06184566766023636,
1739
+ 0.15909402072429657,
1740
+ 0.17529775202274323,
1741
+ 0.24620142579078674,
1742
+ 0.3063182234764099,
1743
+ 0.3352092504501343,
1744
+ 0.46464771032333374,
1745
+ 0.5293927788734436,
1746
+ 0.5937173962593079,
1747
+ 0.6602522730827332,
1748
+ 0.918043851852417,
1749
+ 0.9519630670547485,
1750
+ 0.9710380434989929,
1751
+ 0.9886766672134399,
1752
+ 1.0,
1753
+ 0.9843608736991882,
1754
+ 0.9462465643882751,
1755
+ 0.8798750638961792,
1756
+ 0.7969598770141602,
1757
+ 0.8162945508956909
1758
+ ],
1759
+ [
1760
+ 0.24908611178398132,
1761
+ 0.2587052285671234,
1762
+ 0.2757277190685272,
1763
+ 0.2742559015750885,
1764
+ 0.27433881163597107,
1765
+ 0.2714788019657135,
1766
+ 0.2684289216995239,
1767
+ 0.28571727871894836,
1768
+ 0.28973516821861267,
1769
+ 0.050616730004549026,
1770
+ 0.05034567788243294,
1771
+ 0.050859484821558,
1772
+ 0.054406825453042984,
1773
+ 0.055847618728876114,
1774
+ 0.05840763822197914,
1775
+ 0.06470689922571182,
1776
+ 0.06888752430677414,
1777
+ 0.07838811725378036,
1778
+ 0.06265770643949509,
1779
+ 0.06258251518011093,
1780
+ 0.06247647851705551,
1781
+ 0.06254272162914276,
1782
+ 0.06256409734487534,
1783
+ 0.06280297785997391,
1784
+ 0.14928825199604034,
1785
+ 0.16363555192947388,
1786
+ 0.22987234592437744,
1787
+ 0.28603243827819824,
1788
+ 0.3136848509311676,
1789
+ 0.440955251455307,
1790
+ 0.5014894604682922,
1791
+ 0.5627182722091675,
1792
+ 0.6257666945457458,
1793
+ 0.8781349062919617,
1794
+ 0.916405975818634,
1795
+ 0.9398236274719238,
1796
+ 0.9647974371910095,
1797
+ 0.9843608736991882,
1798
+ 1.0,
1799
+ 0.9794701933860779,
1800
+ 0.9287322163581848,
1801
+ 0.8544009327888489,
1802
+ 0.8514858484268188
1803
+ ],
1804
+ [
1805
+ 0.23528440296649933,
1806
+ 0.23799873888492584,
1807
+ 0.2561028003692627,
1808
+ 0.25545141100883484,
1809
+ 0.25606462359428406,
1810
+ 0.2537075877189636,
1811
+ 0.25093063712120056,
1812
+ 0.2689763605594635,
1813
+ 0.27511394023895264,
1814
+ 0.04563816264271736,
1815
+ 0.04554736986756325,
1816
+ 0.04605194926261902,
1817
+ 0.0491962805390358,
1818
+ 0.050160396844148636,
1819
+ 0.0524911992251873,
1820
+ 0.05802201107144356,
1821
+ 0.06172937527298927,
1822
+ 0.06973807513713837,
1823
+ 0.055518437176942825,
1824
+ 0.0554729625582695,
1825
+ 0.05536418780684471,
1826
+ 0.05543401837348938,
1827
+ 0.05545356869697571,
1828
+ 0.05563516169786453,
1829
+ 0.13633903861045837,
1830
+ 0.14969877898693085,
1831
+ 0.21223331987857819,
1832
+ 0.26461443305015564,
1833
+ 0.29112502932548523,
1834
+ 0.4131018817424774,
1835
+ 0.46798568964004517,
1836
+ 0.5241591930389404,
1837
+ 0.5834745764732361,
1838
+ 0.8248966336250305,
1839
+ 0.8626837134361267,
1840
+ 0.8875017166137695,
1841
+ 0.9186524748802185,
1842
+ 0.9462465643882751,
1843
+ 0.9794701933860779,
1844
+ 1.0,
1845
+ 0.9706311821937561,
1846
+ 0.9152870774269104,
1847
+ 0.887352705001831
1848
+ ],
1849
+ [
1850
+ 0.2028241604566574,
1851
+ 0.20439808070659637,
1852
+ 0.22381751239299774,
1853
+ 0.22576963901519775,
1854
+ 0.22731678187847137,
1855
+ 0.22584068775177002,
1856
+ 0.22366459667682648,
1857
+ 0.24261517822742462,
1858
+ 0.2516450881958008,
1859
+ 0.04066110774874687,
1860
+ 0.04089762642979622,
1861
+ 0.04151143878698349,
1862
+ 0.04431117698550224,
1863
+ 0.0446544885635376,
1864
+ 0.04635629802942276,
1865
+ 0.05082838982343674,
1866
+ 0.053861312568187714,
1867
+ 0.06031683087348938,
1868
+ 0.04859182983636856,
1869
+ 0.048547327518463135,
1870
+ 0.04843009635806084,
1871
+ 0.048503417521715164,
1872
+ 0.0485139898955822,
1873
+ 0.04863174259662628,
1874
+ 0.12095215916633606,
1875
+ 0.13267894089221954,
1876
+ 0.18888315558433533,
1877
+ 0.23550668358802795,
1878
+ 0.2594355046749115,
1879
+ 0.37193602323532104,
1880
+ 0.4201577603816986,
1881
+ 0.47016438841819763,
1882
+ 0.5233609676361084,
1883
+ 0.7464483380317688,
1884
+ 0.7842699885368347,
1885
+ 0.8098384737968445,
1886
+ 0.8474200963973999,
1887
+ 0.8798750638961792,
1888
+ 0.9287322163581848,
1889
+ 0.9706311821937561,
1890
+ 1.0,
1891
+ 0.9748941659927368,
1892
+ 0.9173017144203186
1893
+ ],
1894
+ [
1895
+ 0.18157896399497986,
1896
+ 0.1831863820552826,
1897
+ 0.20405621826648712,
1898
+ 0.2079307585954666,
1899
+ 0.20990385115146637,
1900
+ 0.20897728204727173,
1901
+ 0.20713619887828827,
1902
+ 0.22421549260616302,
1903
+ 0.23347976803779602,
1904
+ 0.03758147731423378,
1905
+ 0.0380101203918457,
1906
+ 0.03859316557645798,
1907
+ 0.04105953127145767,
1908
+ 0.04107876121997833,
1909
+ 0.042436614632606506,
1910
+ 0.04616275802254677,
1911
+ 0.0485219843685627,
1912
+ 0.054660484194755554,
1913
+ 0.045823391526937485,
1914
+ 0.04577719047665596,
1915
+ 0.045680876821279526,
1916
+ 0.04575222730636597,
1917
+ 0.04577886685729027,
1918
+ 0.045876212418079376,
1919
+ 0.11138390004634857,
1920
+ 0.12187019735574722,
1921
+ 0.1721278429031372,
1922
+ 0.21309895813465118,
1923
+ 0.23437561094760895,
1924
+ 0.3341248333454132,
1925
+ 0.3755658268928528,
1926
+ 0.41858911514282227,
1927
+ 0.46513792872428894,
1928
+ 0.6632609367370605,
1929
+ 0.6978129744529724,
1930
+ 0.7214956879615784,
1931
+ 0.7625328302383423,
1932
+ 0.7969598770141602,
1933
+ 0.8544009327888489,
1934
+ 0.9152870774269104,
1935
+ 0.9748941659927368,
1936
+ 1.0,
1937
+ 0.9208613038063049
1938
+ ],
1939
+ [
1940
+ 0.21474751830101013,
1941
+ 0.22291426360607147,
1942
+ 0.24675846099853516,
1943
+ 0.24914155900478363,
1944
+ 0.25057658553123474,
1945
+ 0.24922457337379456,
1946
+ 0.2471763789653778,
1947
+ 0.2646792531013489,
1948
+ 0.27288132905960083,
1949
+ 0.04408200830221176,
1950
+ 0.04412412270903587,
1951
+ 0.04483146220445633,
1952
+ 0.047917358577251434,
1953
+ 0.04873289167881012,
1954
+ 0.05078994855284691,
1955
+ 0.0556659996509552,
1956
+ 0.0585828460752964,
1957
+ 0.06589983403682709,
1958
+ 0.05258903652429581,
1959
+ 0.05257975682616234,
1960
+ 0.05250919237732887,
1961
+ 0.05261457711458206,
1962
+ 0.05267048999667168,
1963
+ 0.05289127677679062,
1964
+ 0.12936289608478546,
1965
+ 0.14145489037036896,
1966
+ 0.1982099562883377,
1967
+ 0.24481619894504547,
1968
+ 0.26727741956710815,
1969
+ 0.37199312448501587,
1970
+ 0.417666494846344,
1971
+ 0.46332573890686035,
1972
+ 0.5128670334815979,
1973
+ 0.7183345556259155,
1974
+ 0.7440783381462097,
1975
+ 0.7621151208877563,
1976
+ 0.7892231345176697,
1977
+ 0.8162945508956909,
1978
+ 0.8514858484268188,
1979
+ 0.887352705001831,
1980
+ 0.9173017144203186,
1981
+ 0.9208613038063049,
1982
+ 1.0
1983
+ ]
1984
+ ]
1985
+ },
1986
+ "logit_lens": {
1987
+ "embedding": {
1988
+ "entropy": 0.008176468312740326,
1989
+ "entropy_std": 0.2515474855899811,
1990
+ "top1_prob": 0.9989781379699707,
1991
+ "correct_rank_mean": 14393.49609375,
1992
+ "correct_rank_median": 10188.0,
1993
+ "log_rank_mean": 8.11260986328125,
1994
+ "agreement_with_final": 0.005598506890237331
1995
+ },
1996
+ "expand_0": {
1997
+ "entropy": 0.034236982464790344,
1998
+ "entropy_std": 0.3024539649486542,
1999
+ "top1_prob": 0.9954218864440918,
2000
+ "correct_rank_mean": 11135.865234375,
2001
+ "correct_rank_median": 5304.0,
2002
+ "log_rank_mean": 7.781420707702637,
2003
+ "agreement_with_final": 0.005598506890237331
2004
+ },
2005
+ "expand_1": {
2006
+ "entropy": 0.14663031697273254,
2007
+ "entropy_std": 0.5081619620323181,
2008
+ "top1_prob": 0.9832598567008972,
2009
+ "correct_rank_mean": 9871.8037109375,
2010
+ "correct_rank_median": 4662.0,
2011
+ "log_rank_mean": 7.68184757232666,
2012
+ "agreement_with_final": 0.005598506890237331
2013
+ },
2014
+ "expand_2": {
2015
+ "entropy": 0.258383572101593,
2016
+ "entropy_std": 0.7484143376350403,
2017
+ "top1_prob": 0.9704828858375549,
2018
+ "correct_rank_mean": 8762.1416015625,
2019
+ "correct_rank_median": 3955.0,
2020
+ "log_rank_mean": 7.566042900085449,
2021
+ "agreement_with_final": 0.005598506890237331
2022
+ },
2023
+ "expand_3": {
2024
+ "entropy": 0.39039328694343567,
2025
+ "entropy_std": 1.0109078884124756,
2026
+ "top1_prob": 0.9554979801177979,
2027
+ "correct_rank_mean": 8682.91796875,
2028
+ "correct_rank_median": 3898.0,
2029
+ "log_rank_mean": 7.5453033447265625,
2030
+ "agreement_with_final": 0.005598506890237331
2031
+ },
2032
+ "expand_4": {
2033
+ "entropy": 0.5588860511779785,
2034
+ "entropy_std": 1.3780310153961182,
2035
+ "top1_prob": 0.9369558691978455,
2036
+ "correct_rank_mean": 8332.0068359375,
2037
+ "correct_rank_median": 3548.0,
2038
+ "log_rank_mean": 7.495856761932373,
2039
+ "agreement_with_final": 0.005598506890237331
2040
+ },
2041
+ "expand_5": {
2042
+ "entropy": 0.6920637488365173,
2043
+ "entropy_std": 1.6069740056991577,
2044
+ "top1_prob": 0.9192118048667908,
2045
+ "correct_rank_mean": 6979.5546875,
2046
+ "correct_rank_median": 2671.0,
2047
+ "log_rank_mean": 7.241157054901123,
2048
+ "agreement_with_final": 0.005598506890237331
2049
+ },
2050
+ "expand_6": {
2051
+ "entropy": 0.7080297470092773,
2052
+ "entropy_std": 1.597432017326355,
2053
+ "top1_prob": 0.9160333275794983,
2054
+ "correct_rank_mean": 6252.78955078125,
2055
+ "correct_rank_median": 2049.0,
2056
+ "log_rank_mean": 7.042272567749023,
2057
+ "agreement_with_final": 0.005687372293323278
2058
+ },
2059
+ "expand_7": {
2060
+ "entropy": 0.7756950855255127,
2061
+ "entropy_std": 1.7049072980880737,
2062
+ "top1_prob": 0.9075745344161987,
2063
+ "correct_rank_mean": 5574.6640625,
2064
+ "correct_rank_median": 1679.0,
2065
+ "log_rank_mean": 6.867434024810791,
2066
+ "agreement_with_final": 0.0056429398246109486
2067
+ },
2068
+ "expand_8": {
2069
+ "entropy": 0.7931565642356873,
2070
+ "entropy_std": 1.7194257974624634,
2071
+ "top1_prob": 0.9033533930778503,
2072
+ "correct_rank_mean": 5238.6875,
2073
+ "correct_rank_median": 1420.0,
2074
+ "log_rank_mean": 6.73607873916626,
2075
+ "agreement_with_final": 0.0056429398246109486
2076
+ },
2077
+ "expand_9": {
2078
+ "entropy": 0.6823284029960632,
2079
+ "entropy_std": 1.5069533586502075,
2080
+ "top1_prob": 0.9121810793876648,
2081
+ "correct_rank_mean": 4099.94580078125,
2082
+ "correct_rank_median": 933.0,
2083
+ "log_rank_mean": 6.390110492706299,
2084
+ "agreement_with_final": 0.005687372293323278
2085
+ },
2086
+ "expand_10": {
2087
+ "entropy": 1.000943899154663,
2088
+ "entropy_std": 1.9695338010787964,
2089
+ "top1_prob": 0.8712934255599976,
2090
+ "correct_rank_mean": 3916.5166015625,
2091
+ "correct_rank_median": 788.0,
2092
+ "log_rank_mean": 6.270185470581055,
2093
+ "agreement_with_final": 0.005998400505632162
2094
+ },
2095
+ "expand_11": {
2096
+ "entropy": 1.2967911958694458,
2097
+ "entropy_std": 2.363511323928833,
2098
+ "top1_prob": 0.8309510946273804,
2099
+ "correct_rank_mean": 3443.058837890625,
2100
+ "correct_rank_median": 588.0,
2101
+ "log_rank_mean": 6.05585241317749,
2102
+ "agreement_with_final": 0.007553541101515293
2103
+ },
2104
+ "expand_12": {
2105
+ "entropy": 1.423232913017273,
2106
+ "entropy_std": 2.4367549419403076,
2107
+ "top1_prob": 0.8082932233810425,
2108
+ "correct_rank_mean": 2777.310546875,
2109
+ "correct_rank_median": 361.0,
2110
+ "log_rank_mean": 5.695545196533203,
2111
+ "agreement_with_final": 0.010574957355856895
2112
+ },
2113
+ "expand_13": {
2114
+ "entropy": 1.6197071075439453,
2115
+ "entropy_std": 2.5692083835601807,
2116
+ "top1_prob": 0.7709041237831116,
2117
+ "correct_rank_mean": 2434.749755859375,
2118
+ "correct_rank_median": 255.0,
2119
+ "log_rank_mean": 5.40345573425293,
2120
+ "agreement_with_final": 0.028125833719968796
2121
+ },
2122
+ "expand_14": {
2123
+ "entropy": 1.7894773483276367,
2124
+ "entropy_std": 2.620293378829956,
2125
+ "top1_prob": 0.7412071228027344,
2126
+ "correct_rank_mean": 2007.8128662109375,
2127
+ "correct_rank_median": 157.0,
2128
+ "log_rank_mean": 5.057971477508545,
2129
+ "agreement_with_final": 0.037901002913713455
2130
+ },
2131
+ "expand_15": {
2132
+ "entropy": 1.943091869354248,
2133
+ "entropy_std": 2.588176727294922,
2134
+ "top1_prob": 0.7119490504264832,
2135
+ "correct_rank_mean": 1823.1900634765625,
2136
+ "correct_rank_median": 122.0,
2137
+ "log_rank_mean": 4.8463592529296875,
2138
+ "agreement_with_final": 0.05087532103061676
2139
+ },
2140
+ "expand_16": {
2141
+ "entropy": 2.052386999130249,
2142
+ "entropy_std": 2.493408441543579,
2143
+ "top1_prob": 0.6834665536880493,
2144
+ "correct_rank_mean": 1669.0242919921875,
2145
+ "correct_rank_median": 94.0,
2146
+ "log_rank_mean": 4.6481218338012695,
2147
+ "agreement_with_final": 0.06638229638338089
2148
+ },
2149
+ "expand_17": {
2150
+ "entropy": 2.4040215015411377,
2151
+ "entropy_std": 2.651726484298706,
2152
+ "top1_prob": 0.6449254751205444,
2153
+ "correct_rank_mean": 1992.696044921875,
2154
+ "correct_rank_median": 104.0,
2155
+ "log_rank_mean": 4.736515522003174,
2156
+ "agreement_with_final": 0.07122544944286346
2157
+ },
2158
+ "expand_18": {
2159
+ "entropy": 2.3906190395355225,
2160
+ "entropy_std": 2.458310604095459,
2161
+ "top1_prob": 0.6318473219871521,
2162
+ "correct_rank_mean": 1239.644775390625,
2163
+ "correct_rank_median": 40.0,
2164
+ "log_rank_mean": 4.058821201324463,
2165
+ "agreement_with_final": 0.09806273877620697
2166
+ },
2167
+ "expand_19": {
2168
+ "entropy": 2.254875659942627,
2169
+ "entropy_std": 2.2049195766448975,
2170
+ "top1_prob": 0.6253980398178101,
2171
+ "correct_rank_mean": 1140.4500732421875,
2172
+ "correct_rank_median": 27.0,
2173
+ "log_rank_mean": 3.7810096740722656,
2174
+ "agreement_with_final": 0.12956544756889343
2175
+ },
2176
+ "middle_0": {
2177
+ "entropy": 2.791367530822754,
2178
+ "entropy_std": 2.4136030673980713,
2179
+ "top1_prob": 0.5617826581001282,
2180
+ "correct_rank_mean": 1371.789306640625,
2181
+ "correct_rank_median": 32.0,
2182
+ "log_rank_mean": 3.917487382888794,
2183
+ "agreement_with_final": 0.12041233479976654
2184
+ },
2185
+ "compress_0": {
2186
+ "entropy": 2.8886218070983887,
2187
+ "entropy_std": 2.3109898567199707,
2188
+ "top1_prob": 0.5394092202186584,
2189
+ "correct_rank_mean": 1147.898193359375,
2190
+ "correct_rank_median": 22.0,
2191
+ "log_rank_mean": 3.6261489391326904,
2192
+ "agreement_with_final": 0.1596907526254654
2193
+ },
2194
+ "compress_1": {
2195
+ "entropy": 2.8037614822387695,
2196
+ "entropy_std": 2.125213623046875,
2197
+ "top1_prob": 0.5311091542243958,
2198
+ "correct_rank_mean": 876.5938720703125,
2199
+ "correct_rank_median": 14.0,
2200
+ "log_rank_mean": 3.2685093879699707,
2201
+ "agreement_with_final": 0.20385675132274628
2202
+ },
2203
+ "compress_2": {
2204
+ "entropy": 2.894321918487549,
2205
+ "entropy_std": 2.1102190017700195,
2206
+ "top1_prob": 0.5155819654464722,
2207
+ "correct_rank_mean": 864.5591430664062,
2208
+ "correct_rank_median": 14.0,
2209
+ "log_rank_mean": 3.2516262531280518,
2210
+ "agreement_with_final": 0.210121750831604
2211
+ },
2212
+ "compress_3": {
2213
+ "entropy": 2.904087781906128,
2214
+ "entropy_std": 2.03000545501709,
2215
+ "top1_prob": 0.5027743577957153,
2216
+ "correct_rank_mean": 747.5247192382812,
2217
+ "correct_rank_median": 12.0,
2218
+ "log_rank_mean": 3.133939743041992,
2219
+ "agreement_with_final": 0.2354038953781128
2220
+ },
2221
+ "compress_4": {
2222
+ "entropy": 2.9448397159576416,
2223
+ "entropy_std": 1.9586139917373657,
2224
+ "top1_prob": 0.486985445022583,
2225
+ "correct_rank_mean": 675.2647094726562,
2226
+ "correct_rank_median": 12.0,
2227
+ "log_rank_mean": 3.057403326034546,
2228
+ "agreement_with_final": 0.2563316524028778
2229
+ },
2230
+ "compress_5": {
2231
+ "entropy": 2.960257053375244,
2232
+ "entropy_std": 1.9112046957015991,
2233
+ "top1_prob": 0.4762323498725891,
2234
+ "correct_rank_mean": 594.5932006835938,
2235
+ "correct_rank_median": 11.0,
2236
+ "log_rank_mean": 2.9580636024475098,
2237
+ "agreement_with_final": 0.27961432933807373
2238
+ },
2239
+ "compress_6": {
2240
+ "entropy": 2.8116822242736816,
2241
+ "entropy_std": 1.8701006174087524,
2242
+ "top1_prob": 0.48950037360191345,
2243
+ "correct_rank_mean": 566.154296875,
2244
+ "correct_rank_median": 9.0,
2245
+ "log_rank_mean": 2.875314950942993,
2246
+ "agreement_with_final": 0.303830087184906
2247
+ },
2248
+ "compress_7": {
2249
+ "entropy": 2.702578067779541,
2250
+ "entropy_std": 1.7729120254516602,
2251
+ "top1_prob": 0.49450957775115967,
2252
+ "correct_rank_mean": 445.0347595214844,
2253
+ "correct_rank_median": 6.0,
2254
+ "log_rank_mean": 2.618129253387451,
2255
+ "agreement_with_final": 0.36670219898223877
2256
+ },
2257
+ "compress_8": {
2258
+ "entropy": 2.738481283187866,
2259
+ "entropy_std": 1.7727619409561157,
2260
+ "top1_prob": 0.4886171817779541,
2261
+ "correct_rank_mean": 403.071044921875,
2262
+ "correct_rank_median": 6.0,
2263
+ "log_rank_mean": 2.525726556777954,
2264
+ "agreement_with_final": 0.39091798663139343
2265
+ },
2266
+ "compress_9": {
2267
+ "entropy": 2.7483277320861816,
2268
+ "entropy_std": 1.758489966392517,
2269
+ "top1_prob": 0.48307204246520996,
2270
+ "correct_rank_mean": 361.2685546875,
2271
+ "correct_rank_median": 5.0,
2272
+ "log_rank_mean": 2.424931764602661,
2273
+ "agreement_with_final": 0.422998309135437
2274
+ },
2275
+ "compress_10": {
2276
+ "entropy": 2.584798812866211,
2277
+ "entropy_std": 1.7651761770248413,
2278
+ "top1_prob": 0.5065273642539978,
2279
+ "correct_rank_mean": 316.74725341796875,
2280
+ "correct_rank_median": 4.0,
2281
+ "log_rank_mean": 2.286303758621216,
2282
+ "agreement_with_final": 0.47702834010124207
2283
+ },
2284
+ "compress_11": {
2285
+ "entropy": 2.4372329711914062,
2286
+ "entropy_std": 1.724508285522461,
2287
+ "top1_prob": 0.5221402645111084,
2288
+ "correct_rank_mean": 286.2889099121094,
2289
+ "correct_rank_median": 4.0,
2290
+ "log_rank_mean": 2.1740314960479736,
2291
+ "agreement_with_final": 0.5079978704452515
2292
+ },
2293
+ "compress_12": {
2294
+ "entropy": 2.286668062210083,
2295
+ "entropy_std": 1.6892642974853516,
2296
+ "top1_prob": 0.5403583645820618,
2297
+ "correct_rank_mean": 260.3778991699219,
2298
+ "correct_rank_median": 3.0,
2299
+ "log_rank_mean": 2.076981782913208,
2300
+ "agreement_with_final": 0.5433217883110046
2301
+ },
2302
+ "compress_13": {
2303
+ "entropy": 2.219756841659546,
2304
+ "entropy_std": 1.6175485849380493,
2305
+ "top1_prob": 0.5423508882522583,
2306
+ "correct_rank_mean": 210.5286102294922,
2307
+ "correct_rank_median": 3.0,
2308
+ "log_rank_mean": 1.9495395421981812,
2309
+ "agreement_with_final": 0.6007731556892395
2310
+ },
2311
+ "compress_14": {
2312
+ "entropy": 2.1078646183013916,
2313
+ "entropy_std": 1.5745396614074707,
2314
+ "top1_prob": 0.5566951632499695,
2315
+ "correct_rank_mean": 181.68710327148438,
2316
+ "correct_rank_median": 3.0,
2317
+ "log_rank_mean": 1.8273409605026245,
2318
+ "agreement_with_final": 0.6587576866149902
2319
+ },
2320
+ "compress_15": {
2321
+ "entropy": 2.1085779666900635,
2322
+ "entropy_std": 1.5215522050857544,
2323
+ "top1_prob": 0.5488401055335999,
2324
+ "correct_rank_mean": 158.09202575683594,
2325
+ "correct_rank_median": 2.0,
2326
+ "log_rank_mean": 1.7517446279525757,
2327
+ "agreement_with_final": 0.6880831718444824
2328
+ },
2329
+ "compress_16": {
2330
+ "entropy": 2.0532541275024414,
2331
+ "entropy_std": 1.4450801610946655,
2332
+ "top1_prob": 0.5455565452575684,
2333
+ "correct_rank_mean": 148.40792846679688,
2334
+ "correct_rank_median": 2.0,
2335
+ "log_rank_mean": 1.7359200716018677,
2336
+ "agreement_with_final": 0.7060783505439758
2337
+ },
2338
+ "compress_17": {
2339
+ "entropy": 2.151466131210327,
2340
+ "entropy_std": 1.4760440587997437,
2341
+ "top1_prob": 0.5291946530342102,
2342
+ "correct_rank_mean": 138.3632354736328,
2343
+ "correct_rank_median": 2.0,
2344
+ "log_rank_mean": 1.7023895978927612,
2345
+ "agreement_with_final": 0.7247844934463501
2346
+ },
2347
+ "compress_18": {
2348
+ "entropy": 2.490158796310425,
2349
+ "entropy_std": 1.6314911842346191,
2350
+ "top1_prob": 0.48671698570251465,
2351
+ "correct_rank_mean": 131.927490234375,
2352
+ "correct_rank_median": 2.0,
2353
+ "log_rank_mean": 1.593283772468567,
2354
+ "agreement_with_final": 0.8141384720802307
2355
+ },
2356
+ "compress_19": {
2357
+ "entropy": 2.896641731262207,
2358
+ "entropy_std": 1.7835479974746704,
2359
+ "top1_prob": 0.4417716860771179,
2360
+ "correct_rank_mean": 120.98338317871094,
2361
+ "correct_rank_median": 2.0,
2362
+ "log_rank_mean": 1.5257868766784668,
2363
+ "agreement_with_final": 1.0
2364
+ },
2365
+ "final_norm": {
2366
+ "entropy": 2.896641731262207,
2367
+ "entropy_std": 1.7835479974746704,
2368
+ "top1_prob": 0.4417716860771179,
2369
+ "correct_rank_mean": 120.98338317871094,
2370
+ "correct_rank_median": 2.0,
2371
+ "log_rank_mean": 1.5257868766784668,
2372
+ "agreement_with_final": 1.0
2373
+ }
2374
+ },
2375
+ "drift": {
2376
+ "expand_0": {
2377
+ "cos_sim_mean": 0.6849911212921143,
2378
+ "cos_sim_std": 0.175172358751297,
2379
+ "l2_distance": 27.544370651245117
2380
+ },
2381
+ "expand_1": {
2382
+ "cos_sim_mean": 0.9739256501197815,
2383
+ "cos_sim_std": 0.019970882683992386,
2384
+ "l2_distance": 8.544537544250488
2385
+ },
2386
+ "expand_2": {
2387
+ "cos_sim_mean": 0.9862149953842163,
2388
+ "cos_sim_std": 0.018325403332710266,
2389
+ "l2_distance": 6.148813247680664
2390
+ },
2391
+ "expand_3": {
2392
+ "cos_sim_mean": 0.9931728839874268,
2393
+ "cos_sim_std": 0.009126277640461922,
2394
+ "l2_distance": 4.215730667114258
2395
+ },
2396
+ "expand_4": {
2397
+ "cos_sim_mean": 0.9931032657623291,
2398
+ "cos_sim_std": 0.008576806634664536,
2399
+ "l2_distance": 4.328185558319092
2400
+ },
2401
+ "expand_5": {
2402
+ "cos_sim_mean": 0.9900708794593811,
2403
+ "cos_sim_std": 0.009801621548831463,
2404
+ "l2_distance": 5.432103633880615
2405
+ },
2406
+ "expand_6": {
2407
+ "cos_sim_mean": 0.985851526260376,
2408
+ "cos_sim_std": 0.011277426965534687,
2409
+ "l2_distance": 6.384981155395508
2410
+ },
2411
+ "expand_7": {
2412
+ "cos_sim_mean": 0.9834489822387695,
2413
+ "cos_sim_std": 0.014593411237001419,
2414
+ "l2_distance": 6.949868679046631
2415
+ },
2416
+ "expand_8": {
2417
+ "cos_sim_mean": 0.9804284572601318,
2418
+ "cos_sim_std": 0.013100787065923214,
2419
+ "l2_distance": 7.284245014190674
2420
+ },
2421
+ "expand_9": {
2422
+ "cos_sim_mean": 0.9666137099266052,
2423
+ "cos_sim_std": 0.03426649048924446,
2424
+ "l2_distance": 7.660122871398926
2425
+ },
2426
+ "expand_10": {
2427
+ "cos_sim_mean": 0.9552152752876282,
2428
+ "cos_sim_std": 0.03713290020823479,
2429
+ "l2_distance": 8.244612693786621
2430
+ },
2431
+ "expand_11": {
2432
+ "cos_sim_mean": 0.9543018341064453,
2433
+ "cos_sim_std": 0.03289920091629028,
2434
+ "l2_distance": 8.666141510009766
2435
+ },
2436
+ "expand_12": {
2437
+ "cos_sim_mean": 0.9556319713592529,
2438
+ "cos_sim_std": 0.029315391555428505,
2439
+ "l2_distance": 8.619108200073242
2440
+ },
2441
+ "expand_13": {
2442
+ "cos_sim_mean": 0.9506610631942749,
2443
+ "cos_sim_std": 0.03158613666892052,
2444
+ "l2_distance": 9.27637004852295
2445
+ },
2446
+ "expand_14": {
2447
+ "cos_sim_mean": 0.950225293636322,
2448
+ "cos_sim_std": 0.024301381781697273,
2449
+ "l2_distance": 9.890229225158691
2450
+ },
2451
+ "expand_15": {
2452
+ "cos_sim_mean": 0.9466367363929749,
2453
+ "cos_sim_std": 0.023674415424466133,
2454
+ "l2_distance": 10.431220054626465
2455
+ },
2456
+ "expand_16": {
2457
+ "cos_sim_mean": 0.9469488263130188,
2458
+ "cos_sim_std": 0.027035264298319817,
2459
+ "l2_distance": 10.755489349365234
2460
+ },
2461
+ "expand_17": {
2462
+ "cos_sim_mean": 0.9249208569526672,
2463
+ "cos_sim_std": 0.025832273066043854,
2464
+ "l2_distance": 18.261241912841797
2465
+ },
2466
+ "expand_18": {
2467
+ "cos_sim_mean": 0.9288403987884521,
2468
+ "cos_sim_std": 0.04455041512846947,
2469
+ "l2_distance": 14.860454559326172
2470
+ },
2471
+ "expand_19": {
2472
+ "cos_sim_mean": 0.9334417581558228,
2473
+ "cos_sim_std": 0.024690203368663788,
2474
+ "l2_distance": 13.679224014282227
2475
+ },
2476
+ "middle_0": {
2477
+ "cos_sim_mean": 0.8935306072235107,
2478
+ "cos_sim_std": 0.030932700261473656,
2479
+ "l2_distance": 18.660053253173828
2480
+ },
2481
+ "compress_0": {
2482
+ "cos_sim_mean": 0.9521554708480835,
2483
+ "cos_sim_std": 0.016932014375925064,
2484
+ "l2_distance": 14.076226234436035
2485
+ },
2486
+ "compress_1": {
2487
+ "cos_sim_mean": 0.9567081332206726,
2488
+ "cos_sim_std": 0.01375015452504158,
2489
+ "l2_distance": 15.834850311279297
2490
+ },
2491
+ "compress_2": {
2492
+ "cos_sim_mean": 0.967133104801178,
2493
+ "cos_sim_std": 0.011383751407265663,
2494
+ "l2_distance": 17.914730072021484
2495
+ },
2496
+ "compress_3": {
2497
+ "cos_sim_mean": 0.9692171216011047,
2498
+ "cos_sim_std": 0.012477421201765537,
2499
+ "l2_distance": 13.46438980102539
2500
+ },
2501
+ "compress_4": {
2502
+ "cos_sim_mean": 0.9710483551025391,
2503
+ "cos_sim_std": 0.010337604209780693,
2504
+ "l2_distance": 14.120949745178223
2505
+ },
2506
+ "compress_5": {
2507
+ "cos_sim_mean": 0.9687831997871399,
2508
+ "cos_sim_std": 0.011583817191421986,
2509
+ "l2_distance": 15.74539852142334
2510
+ },
2511
+ "compress_6": {
2512
+ "cos_sim_mean": 0.9749146103858948,
2513
+ "cos_sim_std": 0.011172610335052013,
2514
+ "l2_distance": 14.680301666259766
2515
+ },
2516
+ "compress_7": {
2517
+ "cos_sim_mean": 0.965900182723999,
2518
+ "cos_sim_std": 0.01336053665727377,
2519
+ "l2_distance": 19.33135223388672
2520
+ },
2521
+ "compress_8": {
2522
+ "cos_sim_mean": 0.9760032892227173,
2523
+ "cos_sim_std": 0.010950364172458649,
2524
+ "l2_distance": 16.689151763916016
2525
+ },
2526
+ "compress_9": {
2527
+ "cos_sim_mean": 0.9721519947052002,
2528
+ "cos_sim_std": 0.01497873105108738,
2529
+ "l2_distance": 19.427019119262695
2530
+ },
2531
+ "compress_10": {
2532
+ "cos_sim_mean": 0.9651305675506592,
2533
+ "cos_sim_std": 0.016658522188663483,
2534
+ "l2_distance": 22.994850158691406
2535
+ },
2536
+ "compress_11": {
2537
+ "cos_sim_mean": 0.9691932201385498,
2538
+ "cos_sim_std": 0.013362148776650429,
2539
+ "l2_distance": 23.314464569091797
2540
+ },
2541
+ "compress_12": {
2542
+ "cos_sim_mean": 0.9701909422874451,
2543
+ "cos_sim_std": 0.012194577604532242,
2544
+ "l2_distance": 23.932886123657227
2545
+ },
2546
+ "compress_13": {
2547
+ "cos_sim_mean": 0.9694362878799438,
2548
+ "cos_sim_std": 0.011056528426706791,
2549
+ "l2_distance": 25.8023624420166
2550
+ },
2551
+ "compress_14": {
2552
+ "cos_sim_mean": 0.9697679281234741,
2553
+ "cos_sim_std": 0.010314267128705978,
2554
+ "l2_distance": 27.431903839111328
2555
+ },
2556
+ "compress_15": {
2557
+ "cos_sim_mean": 0.9717218279838562,
2558
+ "cos_sim_std": 0.01403223630040884,
2559
+ "l2_distance": 26.985631942749023
2560
+ },
2561
+ "compress_16": {
2562
+ "cos_sim_mean": 0.9759195446968079,
2563
+ "cos_sim_std": 0.009822430089116096,
2564
+ "l2_distance": 26.87748908996582
2565
+ },
2566
+ "compress_17": {
2567
+ "cos_sim_mean": 0.97585529088974,
2568
+ "cos_sim_std": 0.01191424485296011,
2569
+ "l2_distance": 27.900794982910156
2570
+ },
2571
+ "compress_18": {
2572
+ "cos_sim_mean": 0.9694885611534119,
2573
+ "cos_sim_std": 0.01752544939517975,
2574
+ "l2_distance": 32.57567596435547
2575
+ },
2576
+ "compress_19": {
2577
+ "cos_sim_mean": 0.9693145155906677,
2578
+ "cos_sim_std": 0.019211014732718468,
2579
+ "l2_distance": 32.98606872558594
2580
+ },
2581
+ "final_norm": {
2582
+ "cos_sim_mean": 0.9926164746284485,
2583
+ "cos_sim_std": 0.004038275685161352,
2584
+ "l2_distance": 86.8482437133789
2585
+ }
2586
+ }
2587
+ }
scripts/spectral_output/prisma/activation_rank_progression.png ADDED

Git LFS Details

  • SHA256: 86cb752463e2e5819d1090ee8bc1231b3d0bc1b7ded87250e574081e4a0e1fda
  • Pointer size: 130 Bytes
  • Size of remote file: 78 kB
scripts/spectral_output/prisma/activation_spectra.png ADDED

Git LFS Details

  • SHA256: bfd95b856e41b60beaf4c556d1fd7721671ef434d81ff0c7faf5a5cdefd30c59
  • Pointer size: 131 Bytes
  • Size of remote file: 505 kB
scripts/spectral_output/prisma/comparison.png ADDED

Git LFS Details

  • SHA256: 95bc94b3cb8d98cd706e4cfc222585978da24cb6bd38818e6a1fb236f8a6c0d2
  • Pointer size: 130 Bytes
  • Size of remote file: 80.3 kB
scripts/spectral_output/prisma/embedding_alignment.png ADDED

Git LFS Details

  • SHA256: 062a0e796f435c9faa8322bbbadfe78c1d1e53e75f3ebe952490f5078721311a
  • Pointer size: 131 Bytes
  • Size of remote file: 104 kB
scripts/spectral_output/prisma/gate_spectra.png ADDED

Git LFS Details

  • SHA256: cdf1c1818195a528fbd86ce7bc0618244af66fef2b17db2d07f3454187ca41a4
  • Pointer size: 131 Bytes
  • Size of remote file: 173 kB
scripts/spectral_output/prisma/layer_progression.png ADDED

Git LFS Details

  • SHA256: 79dc2ce46613656be7a374f5e79f86e3d8fbe582a57bc78ac4e296fcdd2b66d9
  • Pointer size: 131 Bytes
  • Size of remote file: 125 kB
scripts/spectral_output/prisma/mirror_pair_comparison.png ADDED

Git LFS Details

  • SHA256: 4a2834df66a7c7a5b4525f341c37e456dd1cfd9acfd1a40239f70fcc6867725c
  • Pointer size: 131 Bytes
  • Size of remote file: 444 kB
scripts/spectral_output/prisma/results.json ADDED
The diff for this file is too large to render. See raw diff
 
scripts/spectral_output/prisma/results_b.json ADDED
@@ -0,0 +1,2844 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "transformer.wte.weight": {
3
+ "shape": [
4
+ 50257,
5
+ 1024
6
+ ],
7
+ "effective_rank": 925.6200561523438,
8
+ "stable_rank": 3.222175359725952,
9
+ "spectral_norm": 462.8680419921875,
10
+ "frobenius_norm": 830.8675537109375,
11
+ "mp_bound": 150.2218309787832,
12
+ "n_above_mp": 1,
13
+ "n_total": 1024,
14
+ "signal_ratio": 0.0009765625,
15
+ "alpha": 0.25901723751664113,
16
+ "alpha_r2": 0.9398083798430313,
17
+ "condition_number": 3932.63427734375,
18
+ "top_10_sv": [
19
+ 462.8680419921875,
20
+ 68.01887512207031,
21
+ 59.577247619628906,
22
+ 55.14397430419922,
23
+ 54.30785369873047,
24
+ 49.892826080322266,
25
+ 49.320430755615234,
26
+ 48.26898956298828,
27
+ 47.46828842163086,
28
+ 46.06719207763672
29
+ ]
30
+ },
31
+ "transformer.wpe.weight": {
32
+ "shape": [
33
+ 1024,
34
+ 1024
35
+ ],
36
+ "effective_rank": 151.1204833984375,
37
+ "stable_rank": 2.6537351608276367,
38
+ "spectral_norm": 23.605066299438477,
39
+ "frobenius_norm": 38.45333480834961,
40
+ "mp_bound": 0.14957082271575928,
41
+ "n_above_mp": 210,
42
+ "n_total": 1024,
43
+ "signal_ratio": 0.205078125,
44
+ "alpha": 0.718008780427603,
45
+ "alpha_r2": 0.8641226316938437,
46
+ "condition_number": 597220.375,
47
+ "top_10_sv": [
48
+ 23.605066299438477,
49
+ 20.438417434692383,
50
+ 17.248029708862305,
51
+ 8.979466438293457,
52
+ 5.816164970397949,
53
+ 5.2002973556518555,
54
+ 4.532151699066162,
55
+ 3.2729744911193848,
56
+ 2.9242184162139893,
57
+ 2.0841357707977295
58
+ ]
59
+ },
60
+ "transformer.h.0.attn.c_attn.weight": {
61
+ "shape": [
62
+ 1024,
63
+ 3072
64
+ ],
65
+ "effective_rank": 628.722412109375,
66
+ "stable_rank": 22.675931930541992,
67
+ "spectral_norm": 62.47161865234375,
68
+ "frobenius_norm": 297.48516845703125,
69
+ "mp_bound": 7.63114237099588,
70
+ "n_above_mp": 243,
71
+ "n_total": 1024,
72
+ "signal_ratio": 0.2373046875,
73
+ "alpha": 0.7995412978432835,
74
+ "alpha_r2": 0.9108228897637844,
75
+ "condition_number": 68.46097564697266,
76
+ "top_10_sv": [
77
+ 62.47161865234375,
78
+ 59.061126708984375,
79
+ 52.00094985961914,
80
+ 47.60552215576172,
81
+ 45.418373107910156,
82
+ 45.035884857177734,
83
+ 44.06340026855469,
84
+ 41.139434814453125,
85
+ 39.21805953979492,
86
+ 38.14364242553711
87
+ ]
88
+ },
89
+ "transformer.h.0.attn.c_proj.weight": {
90
+ "shape": [
91
+ 1024,
92
+ 1024
93
+ ],
94
+ "effective_rank": 737.2996826171875,
95
+ "stable_rank": 65.48827362060547,
96
+ "spectral_norm": 5.3187255859375,
97
+ "frobenius_norm": 43.041690826416016,
98
+ "mp_bound": 1.4740509986877441,
99
+ "n_above_mp": 248,
100
+ "n_total": 1024,
101
+ "signal_ratio": 0.2421875,
102
+ "alpha": 0.5869972613657769,
103
+ "alpha_r2": 0.8128838986239981,
104
+ "condition_number": 33621.77734375,
105
+ "top_10_sv": [
106
+ 5.3187255859375,
107
+ 4.55604887008667,
108
+ 4.446892261505127,
109
+ 4.300610065460205,
110
+ 4.185939311981201,
111
+ 4.152495384216309,
112
+ 4.086340427398682,
113
+ 4.0273637771606445,
114
+ 3.9726521968841553,
115
+ 3.915243148803711
116
+ ]
117
+ },
118
+ "transformer.h.0.mlp.c_fc.weight": {
119
+ "shape": [
120
+ 1024,
121
+ 4096
122
+ ],
123
+ "effective_rank": 958.4553833007812,
124
+ "stable_rank": 35.955875396728516,
125
+ "spectral_norm": 34.83731460571289,
126
+ "frobenius_norm": 208.895751953125,
127
+ "mp_bound": 20.392281532287598,
128
+ "n_above_mp": 2,
129
+ "n_total": 1024,
130
+ "signal_ratio": 0.001953125,
131
+ "alpha": 0.24270056604789897,
132
+ "alpha_r2": 0.8867966791819655,
133
+ "condition_number": 47.199832916259766,
134
+ "top_10_sv": [
135
+ 34.83731460571289,
136
+ 22.28569221496582,
137
+ 20.11617088317871,
138
+ 19.44432830810547,
139
+ 16.447628021240234,
140
+ 15.993996620178223,
141
+ 15.006787300109863,
142
+ 14.720680236816406,
143
+ 14.216849327087402,
144
+ 13.424826622009277
145
+ ]
146
+ },
147
+ "transformer.h.0.mlp.c_proj.weight": {
148
+ "shape": [
149
+ 4096,
150
+ 1024
151
+ ],
152
+ "effective_rank": 928.1140747070312,
153
+ "stable_rank": 34.63430404663086,
154
+ "spectral_norm": 28.995763778686523,
155
+ "frobenius_norm": 170.64273071289062,
156
+ "mp_bound": 14.264991760253906,
157
+ "n_above_mp": 5,
158
+ "n_total": 1024,
159
+ "signal_ratio": 0.0048828125,
160
+ "alpha": 0.288666051511119,
161
+ "alpha_r2": 0.8199686145081493,
162
+ "condition_number": 42.9376106262207,
163
+ "top_10_sv": [
164
+ 28.995763778686523,
165
+ 17.847505569458008,
166
+ 17.05669403076172,
167
+ 15.134307861328125,
168
+ 14.533385276794434,
169
+ 12.990875244140625,
170
+ 11.749531745910645,
171
+ 11.249499320983887,
172
+ 10.62743091583252,
173
+ 10.352800369262695
174
+ ]
175
+ },
176
+ "transformer.h.1.attn.c_attn.weight": {
177
+ "shape": [
178
+ 1024,
179
+ 3072
180
+ ],
181
+ "effective_rank": 820.3770751953125,
182
+ "stable_rank": 41.79490661621094,
183
+ "spectral_norm": 27.69493865966797,
184
+ "frobenius_norm": 179.04495239257812,
185
+ "mp_bound": 9.096073287364284,
186
+ "n_above_mp": 84,
187
+ "n_total": 1024,
188
+ "signal_ratio": 0.08203125,
189
+ "alpha": 0.5097652395048509,
190
+ "alpha_r2": 0.8774686180910731,
191
+ "condition_number": 27.94350814819336,
192
+ "top_10_sv": [
193
+ 27.69493865966797,
194
+ 26.477949142456055,
195
+ 24.700719833374023,
196
+ 22.73528480529785,
197
+ 21.188859939575195,
198
+ 20.210037231445312,
199
+ 19.647275924682617,
200
+ 18.7845458984375,
201
+ 18.384246826171875,
202
+ 17.731191635131836
203
+ ]
204
+ },
205
+ "transformer.h.1.attn.c_proj.weight": {
206
+ "shape": [
207
+ 1024,
208
+ 1024
209
+ ],
210
+ "effective_rank": 609.0930786132812,
211
+ "stable_rank": 6.79005241394043,
212
+ "spectral_norm": 35.383201599121094,
213
+ "frobenius_norm": 92.2005844116211,
214
+ "mp_bound": 1.7873764038085938,
215
+ "n_above_mp": 349,
216
+ "n_total": 1024,
217
+ "signal_ratio": 0.3408203125,
218
+ "alpha": 0.7416108698642246,
219
+ "alpha_r2": 0.8534609001933348,
220
+ "condition_number": 36300.8515625,
221
+ "top_10_sv": [
222
+ 35.383201599121094,
223
+ 30.46486473083496,
224
+ 23.07978630065918,
225
+ 17.489381790161133,
226
+ 15.965164184570312,
227
+ 13.38463306427002,
228
+ 11.213841438293457,
229
+ 10.809903144836426,
230
+ 7.931944370269775,
231
+ 7.714986801147461
232
+ ]
233
+ },
234
+ "transformer.h.1.mlp.c_fc.weight": {
235
+ "shape": [
236
+ 1024,
237
+ 4096
238
+ ],
239
+ "effective_rank": 952.1654663085938,
240
+ "stable_rank": 15.651745796203613,
241
+ "spectral_norm": 54.71965026855469,
242
+ "frobenius_norm": 216.48345947265625,
243
+ "mp_bound": 20.210805416107178,
244
+ "n_above_mp": 1,
245
+ "n_total": 1024,
246
+ "signal_ratio": 0.0009765625,
247
+ "alpha": 0.28617357150294975,
248
+ "alpha_r2": 0.944003325737223,
249
+ "condition_number": 25.737693786621094,
250
+ "top_10_sv": [
251
+ 54.71965026855469,
252
+ 20.120031356811523,
253
+ 18.258190155029297,
254
+ 17.045495986938477,
255
+ 16.402690887451172,
256
+ 16.293865203857422,
257
+ 15.51065731048584,
258
+ 15.293896675109863,
259
+ 14.751503944396973,
260
+ 14.567024230957031
261
+ ]
262
+ },
263
+ "transformer.h.1.mlp.c_proj.weight": {
264
+ "shape": [
265
+ 4096,
266
+ 1024
267
+ ],
268
+ "effective_rank": 951.001220703125,
269
+ "stable_rank": 15.77647876739502,
270
+ "spectral_norm": 40.80717086791992,
271
+ "frobenius_norm": 162.0845184326172,
272
+ "mp_bound": 15.43984866142273,
273
+ "n_above_mp": 5,
274
+ "n_total": 1024,
275
+ "signal_ratio": 0.0048828125,
276
+ "alpha": 0.27510892798744624,
277
+ "alpha_r2": 0.9524180259003465,
278
+ "condition_number": 47.46715545654297,
279
+ "top_10_sv": [
280
+ 40.80717086791992,
281
+ 25.39755630493164,
282
+ 24.217174530029297,
283
+ 18.221494674682617,
284
+ 15.662391662597656,
285
+ 13.956523895263672,
286
+ 13.657442092895508,
287
+ 13.022187232971191,
288
+ 12.348648071289062,
289
+ 10.786282539367676
290
+ ]
291
+ },
292
+ "transformer.h.2.attn.c_attn.weight": {
293
+ "shape": [
294
+ 1024,
295
+ 3072
296
+ ],
297
+ "effective_rank": 842.8485107421875,
298
+ "stable_rank": 39.29759216308594,
299
+ "spectral_norm": 32.41156768798828,
300
+ "frobenius_norm": 203.18096923828125,
301
+ "mp_bound": 11.375351081125348,
302
+ "n_above_mp": 52,
303
+ "n_total": 1024,
304
+ "signal_ratio": 0.05078125,
305
+ "alpha": 0.4740024292360243,
306
+ "alpha_r2": 0.8802370281377663,
307
+ "condition_number": 52.709171295166016,
308
+ "top_10_sv": [
309
+ 32.41156768798828,
310
+ 30.25526237487793,
311
+ 25.755977630615234,
312
+ 24.045995712280273,
313
+ 22.17633819580078,
314
+ 21.890745162963867,
315
+ 21.64605712890625,
316
+ 21.57254981994629,
317
+ 19.244956970214844,
318
+ 18.357177734375
319
+ ]
320
+ },
321
+ "transformer.h.2.attn.c_proj.weight": {
322
+ "shape": [
323
+ 1024,
324
+ 1024
325
+ ],
326
+ "effective_rank": 715.241943359375,
327
+ "stable_rank": 9.787028312683105,
328
+ "spectral_norm": 23.163925170898438,
329
+ "frobenius_norm": 72.466552734375,
330
+ "mp_bound": 2.168848752975464,
331
+ "n_above_mp": 315,
332
+ "n_total": 1024,
333
+ "signal_ratio": 0.3076171875,
334
+ "alpha": 0.6080837150928929,
335
+ "alpha_r2": 0.7822885769468158,
336
+ "condition_number": 187718.40625,
337
+ "top_10_sv": [
338
+ 23.163925170898438,
339
+ 9.864102363586426,
340
+ 8.651338577270508,
341
+ 8.504467964172363,
342
+ 7.996184349060059,
343
+ 7.292422294616699,
344
+ 6.915347576141357,
345
+ 6.678709030151367,
346
+ 6.435945510864258,
347
+ 6.021938323974609
348
+ ]
349
+ },
350
+ "transformer.h.2.mlp.c_fc.weight": {
351
+ "shape": [
352
+ 1024,
353
+ 4096
354
+ ],
355
+ "effective_rank": 950.7151489257812,
356
+ "stable_rank": 23.699460983276367,
357
+ "spectral_norm": 44.83905029296875,
358
+ "frobenius_norm": 218.28587341308594,
359
+ "mp_bound": 20.38237237930298,
360
+ "n_above_mp": 3,
361
+ "n_total": 1024,
362
+ "signal_ratio": 0.0029296875,
363
+ "alpha": 0.2912721396169389,
364
+ "alpha_r2": 0.9537768474757292,
365
+ "condition_number": 36.274898529052734,
366
+ "top_10_sv": [
367
+ 44.83905029296875,
368
+ 22.49462127685547,
369
+ 20.47141456604004,
370
+ 19.455570220947266,
371
+ 18.79952621459961,
372
+ 18.357946395874023,
373
+ 17.340517044067383,
374
+ 16.285560607910156,
375
+ 15.497901916503906,
376
+ 15.356476783752441
377
+ ]
378
+ },
379
+ "transformer.h.2.mlp.c_proj.weight": {
380
+ "shape": [
381
+ 4096,
382
+ 1024
383
+ ],
384
+ "effective_rank": 950.6380615234375,
385
+ "stable_rank": 20.027393341064453,
386
+ "spectral_norm": 37.39990997314453,
387
+ "frobenius_norm": 167.3719940185547,
388
+ "mp_bound": 15.765140533447266,
389
+ "n_above_mp": 3,
390
+ "n_total": 1024,
391
+ "signal_ratio": 0.0029296875,
392
+ "alpha": 0.2643769291286133,
393
+ "alpha_r2": 0.9029311778240089,
394
+ "condition_number": 100.74349212646484,
395
+ "top_10_sv": [
396
+ 37.39990997314453,
397
+ 23.12250328063965,
398
+ 20.761938095092773,
399
+ 12.250714302062988,
400
+ 10.560304641723633,
401
+ 10.4915771484375,
402
+ 10.370850563049316,
403
+ 10.147944450378418,
404
+ 9.839871406555176,
405
+ 9.63220500946045
406
+ ]
407
+ },
408
+ "transformer.h.3.attn.c_attn.weight": {
409
+ "shape": [
410
+ 1024,
411
+ 3072
412
+ ],
413
+ "effective_rank": 868.9850463867188,
414
+ "stable_rank": 41.50836181640625,
415
+ "spectral_norm": 33.2469596862793,
416
+ "frobenius_norm": 214.20013427734375,
417
+ "mp_bound": 13.617692360816509,
418
+ "n_above_mp": 30,
419
+ "n_total": 1024,
420
+ "signal_ratio": 0.029296875,
421
+ "alpha": 0.43598154200527217,
422
+ "alpha_r2": 0.9121806812022613,
423
+ "condition_number": 91.30290985107422,
424
+ "top_10_sv": [
425
+ 33.2469596862793,
426
+ 26.513952255249023,
427
+ 25.003625869750977,
428
+ 23.534202575683594,
429
+ 22.926427841186523,
430
+ 21.938570022583008,
431
+ 21.85960578918457,
432
+ 20.899417877197266,
433
+ 20.27928924560547,
434
+ 19.71728515625
435
+ ]
436
+ },
437
+ "transformer.h.3.attn.c_proj.weight": {
438
+ "shape": [
439
+ 1024,
440
+ 1024
441
+ ],
442
+ "effective_rank": 745.2302856445312,
443
+ "stable_rank": 16.78188133239746,
444
+ "spectral_norm": 17.08546257019043,
445
+ "frobenius_norm": 69.99178314208984,
446
+ "mp_bound": 2.3438045978546143,
447
+ "n_above_mp": 303,
448
+ "n_total": 1024,
449
+ "signal_ratio": 0.2958984375,
450
+ "alpha": 0.5602377771042083,
451
+ "alpha_r2": 0.7441943879202231,
452
+ "condition_number": 26872.6328125,
453
+ "top_10_sv": [
454
+ 17.08546257019043,
455
+ 7.187192440032959,
456
+ 6.205941677093506,
457
+ 5.882521629333496,
458
+ 5.645092964172363,
459
+ 5.5240044593811035,
460
+ 5.3767547607421875,
461
+ 5.196966171264648,
462
+ 5.166182994842529,
463
+ 5.101844787597656
464
+ ]
465
+ },
466
+ "transformer.h.3.mlp.c_fc.weight": {
467
+ "shape": [
468
+ 1024,
469
+ 4096
470
+ ],
471
+ "effective_rank": 927.2161254882812,
472
+ "stable_rank": 26.023406982421875,
473
+ "spectral_norm": 43.73242950439453,
474
+ "frobenius_norm": 223.09286499023438,
475
+ "mp_bound": 19.039398908615112,
476
+ "n_above_mp": 6,
477
+ "n_total": 1024,
478
+ "signal_ratio": 0.005859375,
479
+ "alpha": 0.33058612233889434,
480
+ "alpha_r2": 0.9492689266361353,
481
+ "condition_number": 47.069427490234375,
482
+ "top_10_sv": [
483
+ 43.73242950439453,
484
+ 37.90205764770508,
485
+ 20.579469680786133,
486
+ 19.914993286132812,
487
+ 19.66720199584961,
488
+ 19.149551391601562,
489
+ 18.675325393676758,
490
+ 18.34368896484375,
491
+ 17.572053909301758,
492
+ 17.139053344726562
493
+ ]
494
+ },
495
+ "transformer.h.3.mlp.c_proj.weight": {
496
+ "shape": [
497
+ 4096,
498
+ 1024
499
+ ],
500
+ "effective_rank": 938.9752197265625,
501
+ "stable_rank": 12.513044357299805,
502
+ "spectral_norm": 49.337677001953125,
503
+ "frobenius_norm": 174.52601623535156,
504
+ "mp_bound": 15.4556565284729,
505
+ "n_above_mp": 2,
506
+ "n_total": 1024,
507
+ "signal_ratio": 0.001953125,
508
+ "alpha": 0.2906680436450817,
509
+ "alpha_r2": 0.9012577872907297,
510
+ "condition_number": 150.67166137695312,
511
+ "top_10_sv": [
512
+ 49.337677001953125,
513
+ 16.639305114746094,
514
+ 12.377195358276367,
515
+ 12.035073280334473,
516
+ 11.454721450805664,
517
+ 11.364490509033203,
518
+ 10.909409523010254,
519
+ 10.634885787963867,
520
+ 10.58263111114502,
521
+ 10.402506828308105
522
+ ]
523
+ },
524
+ "transformer.h.4.attn.c_attn.weight": {
525
+ "shape": [
526
+ 1024,
527
+ 3072
528
+ ],
529
+ "effective_rank": 883.5626831054688,
530
+ "stable_rank": 66.41778564453125,
531
+ "spectral_norm": 26.355398178100586,
532
+ "frobenius_norm": 214.7888641357422,
533
+ "mp_bound": 14.6059706629891,
534
+ "n_above_mp": 28,
535
+ "n_total": 1024,
536
+ "signal_ratio": 0.02734375,
537
+ "alpha": 0.41387379570841165,
538
+ "alpha_r2": 0.928107216532441,
539
+ "condition_number": 54.06818771362305,
540
+ "top_10_sv": [
541
+ 26.355398178100586,
542
+ 24.099023818969727,
543
+ 23.613140106201172,
544
+ 22.856849670410156,
545
+ 22.657690048217773,
546
+ 22.315385818481445,
547
+ 21.944866180419922,
548
+ 20.979703903198242,
549
+ 20.732112884521484,
550
+ 20.403736114501953
551
+ ]
552
+ },
553
+ "transformer.h.4.attn.c_proj.weight": {
554
+ "shape": [
555
+ 1024,
556
+ 1024
557
+ ],
558
+ "effective_rank": 744.8408203125,
559
+ "stable_rank": 14.994495391845703,
560
+ "spectral_norm": 18.674596786499023,
561
+ "frobenius_norm": 72.3131332397461,
562
+ "mp_bound": 2.412200927734375,
563
+ "n_above_mp": 301,
564
+ "n_total": 1024,
565
+ "signal_ratio": 0.2939453125,
566
+ "alpha": 0.557779475735886,
567
+ "alpha_r2": 0.7426996015149363,
568
+ "condition_number": 38687.66015625,
569
+ "top_10_sv": [
570
+ 18.674596786499023,
571
+ 6.152257442474365,
572
+ 6.06334924697876,
573
+ 5.751345634460449,
574
+ 5.698556423187256,
575
+ 5.618075847625732,
576
+ 5.548093795776367,
577
+ 5.433638572692871,
578
+ 5.330124855041504,
579
+ 5.277598857879639
580
+ ]
581
+ },
582
+ "transformer.h.4.mlp.c_fc.weight": {
583
+ "shape": [
584
+ 1024,
585
+ 4096
586
+ ],
587
+ "effective_rank": 928.9017333984375,
588
+ "stable_rank": 27.40129280090332,
589
+ "spectral_norm": 42.51769256591797,
590
+ "frobenius_norm": 222.56414794921875,
591
+ "mp_bound": 19.081247806549072,
592
+ "n_above_mp": 3,
593
+ "n_total": 1024,
594
+ "signal_ratio": 0.0029296875,
595
+ "alpha": 0.33154839758470445,
596
+ "alpha_r2": 0.9484103079592642,
597
+ "condition_number": 44.41904067993164,
598
+ "top_10_sv": [
599
+ 42.51769256591797,
600
+ 33.62040710449219,
601
+ 20.00185775756836,
602
+ 19.004039764404297,
603
+ 18.752685546875,
604
+ 18.06087875366211,
605
+ 17.665428161621094,
606
+ 17.001155853271484,
607
+ 16.629446029663086,
608
+ 16.390981674194336
609
+ ]
610
+ },
611
+ "transformer.h.4.mlp.c_proj.weight": {
612
+ "shape": [
613
+ 4096,
614
+ 1024
615
+ ],
616
+ "effective_rank": 937.2068481445312,
617
+ "stable_rank": 12.849010467529297,
618
+ "spectral_norm": 48.77532958984375,
619
+ "frobenius_norm": 174.8376922607422,
620
+ "mp_bound": 15.396579265594482,
621
+ "n_above_mp": 1,
622
+ "n_total": 1024,
623
+ "signal_ratio": 0.0009765625,
624
+ "alpha": 0.3018330337436229,
625
+ "alpha_r2": 0.9230008315412146,
626
+ "condition_number": 90.40140533447266,
627
+ "top_10_sv": [
628
+ 48.77532958984375,
629
+ 14.544754981994629,
630
+ 14.092583656311035,
631
+ 13.27783489227295,
632
+ 12.841653823852539,
633
+ 12.547450065612793,
634
+ 12.23483657836914,
635
+ 11.652227401733398,
636
+ 11.583452224731445,
637
+ 11.510062217712402
638
+ ]
639
+ },
640
+ "transformer.h.5.attn.c_attn.weight": {
641
+ "shape": [
642
+ 1024,
643
+ 3072
644
+ ],
645
+ "effective_rank": 896.8251953125,
646
+ "stable_rank": 88.8460693359375,
647
+ "spectral_norm": 22.261667251586914,
648
+ "frobenius_norm": 209.83444213867188,
649
+ "mp_bound": 14.939659960070664,
650
+ "n_above_mp": 23,
651
+ "n_total": 1024,
652
+ "signal_ratio": 0.0224609375,
653
+ "alpha": 0.39352495861912024,
654
+ "alpha_r2": 0.9197388047187437,
655
+ "condition_number": 35.448787689208984,
656
+ "top_10_sv": [
657
+ 22.261667251586914,
658
+ 20.855236053466797,
659
+ 20.020145416259766,
660
+ 19.449405670166016,
661
+ 19.232900619506836,
662
+ 18.622623443603516,
663
+ 18.29434585571289,
664
+ 17.920202255249023,
665
+ 17.658803939819336,
666
+ 17.550424575805664
667
+ ]
668
+ },
669
+ "transformer.h.5.attn.c_proj.weight": {
670
+ "shape": [
671
+ 1024,
672
+ 1024
673
+ ],
674
+ "effective_rank": 746.4803466796875,
675
+ "stable_rank": 16.239879608154297,
676
+ "spectral_norm": 19.48035430908203,
677
+ "frobenius_norm": 78.50335693359375,
678
+ "mp_bound": 2.632415771484375,
679
+ "n_above_mp": 307,
680
+ "n_total": 1024,
681
+ "signal_ratio": 0.2998046875,
682
+ "alpha": 0.5490623315523016,
683
+ "alpha_r2": 0.7276577467035443,
684
+ "condition_number": 42145.03515625,
685
+ "top_10_sv": [
686
+ 19.48035430908203,
687
+ 6.621999740600586,
688
+ 6.383045196533203,
689
+ 6.287996292114258,
690
+ 6.164387226104736,
691
+ 5.886140823364258,
692
+ 5.757631778717041,
693
+ 5.754157066345215,
694
+ 5.641662120819092,
695
+ 5.541800498962402
696
+ ]
697
+ },
698
+ "transformer.h.5.mlp.c_fc.weight": {
699
+ "shape": [
700
+ 1024,
701
+ 4096
702
+ ],
703
+ "effective_rank": 936.9888916015625,
704
+ "stable_rank": 31.094371795654297,
705
+ "spectral_norm": 39.32979202270508,
706
+ "frobenius_norm": 219.31207275390625,
707
+ "mp_bound": 19.247617721557617,
708
+ "n_above_mp": 1,
709
+ "n_total": 1024,
710
+ "signal_ratio": 0.0009765625,
711
+ "alpha": 0.3193148230456383,
712
+ "alpha_r2": 0.9361649517765219,
713
+ "condition_number": 41.62086868286133,
714
+ "top_10_sv": [
715
+ 39.32979202270508,
716
+ 19.010740280151367,
717
+ 18.782209396362305,
718
+ 17.947439193725586,
719
+ 17.337230682373047,
720
+ 16.85923194885254,
721
+ 16.667734146118164,
722
+ 15.952093124389648,
723
+ 15.685457229614258,
724
+ 15.613792419433594
725
+ ]
726
+ },
727
+ "transformer.h.5.mlp.c_proj.weight": {
728
+ "shape": [
729
+ 4096,
730
+ 1024
731
+ ],
732
+ "effective_rank": 936.8583984375,
733
+ "stable_rank": 15.319039344787598,
734
+ "spectral_norm": 44.49594497680664,
735
+ "frobenius_norm": 174.1551055908203,
736
+ "mp_bound": 15.25501549243927,
737
+ "n_above_mp": 3,
738
+ "n_total": 1024,
739
+ "signal_ratio": 0.0029296875,
740
+ "alpha": 0.30940596098305206,
741
+ "alpha_r2": 0.9306552491126057,
742
+ "condition_number": 69.38197326660156,
743
+ "top_10_sv": [
744
+ 44.49594497680664,
745
+ 16.51666831970215,
746
+ 16.053695678710938,
747
+ 14.54151725769043,
748
+ 13.180120468139648,
749
+ 12.70102596282959,
750
+ 12.365108489990234,
751
+ 12.008030891418457,
752
+ 11.90465259552002,
753
+ 11.715628623962402
754
+ ]
755
+ },
756
+ "transformer.h.6.attn.c_attn.weight": {
757
+ "shape": [
758
+ 1024,
759
+ 3072
760
+ ],
761
+ "effective_rank": 897.4365844726562,
762
+ "stable_rank": 115.99281311035156,
763
+ "spectral_norm": 18.230764389038086,
764
+ "frobenius_norm": 196.3452606201172,
765
+ "mp_bound": 13.862052830560662,
766
+ "n_above_mp": 17,
767
+ "n_total": 1024,
768
+ "signal_ratio": 0.0166015625,
769
+ "alpha": 0.3909539490685798,
770
+ "alpha_r2": 0.8932138219176853,
771
+ "condition_number": 32.950740814208984,
772
+ "top_10_sv": [
773
+ 18.230764389038086,
774
+ 17.658872604370117,
775
+ 16.965496063232422,
776
+ 16.603540420532227,
777
+ 16.098522186279297,
778
+ 15.950654983520508,
779
+ 15.425930976867676,
780
+ 15.198224067687988,
781
+ 14.950895309448242,
782
+ 14.900473594665527
783
+ ]
784
+ },
785
+ "transformer.h.6.attn.c_proj.weight": {
786
+ "shape": [
787
+ 1024,
788
+ 1024
789
+ ],
790
+ "effective_rank": 741.775634765625,
791
+ "stable_rank": 23.221019744873047,
792
+ "spectral_norm": 17.442646026611328,
793
+ "frobenius_norm": 84.05296325683594,
794
+ "mp_bound": 2.810410976409912,
795
+ "n_above_mp": 295,
796
+ "n_total": 1024,
797
+ "signal_ratio": 0.2880859375,
798
+ "alpha": 0.5652261405393998,
799
+ "alpha_r2": 0.7572962195682047,
800
+ "condition_number": 14903.8291015625,
801
+ "top_10_sv": [
802
+ 17.442646026611328,
803
+ 8.159473419189453,
804
+ 7.555866241455078,
805
+ 7.282261848449707,
806
+ 7.234253883361816,
807
+ 7.103213787078857,
808
+ 6.917233943939209,
809
+ 6.809220790863037,
810
+ 6.620255470275879,
811
+ 6.525652885437012
812
+ ]
813
+ },
814
+ "transformer.h.6.mlp.c_fc.weight": {
815
+ "shape": [
816
+ 1024,
817
+ 4096
818
+ ],
819
+ "effective_rank": 934.8484497070312,
820
+ "stable_rank": 30.034225463867188,
821
+ "spectral_norm": 39.7269172668457,
822
+ "frobenius_norm": 217.7173614501953,
823
+ "mp_bound": 19.036645889282227,
824
+ "n_above_mp": 3,
825
+ "n_total": 1024,
826
+ "signal_ratio": 0.0029296875,
827
+ "alpha": 0.32115342994248447,
828
+ "alpha_r2": 0.9391101648886675,
829
+ "condition_number": 59.284549713134766,
830
+ "top_10_sv": [
831
+ 39.7269172668457,
832
+ 20.441776275634766,
833
+ 19.18965721130371,
834
+ 18.47493553161621,
835
+ 17.703449249267578,
836
+ 16.833940505981445,
837
+ 16.46803855895996,
838
+ 15.968595504760742,
839
+ 15.606867790222168,
840
+ 15.231405258178711
841
+ ]
842
+ },
843
+ "transformer.h.6.mlp.c_proj.weight": {
844
+ "shape": [
845
+ 4096,
846
+ 1024
847
+ ],
848
+ "effective_rank": 939.2515258789062,
849
+ "stable_rank": 24.95199203491211,
850
+ "spectral_norm": 34.625038146972656,
851
+ "frobenius_norm": 172.95889282226562,
852
+ "mp_bound": 15.321847558021545,
853
+ "n_above_mp": 2,
854
+ "n_total": 1024,
855
+ "signal_ratio": 0.001953125,
856
+ "alpha": 0.30995776991553114,
857
+ "alpha_r2": 0.9308628488187047,
858
+ "condition_number": 56.80348587036133,
859
+ "top_10_sv": [
860
+ 34.625038146972656,
861
+ 16.99053382873535,
862
+ 15.084966659545898,
863
+ 13.3975191116333,
864
+ 13.283339500427246,
865
+ 12.721936225891113,
866
+ 12.279509544372559,
867
+ 12.057058334350586,
868
+ 11.876668930053711,
869
+ 11.697145462036133
870
+ ]
871
+ },
872
+ "transformer.h.7.attn.c_attn.weight": {
873
+ "shape": [
874
+ 1024,
875
+ 3072
876
+ ],
877
+ "effective_rank": 910.6389770507812,
878
+ "stable_rank": 113.14727783203125,
879
+ "spectral_norm": 17.927513122558594,
880
+ "frobenius_norm": 190.69622802734375,
881
+ "mp_bound": 14.044904500546567,
882
+ "n_above_mp": 8,
883
+ "n_total": 1024,
884
+ "signal_ratio": 0.0078125,
885
+ "alpha": 0.36309653033535555,
886
+ "alpha_r2": 0.87929726729129,
887
+ "condition_number": 31.099102020263672,
888
+ "top_10_sv": [
889
+ 17.927513122558594,
890
+ 17.431472778320312,
891
+ 16.597808837890625,
892
+ 15.969321250915527,
893
+ 15.868179321289062,
894
+ 14.81343936920166,
895
+ 14.50455379486084,
896
+ 14.174510955810547,
897
+ 13.997900009155273,
898
+ 13.888890266418457
899
+ ]
900
+ },
901
+ "transformer.h.7.attn.c_proj.weight": {
902
+ "shape": [
903
+ 1024,
904
+ 1024
905
+ ],
906
+ "effective_rank": 743.44921875,
907
+ "stable_rank": 27.847932815551758,
908
+ "spectral_norm": 16.62673568725586,
909
+ "frobenius_norm": 87.74118041992188,
910
+ "mp_bound": 2.8953471183776855,
911
+ "n_above_mp": 311,
912
+ "n_total": 1024,
913
+ "signal_ratio": 0.3037109375,
914
+ "alpha": 0.5651577772345532,
915
+ "alpha_r2": 0.7372368922623682,
916
+ "condition_number": 9111.541015625,
917
+ "top_10_sv": [
918
+ 16.62673568725586,
919
+ 7.667027473449707,
920
+ 7.169197082519531,
921
+ 6.978420257568359,
922
+ 6.833680152893066,
923
+ 6.656124591827393,
924
+ 6.566739082336426,
925
+ 6.528911590576172,
926
+ 6.47317361831665,
927
+ 6.427671432495117
928
+ ]
929
+ },
930
+ "transformer.h.7.mlp.c_fc.weight": {
931
+ "shape": [
932
+ 1024,
933
+ 4096
934
+ ],
935
+ "effective_rank": 939.2886352539062,
936
+ "stable_rank": 31.832117080688477,
937
+ "spectral_norm": 38.28334045410156,
938
+ "frobenius_norm": 215.99444580078125,
939
+ "mp_bound": 19.122761964797974,
940
+ "n_above_mp": 2,
941
+ "n_total": 1024,
942
+ "signal_ratio": 0.001953125,
943
+ "alpha": 0.31228344301482186,
944
+ "alpha_r2": 0.9328367415414192,
945
+ "condition_number": 34.72146224975586,
946
+ "top_10_sv": [
947
+ 38.28334045410156,
948
+ 19.85346031188965,
949
+ 18.42683219909668,
950
+ 17.141801834106445,
951
+ 16.43387222290039,
952
+ 16.275760650634766,
953
+ 16.053062438964844,
954
+ 15.724404335021973,
955
+ 15.451314926147461,
956
+ 15.216456413269043
957
+ ]
958
+ },
959
+ "transformer.h.7.mlp.c_proj.weight": {
960
+ "shape": [
961
+ 4096,
962
+ 1024
963
+ ],
964
+ "effective_rank": 940.928955078125,
965
+ "stable_rank": 33.200374603271484,
966
+ "spectral_norm": 31.440378189086914,
967
+ "frobenius_norm": 181.15872192382812,
968
+ "mp_bound": 16.195263862609863,
969
+ "n_above_mp": 3,
970
+ "n_total": 1024,
971
+ "signal_ratio": 0.0029296875,
972
+ "alpha": 0.30703067706661946,
973
+ "alpha_r2": 0.9333060712253897,
974
+ "condition_number": 61.141605377197266,
975
+ "top_10_sv": [
976
+ 31.440378189086914,
977
+ 18.50613784790039,
978
+ 16.681230545043945,
979
+ 15.740647315979004,
980
+ 13.607156753540039,
981
+ 13.430535316467285,
982
+ 13.093123435974121,
983
+ 12.78380298614502,
984
+ 12.500567436218262,
985
+ 12.175090789794922
986
+ ]
987
+ },
988
+ "transformer.h.8.attn.c_attn.weight": {
989
+ "shape": [
990
+ 1024,
991
+ 3072
992
+ ],
993
+ "effective_rank": 905.0512084960938,
994
+ "stable_rank": 99.04619598388672,
995
+ "spectral_norm": 19.30182647705078,
996
+ "frobenius_norm": 192.09555053710938,
997
+ "mp_bound": 13.908594893171308,
998
+ "n_above_mp": 12,
999
+ "n_total": 1024,
1000
+ "signal_ratio": 0.01171875,
1001
+ "alpha": 0.37568865395886447,
1002
+ "alpha_r2": 0.8932899012135317,
1003
+ "condition_number": 26.482255935668945,
1004
+ "top_10_sv": [
1005
+ 19.30182647705078,
1006
+ 18.0625,
1007
+ 17.029926300048828,
1008
+ 16.608489990234375,
1009
+ 15.877984046936035,
1010
+ 15.367476463317871,
1011
+ 15.020094871520996,
1012
+ 14.934671401977539,
1013
+ 14.854327201843262,
1014
+ 14.298294067382812
1015
+ ]
1016
+ },
1017
+ "transformer.h.8.attn.c_proj.weight": {
1018
+ "shape": [
1019
+ 1024,
1020
+ 1024
1021
+ ],
1022
+ "effective_rank": 730.39599609375,
1023
+ "stable_rank": 23.92629051208496,
1024
+ "spectral_norm": 18.780719757080078,
1025
+ "frobenius_norm": 91.8649673461914,
1026
+ "mp_bound": 2.854386568069458,
1027
+ "n_above_mp": 321,
1028
+ "n_total": 1024,
1029
+ "signal_ratio": 0.3134765625,
1030
+ "alpha": 0.590965763955716,
1031
+ "alpha_r2": 0.7524001510220397,
1032
+ "condition_number": 10199.13671875,
1033
+ "top_10_sv": [
1034
+ 18.780719757080078,
1035
+ 9.103902816772461,
1036
+ 8.50258731842041,
1037
+ 7.891252517700195,
1038
+ 7.823528289794922,
1039
+ 7.506181716918945,
1040
+ 7.408065319061279,
1041
+ 7.162362575531006,
1042
+ 7.139276504516602,
1043
+ 7.067962646484375
1044
+ ]
1045
+ },
1046
+ "transformer.h.8.mlp.c_fc.weight": {
1047
+ "shape": [
1048
+ 1024,
1049
+ 4096
1050
+ ],
1051
+ "effective_rank": 936.2421875,
1052
+ "stable_rank": 38.09269714355469,
1053
+ "spectral_norm": 34.884918212890625,
1054
+ "frobenius_norm": 215.3072052001953,
1055
+ "mp_bound": 18.746233463287354,
1056
+ "n_above_mp": 2,
1057
+ "n_total": 1024,
1058
+ "signal_ratio": 0.001953125,
1059
+ "alpha": 0.32267201724432737,
1060
+ "alpha_r2": 0.9352114281367789,
1061
+ "condition_number": 38.3657341003418,
1062
+ "top_10_sv": [
1063
+ 34.884918212890625,
1064
+ 19.184831619262695,
1065
+ 18.076223373413086,
1066
+ 17.63890266418457,
1067
+ 16.88092613220215,
1068
+ 16.76192855834961,
1069
+ 16.45589828491211,
1070
+ 16.098548889160156,
1071
+ 15.723489761352539,
1072
+ 15.483587265014648
1073
+ ]
1074
+ },
1075
+ "transformer.h.8.mlp.c_proj.weight": {
1076
+ "shape": [
1077
+ 4096,
1078
+ 1024
1079
+ ],
1080
+ "effective_rank": 950.4535522460938,
1081
+ "stable_rank": 39.308773040771484,
1082
+ "spectral_norm": 27.4515380859375,
1083
+ "frobenius_norm": 172.1121063232422,
1084
+ "mp_bound": 15.766410827636719,
1085
+ "n_above_mp": 1,
1086
+ "n_total": 1024,
1087
+ "signal_ratio": 0.0009765625,
1088
+ "alpha": 0.28704826650262233,
1089
+ "alpha_r2": 0.905894394932516,
1090
+ "condition_number": 47.98366165161133,
1091
+ "top_10_sv": [
1092
+ 27.4515380859375,
1093
+ 14.21110725402832,
1094
+ 12.706835746765137,
1095
+ 12.113818168640137,
1096
+ 11.690884590148926,
1097
+ 11.509939193725586,
1098
+ 11.169968605041504,
1099
+ 11.008914947509766,
1100
+ 10.82995319366455,
1101
+ 10.744305610656738
1102
+ ]
1103
+ },
1104
+ "transformer.h.9.attn.c_attn.weight": {
1105
+ "shape": [
1106
+ 1024,
1107
+ 3072
1108
+ ],
1109
+ "effective_rank": 893.1614990234375,
1110
+ "stable_rank": 115.87055206298828,
1111
+ "spectral_norm": 17.444477081298828,
1112
+ "frobenius_norm": 187.7779083251953,
1113
+ "mp_bound": 12.937507738520768,
1114
+ "n_above_mp": 18,
1115
+ "n_total": 1024,
1116
+ "signal_ratio": 0.017578125,
1117
+ "alpha": 0.39390157964822387,
1118
+ "alpha_r2": 0.8784313742375426,
1119
+ "condition_number": 25.246517181396484,
1120
+ "top_10_sv": [
1121
+ 17.444477081298828,
1122
+ 16.29485511779785,
1123
+ 15.905627250671387,
1124
+ 15.577847480773926,
1125
+ 15.356382369995117,
1126
+ 15.149186134338379,
1127
+ 14.64958381652832,
1128
+ 14.456648826599121,
1129
+ 14.237245559692383,
1130
+ 14.110804557800293
1131
+ ]
1132
+ },
1133
+ "transformer.h.9.attn.c_proj.weight": {
1134
+ "shape": [
1135
+ 1024,
1136
+ 1024
1137
+ ],
1138
+ "effective_rank": 723.2728271484375,
1139
+ "stable_rank": 13.839754104614258,
1140
+ "spectral_norm": 26.413339614868164,
1141
+ "frobenius_norm": 98.2624282836914,
1142
+ "mp_bound": 3.006326913833618,
1143
+ "n_above_mp": 307,
1144
+ "n_total": 1024,
1145
+ "signal_ratio": 0.2998046875,
1146
+ "alpha": 0.5910035608611524,
1147
+ "alpha_r2": 0.7777498557933287,
1148
+ "condition_number": 84904.125,
1149
+ "top_10_sv": [
1150
+ 26.413339614868164,
1151
+ 24.238210678100586,
1152
+ 13.290877342224121,
1153
+ 11.937247276306152,
1154
+ 9.426248550415039,
1155
+ 7.983802795410156,
1156
+ 7.864888668060303,
1157
+ 7.773645877838135,
1158
+ 7.429806232452393,
1159
+ 7.390601634979248
1160
+ ]
1161
+ },
1162
+ "transformer.h.9.mlp.c_fc.weight": {
1163
+ "shape": [
1164
+ 1024,
1165
+ 4096
1166
+ ],
1167
+ "effective_rank": 938.6810913085938,
1168
+ "stable_rank": 35.63467025756836,
1169
+ "spectral_norm": 35.846248626708984,
1170
+ "frobenius_norm": 213.9833984375,
1171
+ "mp_bound": 18.81469202041626,
1172
+ "n_above_mp": 1,
1173
+ "n_total": 1024,
1174
+ "signal_ratio": 0.0009765625,
1175
+ "alpha": 0.3128217781401333,
1176
+ "alpha_r2": 0.9229840018865637,
1177
+ "condition_number": 41.859554290771484,
1178
+ "top_10_sv": [
1179
+ 35.846248626708984,
1180
+ 17.866443634033203,
1181
+ 17.141401290893555,
1182
+ 16.734554290771484,
1183
+ 16.55880355834961,
1184
+ 16.355411529541016,
1185
+ 15.886651039123535,
1186
+ 15.564071655273438,
1187
+ 15.157404899597168,
1188
+ 14.905134201049805
1189
+ ]
1190
+ },
1191
+ "transformer.h.9.mlp.c_proj.weight": {
1192
+ "shape": [
1193
+ 4096,
1194
+ 1024
1195
+ ],
1196
+ "effective_rank": 944.1468505859375,
1197
+ "stable_rank": 77.85943603515625,
1198
+ "spectral_norm": 21.045055389404297,
1199
+ "frobenius_norm": 185.6973419189453,
1200
+ "mp_bound": 16.68483567237854,
1201
+ "n_above_mp": 4,
1202
+ "n_total": 1024,
1203
+ "signal_ratio": 0.00390625,
1204
+ "alpha": 0.30334476439857244,
1205
+ "alpha_r2": 0.9238160976321365,
1206
+ "condition_number": 36.544837951660156,
1207
+ "top_10_sv": [
1208
+ 21.045055389404297,
1209
+ 18.942150115966797,
1210
+ 18.65625762939453,
1211
+ 17.60894203186035,
1212
+ 14.828644752502441,
1213
+ 13.582180976867676,
1214
+ 13.063478469848633,
1215
+ 12.69278335571289,
1216
+ 12.27115535736084,
1217
+ 12.16966438293457
1218
+ ]
1219
+ },
1220
+ "transformer.h.10.attn.c_attn.weight": {
1221
+ "shape": [
1222
+ 1024,
1223
+ 3072
1224
+ ],
1225
+ "effective_rank": 889.07373046875,
1226
+ "stable_rank": 110.98599243164062,
1227
+ "spectral_norm": 18.18061065673828,
1228
+ "frobenius_norm": 191.53253173828125,
1229
+ "mp_bound": 12.933648669067457,
1230
+ "n_above_mp": 23,
1231
+ "n_total": 1024,
1232
+ "signal_ratio": 0.0224609375,
1233
+ "alpha": 0.400968608999224,
1234
+ "alpha_r2": 0.8759391614511947,
1235
+ "condition_number": 22.077619552612305,
1236
+ "top_10_sv": [
1237
+ 18.18061065673828,
1238
+ 17.37954330444336,
1239
+ 16.723297119140625,
1240
+ 16.557512283325195,
1241
+ 15.972862243652344,
1242
+ 15.706113815307617,
1243
+ 15.19486141204834,
1244
+ 14.927367210388184,
1245
+ 14.738473892211914,
1246
+ 14.608979225158691
1247
+ ]
1248
+ },
1249
+ "transformer.h.10.attn.c_proj.weight": {
1250
+ "shape": [
1251
+ 1024,
1252
+ 1024
1253
+ ],
1254
+ "effective_rank": 712.9119262695312,
1255
+ "stable_rank": 15.614805221557617,
1256
+ "spectral_norm": 24.71717071533203,
1257
+ "frobenius_norm": 97.67131805419922,
1258
+ "mp_bound": 2.9057722091674805,
1259
+ "n_above_mp": 303,
1260
+ "n_total": 1024,
1261
+ "signal_ratio": 0.2958984375,
1262
+ "alpha": 0.6156612751434485,
1263
+ "alpha_r2": 0.7966407254881575,
1264
+ "condition_number": 26397.486328125,
1265
+ "top_10_sv": [
1266
+ 24.71717071533203,
1267
+ 22.72795867919922,
1268
+ 11.450672149658203,
1269
+ 9.635822296142578,
1270
+ 9.428568840026855,
1271
+ 8.725939750671387,
1272
+ 8.629619598388672,
1273
+ 8.33855152130127,
1274
+ 8.053350448608398,
1275
+ 8.0328950881958
1276
+ ]
1277
+ },
1278
+ "transformer.h.10.mlp.c_fc.weight": {
1279
+ "shape": [
1280
+ 1024,
1281
+ 4096
1282
+ ],
1283
+ "effective_rank": 936.62158203125,
1284
+ "stable_rank": 36.54176712036133,
1285
+ "spectral_norm": 35.16852951049805,
1286
+ "frobenius_norm": 212.593017578125,
1287
+ "mp_bound": 18.578571796417236,
1288
+ "n_above_mp": 2,
1289
+ "n_total": 1024,
1290
+ "signal_ratio": 0.001953125,
1291
+ "alpha": 0.31837997506899607,
1292
+ "alpha_r2": 0.9268627447940765,
1293
+ "condition_number": 44.306236267089844,
1294
+ "top_10_sv": [
1295
+ 35.16852951049805,
1296
+ 18.78311538696289,
1297
+ 17.198081970214844,
1298
+ 16.65778350830078,
1299
+ 16.474864959716797,
1300
+ 16.360870361328125,
1301
+ 15.851672172546387,
1302
+ 15.545867919921875,
1303
+ 15.253518104553223,
1304
+ 15.09296703338623
1305
+ ]
1306
+ },
1307
+ "transformer.h.10.mlp.c_proj.weight": {
1308
+ "shape": [
1309
+ 4096,
1310
+ 1024
1311
+ ],
1312
+ "effective_rank": 945.8907470703125,
1313
+ "stable_rank": 78.20089721679688,
1314
+ "spectral_norm": 21.427583694458008,
1315
+ "frobenius_norm": 189.48684692382812,
1316
+ "mp_bound": 17.147469520568848,
1317
+ "n_above_mp": 3,
1318
+ "n_total": 1024,
1319
+ "signal_ratio": 0.0029296875,
1320
+ "alpha": 0.30030447990075465,
1321
+ "alpha_r2": 0.9246116625990694,
1322
+ "condition_number": 53.54841232299805,
1323
+ "top_10_sv": [
1324
+ 21.427583694458008,
1325
+ 20.07192611694336,
1326
+ 17.730005264282227,
1327
+ 15.819792747497559,
1328
+ 14.968161582946777,
1329
+ 13.369080543518066,
1330
+ 13.173439025878906,
1331
+ 13.08027458190918,
1332
+ 12.7400541305542,
1333
+ 12.428592681884766
1334
+ ]
1335
+ },
1336
+ "transformer.h.11.attn.c_attn.weight": {
1337
+ "shape": [
1338
+ 1024,
1339
+ 3072
1340
+ ],
1341
+ "effective_rank": 901.9686279296875,
1342
+ "stable_rank": 114.79285430908203,
1343
+ "spectral_norm": 17.392602920532227,
1344
+ "frobenius_norm": 186.34683227539062,
1345
+ "mp_bound": 13.299353036480547,
1346
+ "n_above_mp": 13,
1347
+ "n_total": 1024,
1348
+ "signal_ratio": 0.0126953125,
1349
+ "alpha": 0.38050221069018897,
1350
+ "alpha_r2": 0.8828344695891076,
1351
+ "condition_number": 18.282512664794922,
1352
+ "top_10_sv": [
1353
+ 17.392602920532227,
1354
+ 16.71450424194336,
1355
+ 15.829066276550293,
1356
+ 15.560785293579102,
1357
+ 15.137024879455566,
1358
+ 14.838809967041016,
1359
+ 14.30210018157959,
1360
+ 13.980681419372559,
1361
+ 13.925599098205566,
1362
+ 13.726691246032715
1363
+ ]
1364
+ },
1365
+ "transformer.h.11.attn.c_proj.weight": {
1366
+ "shape": [
1367
+ 1024,
1368
+ 1024
1369
+ ],
1370
+ "effective_rank": 733.375,
1371
+ "stable_rank": 42.236331939697266,
1372
+ "spectral_norm": 15.064600944519043,
1373
+ "frobenius_norm": 97.90406799316406,
1374
+ "mp_bound": 3.1596755981445312,
1375
+ "n_above_mp": 294,
1376
+ "n_total": 1024,
1377
+ "signal_ratio": 0.287109375,
1378
+ "alpha": 0.5869072812022946,
1379
+ "alpha_r2": 0.7743078879808162,
1380
+ "condition_number": 10999.072265625,
1381
+ "top_10_sv": [
1382
+ 15.064600944519043,
1383
+ 13.98853874206543,
1384
+ 11.859731674194336,
1385
+ 11.481588363647461,
1386
+ 8.733123779296875,
1387
+ 8.476235389709473,
1388
+ 8.284677505493164,
1389
+ 8.177391052246094,
1390
+ 7.880287170410156,
1391
+ 7.761397361755371
1392
+ ]
1393
+ },
1394
+ "transformer.h.11.mlp.c_fc.weight": {
1395
+ "shape": [
1396
+ 1024,
1397
+ 4096
1398
+ ],
1399
+ "effective_rank": 936.5640869140625,
1400
+ "stable_rank": 36.39714050292969,
1401
+ "spectral_norm": 35.354408264160156,
1402
+ "frobenius_norm": 213.29330444335938,
1403
+ "mp_bound": 18.617774963378906,
1404
+ "n_above_mp": 2,
1405
+ "n_total": 1024,
1406
+ "signal_ratio": 0.001953125,
1407
+ "alpha": 0.3180465298760983,
1408
+ "alpha_r2": 0.9268863407776537,
1409
+ "condition_number": 36.84086990356445,
1410
+ "top_10_sv": [
1411
+ 35.354408264160156,
1412
+ 19.106882095336914,
1413
+ 17.004091262817383,
1414
+ 16.809144973754883,
1415
+ 16.472070693969727,
1416
+ 16.30332374572754,
1417
+ 15.732508659362793,
1418
+ 15.557268142700195,
1419
+ 15.403963088989258,
1420
+ 14.984481811523438
1421
+ ]
1422
+ },
1423
+ "transformer.h.11.mlp.c_proj.weight": {
1424
+ "shape": [
1425
+ 4096,
1426
+ 1024
1427
+ ],
1428
+ "effective_rank": 950.7989501953125,
1429
+ "stable_rank": 109.07081604003906,
1430
+ "spectral_norm": 18.535531997680664,
1431
+ "frobenius_norm": 193.57949829101562,
1432
+ "mp_bound": 17.80658483505249,
1433
+ "n_above_mp": 1,
1434
+ "n_total": 1024,
1435
+ "signal_ratio": 0.0009765625,
1436
+ "alpha": 0.2904225097637395,
1437
+ "alpha_r2": 0.9180203572981726,
1438
+ "condition_number": 31.789386749267578,
1439
+ "top_10_sv": [
1440
+ 18.535531997680664,
1441
+ 17.2476749420166,
1442
+ 15.5623140335083,
1443
+ 15.329514503479004,
1444
+ 14.429676055908203,
1445
+ 13.832291603088379,
1446
+ 13.37085247039795,
1447
+ 13.245711326599121,
1448
+ 13.16356372833252,
1449
+ 13.034541130065918
1450
+ ]
1451
+ },
1452
+ "transformer.h.12.attn.c_attn.weight": {
1453
+ "shape": [
1454
+ 1024,
1455
+ 3072
1456
+ ],
1457
+ "effective_rank": 900.2671508789062,
1458
+ "stable_rank": 111.13557434082031,
1459
+ "spectral_norm": 17.990798950195312,
1460
+ "frobenius_norm": 189.66055297851562,
1461
+ "mp_bound": 13.46277887415714,
1462
+ "n_above_mp": 15,
1463
+ "n_total": 1024,
1464
+ "signal_ratio": 0.0146484375,
1465
+ "alpha": 0.3849465327520483,
1466
+ "alpha_r2": 0.8884130947195702,
1467
+ "condition_number": 19.290571212768555,
1468
+ "top_10_sv": [
1469
+ 17.990798950195312,
1470
+ 17.627941131591797,
1471
+ 16.81555938720703,
1472
+ 16.37408447265625,
1473
+ 15.790731430053711,
1474
+ 15.523019790649414,
1475
+ 14.99732494354248,
1476
+ 14.942817687988281,
1477
+ 14.576937675476074,
1478
+ 14.378122329711914
1479
+ ]
1480
+ },
1481
+ "transformer.h.12.attn.c_proj.weight": {
1482
+ "shape": [
1483
+ 1024,
1484
+ 1024
1485
+ ],
1486
+ "effective_rank": 726.1136474609375,
1487
+ "stable_rank": 67.91838836669922,
1488
+ "spectral_norm": 11.730238914489746,
1489
+ "frobenius_norm": 96.67196655273438,
1490
+ "mp_bound": 2.9830989837646484,
1491
+ "n_above_mp": 317,
1492
+ "n_total": 1024,
1493
+ "signal_ratio": 0.3095703125,
1494
+ "alpha": 0.6079611377940939,
1495
+ "alpha_r2": 0.7713425607206535,
1496
+ "condition_number": 74660.875,
1497
+ "top_10_sv": [
1498
+ 11.730238914489746,
1499
+ 11.002860069274902,
1500
+ 9.450277328491211,
1501
+ 9.339866638183594,
1502
+ 9.130253791809082,
1503
+ 8.70073413848877,
1504
+ 8.407955169677734,
1505
+ 8.365035057067871,
1506
+ 8.216530799865723,
1507
+ 8.057732582092285
1508
+ ]
1509
+ },
1510
+ "transformer.h.12.mlp.c_fc.weight": {
1511
+ "shape": [
1512
+ 1024,
1513
+ 4096
1514
+ ],
1515
+ "effective_rank": 937.1617431640625,
1516
+ "stable_rank": 35.29595184326172,
1517
+ "spectral_norm": 35.728485107421875,
1518
+ "frobenius_norm": 212.26434326171875,
1519
+ "mp_bound": 18.63219451904297,
1520
+ "n_above_mp": 1,
1521
+ "n_total": 1024,
1522
+ "signal_ratio": 0.0009765625,
1523
+ "alpha": 0.31249088591006335,
1524
+ "alpha_r2": 0.9217926851101266,
1525
+ "condition_number": 59.26630401611328,
1526
+ "top_10_sv": [
1527
+ 35.728485107421875,
1528
+ 18.58656883239746,
1529
+ 16.78707504272461,
1530
+ 16.50104331970215,
1531
+ 16.170015335083008,
1532
+ 15.932202339172363,
1533
+ 15.695175170898438,
1534
+ 15.225218772888184,
1535
+ 14.931440353393555,
1536
+ 14.655704498291016
1537
+ ]
1538
+ },
1539
+ "transformer.h.12.mlp.c_proj.weight": {
1540
+ "shape": [
1541
+ 4096,
1542
+ 1024
1543
+ ],
1544
+ "effective_rank": 951.9144897460938,
1545
+ "stable_rank": 125.68574523925781,
1546
+ "spectral_norm": 17.590118408203125,
1547
+ "frobenius_norm": 197.20220947265625,
1548
+ "mp_bound": 18.2495698928833,
1549
+ "n_above_mp": 0,
1550
+ "n_total": 1024,
1551
+ "signal_ratio": 0.0,
1552
+ "alpha": 0.28659027982486085,
1553
+ "alpha_r2": 0.9157493562146201,
1554
+ "condition_number": 63.03939437866211,
1555
+ "top_10_sv": [
1556
+ 17.590118408203125,
1557
+ 16.458555221557617,
1558
+ 15.0971040725708,
1559
+ 14.703747749328613,
1560
+ 14.40058708190918,
1561
+ 14.16800308227539,
1562
+ 13.817294120788574,
1563
+ 13.755056381225586,
1564
+ 13.447993278503418,
1565
+ 13.400959968566895
1566
+ ]
1567
+ },
1568
+ "transformer.h.13.attn.c_attn.weight": {
1569
+ "shape": [
1570
+ 1024,
1571
+ 3072
1572
+ ],
1573
+ "effective_rank": 921.8739013671875,
1574
+ "stable_rank": 122.69657897949219,
1575
+ "spectral_norm": 17.3557071685791,
1576
+ "frobenius_norm": 192.24655151367188,
1577
+ "mp_bound": 14.840765654402492,
1578
+ "n_above_mp": 5,
1579
+ "n_total": 1024,
1580
+ "signal_ratio": 0.0048828125,
1581
+ "alpha": 0.34449500140802347,
1582
+ "alpha_r2": 0.8890960763544924,
1583
+ "condition_number": 12.950295448303223,
1584
+ "top_10_sv": [
1585
+ 17.3557071685791,
1586
+ 16.527515411376953,
1587
+ 16.102203369140625,
1588
+ 15.533954620361328,
1589
+ 15.408553123474121,
1590
+ 14.486262321472168,
1591
+ 14.440102577209473,
1592
+ 14.294885635375977,
1593
+ 13.857573509216309,
1594
+ 13.616387367248535
1595
+ ]
1596
+ },
1597
+ "transformer.h.13.attn.c_proj.weight": {
1598
+ "shape": [
1599
+ 1024,
1600
+ 1024
1601
+ ],
1602
+ "effective_rank": 745.8662109375,
1603
+ "stable_rank": 69.44125366210938,
1604
+ "spectral_norm": 11.393943786621094,
1605
+ "frobenius_norm": 94.94735717773438,
1606
+ "mp_bound": 3.1595468521118164,
1607
+ "n_above_mp": 305,
1608
+ "n_total": 1024,
1609
+ "signal_ratio": 0.2978515625,
1610
+ "alpha": 0.5692666342371,
1611
+ "alpha_r2": 0.7468213049281058,
1612
+ "condition_number": 11072.6982421875,
1613
+ "top_10_sv": [
1614
+ 11.393943786621094,
1615
+ 10.033926010131836,
1616
+ 9.37811279296875,
1617
+ 8.2539701461792,
1618
+ 7.887532711029053,
1619
+ 7.69539213180542,
1620
+ 7.5529046058654785,
1621
+ 7.425933837890625,
1622
+ 7.274467468261719,
1623
+ 7.151112079620361
1624
+ ]
1625
+ },
1626
+ "transformer.h.13.mlp.c_fc.weight": {
1627
+ "shape": [
1628
+ 1024,
1629
+ 4096
1630
+ ],
1631
+ "effective_rank": 937.599365234375,
1632
+ "stable_rank": 37.05495071411133,
1633
+ "spectral_norm": 35.005916595458984,
1634
+ "frobenius_norm": 213.0907440185547,
1635
+ "mp_bound": 18.706077575683594,
1636
+ "n_above_mp": 1,
1637
+ "n_total": 1024,
1638
+ "signal_ratio": 0.0009765625,
1639
+ "alpha": 0.3122716285047514,
1640
+ "alpha_r2": 0.9183302410504182,
1641
+ "condition_number": 36.091087341308594,
1642
+ "top_10_sv": [
1643
+ 35.005916595458984,
1644
+ 18.3486385345459,
1645
+ 16.548778533935547,
1646
+ 16.184619903564453,
1647
+ 15.928171157836914,
1648
+ 15.709515571594238,
1649
+ 15.531082153320312,
1650
+ 14.870841026306152,
1651
+ 14.759794235229492,
1652
+ 14.733899116516113
1653
+ ]
1654
+ },
1655
+ "transformer.h.13.mlp.c_proj.weight": {
1656
+ "shape": [
1657
+ 4096,
1658
+ 1024
1659
+ ],
1660
+ "effective_rank": 953.9815063476562,
1661
+ "stable_rank": 93.02642059326172,
1662
+ "spectral_norm": 21.067401885986328,
1663
+ "frobenius_norm": 203.19552612304688,
1664
+ "mp_bound": 18.922306537628174,
1665
+ "n_above_mp": 1,
1666
+ "n_total": 1024,
1667
+ "signal_ratio": 0.0009765625,
1668
+ "alpha": 0.2803336865969046,
1669
+ "alpha_r2": 0.9048293558413308,
1670
+ "condition_number": 39.748817443847656,
1671
+ "top_10_sv": [
1672
+ 21.067401885986328,
1673
+ 17.05304718017578,
1674
+ 15.297992706298828,
1675
+ 14.769113540649414,
1676
+ 14.03395938873291,
1677
+ 13.835834503173828,
1678
+ 13.424732208251953,
1679
+ 13.155864715576172,
1680
+ 13.011979103088379,
1681
+ 12.918780326843262
1682
+ ]
1683
+ },
1684
+ "transformer.h.14.attn.c_attn.weight": {
1685
+ "shape": [
1686
+ 1024,
1687
+ 3072
1688
+ ],
1689
+ "effective_rank": 927.090576171875,
1690
+ "stable_rank": 125.47744750976562,
1691
+ "spectral_norm": 16.747602462768555,
1692
+ "frobenius_norm": 187.60113525390625,
1693
+ "mp_bound": 14.835510292247621,
1694
+ "n_above_mp": 4,
1695
+ "n_total": 1024,
1696
+ "signal_ratio": 0.00390625,
1697
+ "alpha": 0.33493011773835984,
1698
+ "alpha_r2": 0.893001234472107,
1699
+ "condition_number": 14.26633358001709,
1700
+ "top_10_sv": [
1701
+ 16.747602462768555,
1702
+ 15.885435104370117,
1703
+ 15.1227388381958,
1704
+ 14.847970008850098,
1705
+ 14.377248764038086,
1706
+ 14.23108959197998,
1707
+ 13.869467735290527,
1708
+ 13.69625186920166,
1709
+ 13.520434379577637,
1710
+ 13.335817337036133
1711
+ ]
1712
+ },
1713
+ "transformer.h.14.attn.c_proj.weight": {
1714
+ "shape": [
1715
+ 1024,
1716
+ 1024
1717
+ ],
1718
+ "effective_rank": 771.0768432617188,
1719
+ "stable_rank": 45.977813720703125,
1720
+ "spectral_norm": 14.095232963562012,
1721
+ "frobenius_norm": 95.57546997070312,
1722
+ "mp_bound": 3.5943875312805176,
1723
+ "n_above_mp": 260,
1724
+ "n_total": 1024,
1725
+ "signal_ratio": 0.25390625,
1726
+ "alpha": 0.5172198054236867,
1727
+ "alpha_r2": 0.7357139263927202,
1728
+ "condition_number": 4705.93505859375,
1729
+ "top_10_sv": [
1730
+ 14.095232963562012,
1731
+ 10.67358684539795,
1732
+ 9.503582000732422,
1733
+ 7.890437126159668,
1734
+ 7.194736003875732,
1735
+ 7.045958042144775,
1736
+ 7.003268718719482,
1737
+ 6.855443000793457,
1738
+ 6.832728862762451,
1739
+ 6.729796886444092
1740
+ ]
1741
+ },
1742
+ "transformer.h.14.mlp.c_fc.weight": {
1743
+ "shape": [
1744
+ 1024,
1745
+ 4096
1746
+ ],
1747
+ "effective_rank": 938.0935668945312,
1748
+ "stable_rank": 32.70786666870117,
1749
+ "spectral_norm": 37.25117492675781,
1750
+ "frobenius_norm": 213.04241943359375,
1751
+ "mp_bound": 18.850232362747192,
1752
+ "n_above_mp": 1,
1753
+ "n_total": 1024,
1754
+ "signal_ratio": 0.0009765625,
1755
+ "alpha": 0.3063623329883038,
1756
+ "alpha_r2": 0.9141184802270717,
1757
+ "condition_number": 55.86502456665039,
1758
+ "top_10_sv": [
1759
+ 37.25117492675781,
1760
+ 17.538530349731445,
1761
+ 16.43047332763672,
1762
+ 15.928857803344727,
1763
+ 15.448025703430176,
1764
+ 15.416549682617188,
1765
+ 15.191793441772461,
1766
+ 14.8709135055542,
1767
+ 14.701964378356934,
1768
+ 14.292315483093262
1769
+ ]
1770
+ },
1771
+ "transformer.h.14.mlp.c_proj.weight": {
1772
+ "shape": [
1773
+ 4096,
1774
+ 1024
1775
+ ],
1776
+ "effective_rank": 954.6704711914062,
1777
+ "stable_rank": 85.12628173828125,
1778
+ "spectral_norm": 23.248565673828125,
1779
+ "frobenius_norm": 214.50035095214844,
1780
+ "mp_bound": 19.985750913619995,
1781
+ "n_above_mp": 1,
1782
+ "n_total": 1024,
1783
+ "signal_ratio": 0.0009765625,
1784
+ "alpha": 0.2766327750388291,
1785
+ "alpha_r2": 0.9013307804087409,
1786
+ "condition_number": 80.55670166015625,
1787
+ "top_10_sv": [
1788
+ 23.248565673828125,
1789
+ 18.41067886352539,
1790
+ 15.474272727966309,
1791
+ 14.885811805725098,
1792
+ 14.50477409362793,
1793
+ 14.26229476928711,
1794
+ 14.121097564697266,
1795
+ 13.90327262878418,
1796
+ 13.509455680847168,
1797
+ 13.493183135986328
1798
+ ]
1799
+ },
1800
+ "transformer.h.15.attn.c_attn.weight": {
1801
+ "shape": [
1802
+ 1024,
1803
+ 3072
1804
+ ],
1805
+ "effective_rank": 927.6941528320312,
1806
+ "stable_rank": 123.29837799072266,
1807
+ "spectral_norm": 16.74991798400879,
1808
+ "frobenius_norm": 185.99075317382812,
1809
+ "mp_bound": 14.70581637928394,
1810
+ "n_above_mp": 3,
1811
+ "n_total": 1024,
1812
+ "signal_ratio": 0.0029296875,
1813
+ "alpha": 0.33338608542644904,
1814
+ "alpha_r2": 0.889507619370194,
1815
+ "condition_number": 13.823485374450684,
1816
+ "top_10_sv": [
1817
+ 16.74991798400879,
1818
+ 15.065845489501953,
1819
+ 14.728111267089844,
1820
+ 14.221285820007324,
1821
+ 14.000044822692871,
1822
+ 13.83415412902832,
1823
+ 13.696669578552246,
1824
+ 13.493361473083496,
1825
+ 13.296980857849121,
1826
+ 13.158197402954102
1827
+ ]
1828
+ },
1829
+ "transformer.h.15.attn.c_proj.weight": {
1830
+ "shape": [
1831
+ 1024,
1832
+ 1024
1833
+ ],
1834
+ "effective_rank": 734.9755859375,
1835
+ "stable_rank": 39.57041549682617,
1836
+ "spectral_norm": 15.907938957214355,
1837
+ "frobenius_norm": 100.06892395019531,
1838
+ "mp_bound": 3.168710231781006,
1839
+ "n_above_mp": 315,
1840
+ "n_total": 1024,
1841
+ "signal_ratio": 0.3076171875,
1842
+ "alpha": 0.5890122729886013,
1843
+ "alpha_r2": 0.7620616339931927,
1844
+ "condition_number": 118929.2421875,
1845
+ "top_10_sv": [
1846
+ 15.907938957214355,
1847
+ 9.037424087524414,
1848
+ 8.875422477722168,
1849
+ 8.679743766784668,
1850
+ 8.60929012298584,
1851
+ 8.428465843200684,
1852
+ 8.394245147705078,
1853
+ 8.310733795166016,
1854
+ 8.301813125610352,
1855
+ 8.17810344696045
1856
+ ]
1857
+ },
1858
+ "transformer.h.15.mlp.c_fc.weight": {
1859
+ "shape": [
1860
+ 1024,
1861
+ 4096
1862
+ ],
1863
+ "effective_rank": 938.0027465820312,
1864
+ "stable_rank": 33.95222854614258,
1865
+ "spectral_norm": 36.603248596191406,
1866
+ "frobenius_norm": 213.28179931640625,
1867
+ "mp_bound": 18.90978169441223,
1868
+ "n_above_mp": 1,
1869
+ "n_total": 1024,
1870
+ "signal_ratio": 0.0009765625,
1871
+ "alpha": 0.30609205088241465,
1872
+ "alpha_r2": 0.9144551505747684,
1873
+ "condition_number": 63.18374252319336,
1874
+ "top_10_sv": [
1875
+ 36.603248596191406,
1876
+ 17.053295135498047,
1877
+ 16.544368743896484,
1878
+ 16.148056030273438,
1879
+ 16.080337524414062,
1880
+ 15.509836196899414,
1881
+ 15.409363746643066,
1882
+ 15.122523307800293,
1883
+ 14.887523651123047,
1884
+ 14.568466186523438
1885
+ ]
1886
+ },
1887
+ "transformer.h.15.mlp.c_proj.weight": {
1888
+ "shape": [
1889
+ 4096,
1890
+ 1024
1891
+ ],
1892
+ "effective_rank": 954.9154663085938,
1893
+ "stable_rank": 67.59476470947266,
1894
+ "spectral_norm": 27.46138572692871,
1895
+ "frobenius_norm": 225.77662658691406,
1896
+ "mp_bound": 20.990967750549316,
1897
+ "n_above_mp": 1,
1898
+ "n_total": 1024,
1899
+ "signal_ratio": 0.0009765625,
1900
+ "alpha": 0.2748501163371319,
1901
+ "alpha_r2": 0.8922496782749698,
1902
+ "condition_number": 75.69937896728516,
1903
+ "top_10_sv": [
1904
+ 27.46138572692871,
1905
+ 20.168291091918945,
1906
+ 16.48451805114746,
1907
+ 14.961456298828125,
1908
+ 14.724319458007812,
1909
+ 14.283745765686035,
1910
+ 14.013591766357422,
1911
+ 13.970895767211914,
1912
+ 13.828929901123047,
1913
+ 13.712251663208008
1914
+ ]
1915
+ },
1916
+ "transformer.h.16.attn.c_attn.weight": {
1917
+ "shape": [
1918
+ 1024,
1919
+ 3072
1920
+ ],
1921
+ "effective_rank": 946.042724609375,
1922
+ "stable_rank": 144.74566650390625,
1923
+ "spectral_norm": 15.667328834533691,
1924
+ "frobenius_norm": 188.4940948486328,
1925
+ "mp_bound": 15.990341963357688,
1926
+ "n_above_mp": 0,
1927
+ "n_total": 1024,
1928
+ "signal_ratio": 0.0,
1929
+ "alpha": 0.29235290161143895,
1930
+ "alpha_r2": 0.8708847197371617,
1931
+ "condition_number": 10.925378799438477,
1932
+ "top_10_sv": [
1933
+ 15.667328834533691,
1934
+ 13.777777671813965,
1935
+ 13.580466270446777,
1936
+ 13.480743408203125,
1937
+ 13.149480819702148,
1938
+ 12.728473663330078,
1939
+ 12.464872360229492,
1940
+ 12.249978065490723,
1941
+ 12.104480743408203,
1942
+ 12.058478355407715
1943
+ ]
1944
+ },
1945
+ "transformer.h.16.attn.c_proj.weight": {
1946
+ "shape": [
1947
+ 1024,
1948
+ 1024
1949
+ ],
1950
+ "effective_rank": 771.8905639648438,
1951
+ "stable_rank": 34.916778564453125,
1952
+ "spectral_norm": 17.508285522460938,
1953
+ "frobenius_norm": 103.45719909667969,
1954
+ "mp_bound": 3.9519896507263184,
1955
+ "n_above_mp": 251,
1956
+ "n_total": 1024,
1957
+ "signal_ratio": 0.2451171875,
1958
+ "alpha": 0.5150372729977702,
1959
+ "alpha_r2": 0.7343136754490875,
1960
+ "condition_number": 7742.5263671875,
1961
+ "top_10_sv": [
1962
+ 17.508285522460938,
1963
+ 8.428668022155762,
1964
+ 8.318235397338867,
1965
+ 7.7397379875183105,
1966
+ 7.588472843170166,
1967
+ 7.562111854553223,
1968
+ 7.487995624542236,
1969
+ 7.291107177734375,
1970
+ 7.21236515045166,
1971
+ 7.17236852645874
1972
+ ]
1973
+ },
1974
+ "transformer.h.16.mlp.c_fc.weight": {
1975
+ "shape": [
1976
+ 1024,
1977
+ 4096
1978
+ ],
1979
+ "effective_rank": 940.1898193359375,
1980
+ "stable_rank": 35.53369140625,
1981
+ "spectral_norm": 35.824527740478516,
1982
+ "frobenius_norm": 213.55052185058594,
1983
+ "mp_bound": 19.066128730773926,
1984
+ "n_above_mp": 1,
1985
+ "n_total": 1024,
1986
+ "signal_ratio": 0.0009765625,
1987
+ "alpha": 0.29995952857301905,
1988
+ "alpha_r2": 0.9104848364911763,
1989
+ "condition_number": 64.50019836425781,
1990
+ "top_10_sv": [
1991
+ 35.824527740478516,
1992
+ 16.836957931518555,
1993
+ 16.33154296875,
1994
+ 15.77452564239502,
1995
+ 15.625938415527344,
1996
+ 15.192853927612305,
1997
+ 14.959126472473145,
1998
+ 14.744555473327637,
1999
+ 14.429903030395508,
2000
+ 14.108305931091309
2001
+ ]
2002
+ },
2003
+ "transformer.h.16.mlp.c_proj.weight": {
2004
+ "shape": [
2005
+ 4096,
2006
+ 1024
2007
+ ],
2008
+ "effective_rank": 953.3158569335938,
2009
+ "stable_rank": 65.17062377929688,
2010
+ "spectral_norm": 29.556535720825195,
2011
+ "frobenius_norm": 238.60496520996094,
2012
+ "mp_bound": 22.07623028755188,
2013
+ "n_above_mp": 1,
2014
+ "n_total": 1024,
2015
+ "signal_ratio": 0.0009765625,
2016
+ "alpha": 0.27669229111220206,
2017
+ "alpha_r2": 0.8878361607414883,
2018
+ "condition_number": 59.69606018066406,
2019
+ "top_10_sv": [
2020
+ 29.556535720825195,
2021
+ 20.332595825195312,
2022
+ 17.3956356048584,
2023
+ 15.779086112976074,
2024
+ 15.581547737121582,
2025
+ 15.288317680358887,
2026
+ 15.058049201965332,
2027
+ 14.879526138305664,
2028
+ 14.627667427062988,
2029
+ 14.318242073059082
2030
+ ]
2031
+ },
2032
+ "transformer.h.17.attn.c_attn.weight": {
2033
+ "shape": [
2034
+ 1024,
2035
+ 3072
2036
+ ],
2037
+ "effective_rank": 947.1865234375,
2038
+ "stable_rank": 142.42913818359375,
2039
+ "spectral_norm": 15.703680992126465,
2040
+ "frobenius_norm": 187.4134979248047,
2041
+ "mp_bound": 15.91192969963621,
2042
+ "n_above_mp": 0,
2043
+ "n_total": 1024,
2044
+ "signal_ratio": 0.0,
2045
+ "alpha": 0.28839905133263977,
2046
+ "alpha_r2": 0.8639671147195912,
2047
+ "condition_number": 9.761466026306152,
2048
+ "top_10_sv": [
2049
+ 15.703680992126465,
2050
+ 14.009387016296387,
2051
+ 13.541427612304688,
2052
+ 13.323938369750977,
2053
+ 12.849181175231934,
2054
+ 12.739950180053711,
2055
+ 12.229618072509766,
2056
+ 12.002062797546387,
2057
+ 11.895586013793945,
2058
+ 11.779114723205566
2059
+ ]
2060
+ },
2061
+ "transformer.h.17.attn.c_proj.weight": {
2062
+ "shape": [
2063
+ 1024,
2064
+ 1024
2065
+ ],
2066
+ "effective_rank": 761.0531005859375,
2067
+ "stable_rank": 24.522497177124023,
2068
+ "spectral_norm": 21.281356811523438,
2069
+ "frobenius_norm": 105.38569641113281,
2070
+ "mp_bound": 3.7680649757385254,
2071
+ "n_above_mp": 283,
2072
+ "n_total": 1024,
2073
+ "signal_ratio": 0.2763671875,
2074
+ "alpha": 0.5307534843077817,
2075
+ "alpha_r2": 0.7242129172164211,
2076
+ "condition_number": 3630.01904296875,
2077
+ "top_10_sv": [
2078
+ 21.281356811523438,
2079
+ 8.807740211486816,
2080
+ 8.406106948852539,
2081
+ 8.249408721923828,
2082
+ 7.748202323913574,
2083
+ 7.372308254241943,
2084
+ 7.20876407623291,
2085
+ 7.15388298034668,
2086
+ 7.152019500732422,
2087
+ 7.0919060707092285
2088
+ ]
2089
+ },
2090
+ "transformer.h.17.mlp.c_fc.weight": {
2091
+ "shape": [
2092
+ 1024,
2093
+ 4096
2094
+ ],
2095
+ "effective_rank": 941.4163208007812,
2096
+ "stable_rank": 32.30669403076172,
2097
+ "spectral_norm": 37.66846466064453,
2098
+ "frobenius_norm": 214.10369873046875,
2099
+ "mp_bound": 19.23757553100586,
2100
+ "n_above_mp": 1,
2101
+ "n_total": 1024,
2102
+ "signal_ratio": 0.0009765625,
2103
+ "alpha": 0.29567148822186234,
2104
+ "alpha_r2": 0.9092601420648001,
2105
+ "condition_number": 106.81797790527344,
2106
+ "top_10_sv": [
2107
+ 37.66846466064453,
2108
+ 16.419097900390625,
2109
+ 15.927478790283203,
2110
+ 15.601706504821777,
2111
+ 15.464677810668945,
2112
+ 15.215024948120117,
2113
+ 14.866231918334961,
2114
+ 14.730674743652344,
2115
+ 14.174066543579102,
2116
+ 14.107979774475098
2117
+ ]
2118
+ },
2119
+ "transformer.h.17.mlp.c_proj.weight": {
2120
+ "shape": [
2121
+ 4096,
2122
+ 1024
2123
+ ],
2124
+ "effective_rank": 955.3389892578125,
2125
+ "stable_rank": 56.71522521972656,
2126
+ "spectral_norm": 32.85487747192383,
2127
+ "frobenius_norm": 247.42848205566406,
2128
+ "mp_bound": 23.075778007507324,
2129
+ "n_above_mp": 1,
2130
+ "n_total": 1024,
2131
+ "signal_ratio": 0.0009765625,
2132
+ "alpha": 0.26997116277589567,
2133
+ "alpha_r2": 0.8814747074927219,
2134
+ "condition_number": 65.88758850097656,
2135
+ "top_10_sv": [
2136
+ 32.85487747192383,
2137
+ 20.14518165588379,
2138
+ 17.330669403076172,
2139
+ 16.186241149902344,
2140
+ 15.963332176208496,
2141
+ 15.739975929260254,
2142
+ 15.285326957702637,
2143
+ 14.80821418762207,
2144
+ 14.750692367553711,
2145
+ 14.634135246276855
2146
+ ]
2147
+ },
2148
+ "transformer.h.18.attn.c_attn.weight": {
2149
+ "shape": [
2150
+ 1024,
2151
+ 3072
2152
+ ],
2153
+ "effective_rank": 946.826171875,
2154
+ "stable_rank": 142.5345458984375,
2155
+ "spectral_norm": 15.78872013092041,
2156
+ "frobenius_norm": 188.49810791015625,
2157
+ "mp_bound": 15.980405578724305,
2158
+ "n_above_mp": 0,
2159
+ "n_total": 1024,
2160
+ "signal_ratio": 0.0,
2161
+ "alpha": 0.29059627057789356,
2162
+ "alpha_r2": 0.8667678175025767,
2163
+ "condition_number": 8.09025764465332,
2164
+ "top_10_sv": [
2165
+ 15.78872013092041,
2166
+ 13.945246696472168,
2167
+ 13.766387939453125,
2168
+ 13.405774116516113,
2169
+ 13.049812316894531,
2170
+ 12.528767585754395,
2171
+ 12.479240417480469,
2172
+ 12.379585266113281,
2173
+ 12.237228393554688,
2174
+ 12.036273956298828
2175
+ ]
2176
+ },
2177
+ "transformer.h.18.attn.c_proj.weight": {
2178
+ "shape": [
2179
+ 1024,
2180
+ 1024
2181
+ ],
2182
+ "effective_rank": 786.9091796875,
2183
+ "stable_rank": 25.3469181060791,
2184
+ "spectral_norm": 21.475971221923828,
2185
+ "frobenius_norm": 108.12232971191406,
2186
+ "mp_bound": 4.413455486297607,
2187
+ "n_above_mp": 222,
2188
+ "n_total": 1024,
2189
+ "signal_ratio": 0.216796875,
2190
+ "alpha": 0.478483490126483,
2191
+ "alpha_r2": 0.7161237347700887,
2192
+ "condition_number": 69973.6328125,
2193
+ "top_10_sv": [
2194
+ 21.475971221923828,
2195
+ 8.631346702575684,
2196
+ 8.211710929870605,
2197
+ 8.147095680236816,
2198
+ 7.6618499755859375,
2199
+ 7.513932228088379,
2200
+ 7.377252101898193,
2201
+ 7.279123783111572,
2202
+ 7.242250919342041,
2203
+ 7.182393550872803
2204
+ ]
2205
+ },
2206
+ "transformer.h.18.mlp.c_fc.weight": {
2207
+ "shape": [
2208
+ 1024,
2209
+ 4096
2210
+ ],
2211
+ "effective_rank": 944.5552978515625,
2212
+ "stable_rank": 33.118221282958984,
2213
+ "spectral_norm": 37.25223159790039,
2214
+ "frobenius_norm": 214.38075256347656,
2215
+ "mp_bound": 19.486257076263428,
2216
+ "n_above_mp": 1,
2217
+ "n_total": 1024,
2218
+ "signal_ratio": 0.0009765625,
2219
+ "alpha": 0.289183381623581,
2220
+ "alpha_r2": 0.9070325638595973,
2221
+ "condition_number": 98.16948699951172,
2222
+ "top_10_sv": [
2223
+ 37.25223159790039,
2224
+ 15.790482521057129,
2225
+ 15.473872184753418,
2226
+ 15.28934097290039,
2227
+ 15.085716247558594,
2228
+ 14.840494155883789,
2229
+ 14.610716819763184,
2230
+ 14.552131652832031,
2231
+ 14.460594177246094,
2232
+ 14.226823806762695
2233
+ ]
2234
+ },
2235
+ "transformer.h.18.mlp.c_proj.weight": {
2236
+ "shape": [
2237
+ 4096,
2238
+ 1024
2239
+ ],
2240
+ "effective_rank": 955.5623168945312,
2241
+ "stable_rank": 62.169673919677734,
2242
+ "spectral_norm": 32.66622543334961,
2243
+ "frobenius_norm": 257.5658264160156,
2244
+ "mp_bound": 24.027284145355225,
2245
+ "n_above_mp": 1,
2246
+ "n_total": 1024,
2247
+ "signal_ratio": 0.0009765625,
2248
+ "alpha": 0.2709203055980388,
2249
+ "alpha_r2": 0.8864060504096509,
2250
+ "condition_number": 42.17201232910156,
2251
+ "top_10_sv": [
2252
+ 32.66622543334961,
2253
+ 21.241456985473633,
2254
+ 18.316198348999023,
2255
+ 18.020597457885742,
2256
+ 17.09003257751465,
2257
+ 16.6181583404541,
2258
+ 16.53766632080078,
2259
+ 16.19798469543457,
2260
+ 15.890321731567383,
2261
+ 15.60676383972168
2262
+ ]
2263
+ },
2264
+ "transformer.h.19.attn.c_attn.weight": {
2265
+ "shape": [
2266
+ 1024,
2267
+ 3072
2268
+ ],
2269
+ "effective_rank": 942.326171875,
2270
+ "stable_rank": 155.50115966796875,
2271
+ "spectral_norm": 14.992883682250977,
2272
+ "frobenius_norm": 186.96141052246094,
2273
+ "mp_bound": 15.57602523915173,
2274
+ "n_above_mp": 0,
2275
+ "n_total": 1024,
2276
+ "signal_ratio": 0.0,
2277
+ "alpha": 0.2982009204884547,
2278
+ "alpha_r2": 0.8572427405933445,
2279
+ "condition_number": 7.912736892700195,
2280
+ "top_10_sv": [
2281
+ 14.992883682250977,
2282
+ 14.037845611572266,
2283
+ 13.398187637329102,
2284
+ 13.230586051940918,
2285
+ 12.737166404724121,
2286
+ 12.300458908081055,
2287
+ 12.292224884033203,
2288
+ 12.19207763671875,
2289
+ 12.00888729095459,
2290
+ 11.759111404418945
2291
+ ]
2292
+ },
2293
+ "transformer.h.19.attn.c_proj.weight": {
2294
+ "shape": [
2295
+ 1024,
2296
+ 1024
2297
+ ],
2298
+ "effective_rank": 807.1912231445312,
2299
+ "stable_rank": 17.76304054260254,
2300
+ "spectral_norm": 27.394882202148438,
2301
+ "frobenius_norm": 115.45907592773438,
2302
+ "mp_bound": 5.350113868713379,
2303
+ "n_above_mp": 135,
2304
+ "n_total": 1024,
2305
+ "signal_ratio": 0.1318359375,
2306
+ "alpha": 0.4361821075722079,
2307
+ "alpha_r2": 0.7264706510936031,
2308
+ "condition_number": 15827.857421875,
2309
+ "top_10_sv": [
2310
+ 27.394882202148438,
2311
+ 9.131587028503418,
2312
+ 8.159581184387207,
2313
+ 7.967867374420166,
2314
+ 7.304231643676758,
2315
+ 7.250730037689209,
2316
+ 7.2139668464660645,
2317
+ 7.170718669891357,
2318
+ 7.162339687347412,
2319
+ 7.095859050750732
2320
+ ]
2321
+ },
2322
+ "transformer.h.19.mlp.c_fc.weight": {
2323
+ "shape": [
2324
+ 1024,
2325
+ 4096
2326
+ ],
2327
+ "effective_rank": 946.293212890625,
2328
+ "stable_rank": 33.909671783447266,
2329
+ "spectral_norm": 36.853206634521484,
2330
+ "frobenius_norm": 214.6036376953125,
2331
+ "mp_bound": 19.577980041503906,
2332
+ "n_above_mp": 1,
2333
+ "n_total": 1024,
2334
+ "signal_ratio": 0.0009765625,
2335
+ "alpha": 0.2855408853304053,
2336
+ "alpha_r2": 0.9022405306041533,
2337
+ "condition_number": 91.69256591796875,
2338
+ "top_10_sv": [
2339
+ 36.853206634521484,
2340
+ 15.805397033691406,
2341
+ 15.353994369506836,
2342
+ 15.227054595947266,
2343
+ 15.066706657409668,
2344
+ 14.814555168151855,
2345
+ 14.390864372253418,
2346
+ 14.297786712646484,
2347
+ 14.181514739990234,
2348
+ 13.928579330444336
2349
+ ]
2350
+ },
2351
+ "transformer.h.19.mlp.c_proj.weight": {
2352
+ "shape": [
2353
+ 4096,
2354
+ 1024
2355
+ ],
2356
+ "effective_rank": 956.4804077148438,
2357
+ "stable_rank": 52.073970794677734,
2358
+ "spectral_norm": 37.2791862487793,
2359
+ "frobenius_norm": 269.0151672363281,
2360
+ "mp_bound": 25.231703281402588,
2361
+ "n_above_mp": 1,
2362
+ "n_total": 1024,
2363
+ "signal_ratio": 0.0009765625,
2364
+ "alpha": 0.26739284885206765,
2365
+ "alpha_r2": 0.8835308501225437,
2366
+ "condition_number": 45.95317077636719,
2367
+ "top_10_sv": [
2368
+ 37.2791862487793,
2369
+ 20.44887351989746,
2370
+ 19.732032775878906,
2371
+ 18.161226272583008,
2372
+ 17.561620712280273,
2373
+ 17.188261032104492,
2374
+ 17.060258865356445,
2375
+ 16.776437759399414,
2376
+ 16.5764217376709,
2377
+ 16.08405303955078
2378
+ ]
2379
+ },
2380
+ "transformer.h.20.attn.c_attn.weight": {
2381
+ "shape": [
2382
+ 1024,
2383
+ 3072
2384
+ ],
2385
+ "effective_rank": 936.0965576171875,
2386
+ "stable_rank": 156.9998779296875,
2387
+ "spectral_norm": 14.947098731994629,
2388
+ "frobenius_norm": 187.28652954101562,
2389
+ "mp_bound": 15.20929343045714,
2390
+ "n_above_mp": 0,
2391
+ "n_total": 1024,
2392
+ "signal_ratio": 0.0,
2393
+ "alpha": 0.3120911362337675,
2394
+ "alpha_r2": 0.858786145933246,
2395
+ "condition_number": 8.875840187072754,
2396
+ "top_10_sv": [
2397
+ 14.947098731994629,
2398
+ 14.385692596435547,
2399
+ 13.843692779541016,
2400
+ 13.672863960266113,
2401
+ 13.224756240844727,
2402
+ 12.674160957336426,
2403
+ 12.55836296081543,
2404
+ 12.521924018859863,
2405
+ 12.441327095031738,
2406
+ 11.893940925598145
2407
+ ]
2408
+ },
2409
+ "transformer.h.20.attn.c_proj.weight": {
2410
+ "shape": [
2411
+ 1024,
2412
+ 1024
2413
+ ],
2414
+ "effective_rank": 801.7127685546875,
2415
+ "stable_rank": 16.856487274169922,
2416
+ "spectral_norm": 28.67116355895996,
2417
+ "frobenius_norm": 117.71420288085938,
2418
+ "mp_bound": 5.2284698486328125,
2419
+ "n_above_mp": 163,
2420
+ "n_total": 1024,
2421
+ "signal_ratio": 0.1591796875,
2422
+ "alpha": 0.444035142929919,
2423
+ "alpha_r2": 0.7121527583409499,
2424
+ "condition_number": 31572.4609375,
2425
+ "top_10_sv": [
2426
+ 28.67116355895996,
2427
+ 9.656560897827148,
2428
+ 8.204039573669434,
2429
+ 7.896355152130127,
2430
+ 7.5075554847717285,
2431
+ 7.466111660003662,
2432
+ 7.3787007331848145,
2433
+ 7.314346790313721,
2434
+ 7.256669044494629,
2435
+ 7.230027675628662
2436
+ ]
2437
+ },
2438
+ "transformer.h.20.mlp.c_fc.weight": {
2439
+ "shape": [
2440
+ 1024,
2441
+ 4096
2442
+ ],
2443
+ "effective_rank": 948.0650024414062,
2444
+ "stable_rank": 35.34494400024414,
2445
+ "spectral_norm": 36.2116813659668,
2446
+ "frobenius_norm": 215.2843017578125,
2447
+ "mp_bound": 19.70071792602539,
2448
+ "n_above_mp": 1,
2449
+ "n_total": 1024,
2450
+ "signal_ratio": 0.0009765625,
2451
+ "alpha": 0.2835115515690549,
2452
+ "alpha_r2": 0.8999757829277988,
2453
+ "condition_number": 68.69733428955078,
2454
+ "top_10_sv": [
2455
+ 36.2116813659668,
2456
+ 15.233745574951172,
2457
+ 15.040852546691895,
2458
+ 15.021078109741211,
2459
+ 14.903677940368652,
2460
+ 14.672463417053223,
2461
+ 14.484726905822754,
2462
+ 14.234298706054688,
2463
+ 13.947386741638184,
2464
+ 13.761749267578125
2465
+ ]
2466
+ },
2467
+ "transformer.h.20.mlp.c_proj.weight": {
2468
+ "shape": [
2469
+ 4096,
2470
+ 1024
2471
+ ],
2472
+ "effective_rank": 957.45361328125,
2473
+ "stable_rank": 53.200496673583984,
2474
+ "spectral_norm": 38.38079071044922,
2475
+ "frobenius_norm": 279.94439697265625,
2476
+ "mp_bound": 26.34588861465454,
2477
+ "n_above_mp": 1,
2478
+ "n_total": 1024,
2479
+ "signal_ratio": 0.0009765625,
2480
+ "alpha": 0.26655693080921455,
2481
+ "alpha_r2": 0.8858868317060345,
2482
+ "condition_number": 45.9372444152832,
2483
+ "top_10_sv": [
2484
+ 38.38079071044922,
2485
+ 20.71181869506836,
2486
+ 19.3667049407959,
2487
+ 19.097551345825195,
2488
+ 18.877281188964844,
2489
+ 18.161039352416992,
2490
+ 17.817678451538086,
2491
+ 17.619935989379883,
2492
+ 17.085142135620117,
2493
+ 17.002307891845703
2494
+ ]
2495
+ },
2496
+ "transformer.h.21.attn.c_attn.weight": {
2497
+ "shape": [
2498
+ 1024,
2499
+ 3072
2500
+ ],
2501
+ "effective_rank": 941.8446044921875,
2502
+ "stable_rank": 150.69029235839844,
2503
+ "spectral_norm": 15.391748428344727,
2504
+ "frobenius_norm": 188.94290161132812,
2505
+ "mp_bound": 15.802790776950205,
2506
+ "n_above_mp": 0,
2507
+ "n_total": 1024,
2508
+ "signal_ratio": 0.0,
2509
+ "alpha": 0.3000971543853719,
2510
+ "alpha_r2": 0.8719087171149849,
2511
+ "condition_number": 11.076602935791016,
2512
+ "top_10_sv": [
2513
+ 15.391748428344727,
2514
+ 14.966634750366211,
2515
+ 14.548294067382812,
2516
+ 13.83862018585205,
2517
+ 13.741484642028809,
2518
+ 13.272273063659668,
2519
+ 13.161110877990723,
2520
+ 12.848255157470703,
2521
+ 12.513276100158691,
2522
+ 12.309864044189453
2523
+ ]
2524
+ },
2525
+ "transformer.h.21.attn.c_proj.weight": {
2526
+ "shape": [
2527
+ 1024,
2528
+ 1024
2529
+ ],
2530
+ "effective_rank": 802.9078979492188,
2531
+ "stable_rank": 10.218631744384766,
2532
+ "spectral_norm": 38.61031723022461,
2533
+ "frobenius_norm": 123.42403411865234,
2534
+ "mp_bound": 5.600226402282715,
2535
+ "n_above_mp": 132,
2536
+ "n_total": 1024,
2537
+ "signal_ratio": 0.12890625,
2538
+ "alpha": 0.43585709299085673,
2539
+ "alpha_r2": 0.7264214720046629,
2540
+ "condition_number": 16619.98828125,
2541
+ "top_10_sv": [
2542
+ 38.61031723022461,
2543
+ 12.657886505126953,
2544
+ 9.889753341674805,
2545
+ 9.055880546569824,
2546
+ 8.717723846435547,
2547
+ 7.8169941902160645,
2548
+ 7.405618190765381,
2549
+ 7.38328742980957,
2550
+ 7.281287670135498,
2551
+ 7.214484691619873
2552
+ ]
2553
+ },
2554
+ "transformer.h.21.mlp.c_fc.weight": {
2555
+ "shape": [
2556
+ 1024,
2557
+ 4096
2558
+ ],
2559
+ "effective_rank": 947.9262084960938,
2560
+ "stable_rank": 36.02553176879883,
2561
+ "spectral_norm": 35.89358139038086,
2562
+ "frobenius_norm": 215.43785095214844,
2563
+ "mp_bound": 19.718321800231934,
2564
+ "n_above_mp": 1,
2565
+ "n_total": 1024,
2566
+ "signal_ratio": 0.0009765625,
2567
+ "alpha": 0.2838725389249319,
2568
+ "alpha_r2": 0.8980818942031933,
2569
+ "condition_number": 72.88961029052734,
2570
+ "top_10_sv": [
2571
+ 35.89358139038086,
2572
+ 15.480804443359375,
2573
+ 14.935028076171875,
2574
+ 14.85132122039795,
2575
+ 14.700712203979492,
2576
+ 14.630121231079102,
2577
+ 14.243003845214844,
2578
+ 14.099127769470215,
2579
+ 13.748274803161621,
2580
+ 13.597418785095215
2581
+ ]
2582
+ },
2583
+ "transformer.h.21.mlp.c_proj.weight": {
2584
+ "shape": [
2585
+ 4096,
2586
+ 1024
2587
+ ],
2588
+ "effective_rank": 957.150146484375,
2589
+ "stable_rank": 40.031089782714844,
2590
+ "spectral_norm": 45.864524841308594,
2591
+ "frobenius_norm": 290.1854248046875,
2592
+ "mp_bound": 27.241085529327393,
2593
+ "n_above_mp": 2,
2594
+ "n_total": 1024,
2595
+ "signal_ratio": 0.001953125,
2596
+ "alpha": 0.26491965004813967,
2597
+ "alpha_r2": 0.8792307905237831,
2598
+ "condition_number": 55.03404998779297,
2599
+ "top_10_sv": [
2600
+ 45.864524841308594,
2601
+ 28.97075080871582,
2602
+ 19.798044204711914,
2603
+ 18.97209358215332,
2604
+ 18.70978355407715,
2605
+ 18.018537521362305,
2606
+ 17.47182273864746,
2607
+ 17.274417877197266,
2608
+ 16.987682342529297,
2609
+ 16.63946533203125
2610
+ ]
2611
+ },
2612
+ "transformer.h.22.attn.c_attn.weight": {
2613
+ "shape": [
2614
+ 1024,
2615
+ 3072
2616
+ ],
2617
+ "effective_rank": 910.2899780273438,
2618
+ "stable_rank": 125.6297836303711,
2619
+ "spectral_norm": 16.882688522338867,
2620
+ "frobenius_norm": 189.22909545898438,
2621
+ "mp_bound": 14.07356805741784,
2622
+ "n_above_mp": 5,
2623
+ "n_total": 1024,
2624
+ "signal_ratio": 0.0048828125,
2625
+ "alpha": 0.3631109400306433,
2626
+ "alpha_r2": 0.8785417413420795,
2627
+ "condition_number": 18.38054656982422,
2628
+ "top_10_sv": [
2629
+ 16.882688522338867,
2630
+ 15.70670223236084,
2631
+ 15.6824312210083,
2632
+ 14.943181037902832,
2633
+ 14.69243335723877,
2634
+ 13.932169914245605,
2635
+ 13.615700721740723,
2636
+ 13.460670471191406,
2637
+ 13.248589515686035,
2638
+ 12.95559024810791
2639
+ ]
2640
+ },
2641
+ "transformer.h.22.attn.c_proj.weight": {
2642
+ "shape": [
2643
+ 1024,
2644
+ 1024
2645
+ ],
2646
+ "effective_rank": 791.9282836914062,
2647
+ "stable_rank": 7.863592624664307,
2648
+ "spectral_norm": 47.40917205810547,
2649
+ "frobenius_norm": 132.9452667236328,
2650
+ "mp_bound": 5.678843975067139,
2651
+ "n_above_mp": 149,
2652
+ "n_total": 1024,
2653
+ "signal_ratio": 0.1455078125,
2654
+ "alpha": 0.4420612606508839,
2655
+ "alpha_r2": 0.7218298086729467,
2656
+ "condition_number": 11620.599609375,
2657
+ "top_10_sv": [
2658
+ 47.40917205810547,
2659
+ 18.899751663208008,
2660
+ 13.706535339355469,
2661
+ 12.33392333984375,
2662
+ 10.662578582763672,
2663
+ 8.17455768585205,
2664
+ 7.832640171051025,
2665
+ 7.784000396728516,
2666
+ 7.671928882598877,
2667
+ 7.559737682342529
2668
+ ]
2669
+ },
2670
+ "transformer.h.22.mlp.c_fc.weight": {
2671
+ "shape": [
2672
+ 1024,
2673
+ 4096
2674
+ ],
2675
+ "effective_rank": 947.9542846679688,
2676
+ "stable_rank": 35.02265548706055,
2677
+ "spectral_norm": 36.702762603759766,
2678
+ "frobenius_norm": 217.2067413330078,
2679
+ "mp_bound": 19.8194797039032,
2680
+ "n_above_mp": 1,
2681
+ "n_total": 1024,
2682
+ "signal_ratio": 0.0009765625,
2683
+ "alpha": 0.28619079431362643,
2684
+ "alpha_r2": 0.9002755587642126,
2685
+ "condition_number": 57.37492752075195,
2686
+ "top_10_sv": [
2687
+ 36.702762603759766,
2688
+ 15.816225051879883,
2689
+ 15.100847244262695,
2690
+ 14.771247863769531,
2691
+ 14.534582138061523,
2692
+ 14.363577842712402,
2693
+ 14.114562034606934,
2694
+ 14.111546516418457,
2695
+ 13.981919288635254,
2696
+ 13.779252052307129
2697
+ ]
2698
+ },
2699
+ "transformer.h.22.mlp.c_proj.weight": {
2700
+ "shape": [
2701
+ 4096,
2702
+ 1024
2703
+ ],
2704
+ "effective_rank": 947.5176391601562,
2705
+ "stable_rank": 17.500995635986328,
2706
+ "spectral_norm": 73.44800567626953,
2707
+ "frobenius_norm": 307.2637939453125,
2708
+ "mp_bound": 27.96702003479004,
2709
+ "n_above_mp": 3,
2710
+ "n_total": 1024,
2711
+ "signal_ratio": 0.0029296875,
2712
+ "alpha": 0.27600893237986174,
2713
+ "alpha_r2": 0.8907517199504488,
2714
+ "condition_number": 77.1149673461914,
2715
+ "top_10_sv": [
2716
+ 73.44800567626953,
2717
+ 46.17403030395508,
2718
+ 41.175079345703125,
2719
+ 19.932249069213867,
2720
+ 19.340335845947266,
2721
+ 18.94622039794922,
2722
+ 18.65909767150879,
2723
+ 17.994863510131836,
2724
+ 17.704139709472656,
2725
+ 17.6275577545166
2726
+ ]
2727
+ },
2728
+ "transformer.h.23.attn.c_attn.weight": {
2729
+ "shape": [
2730
+ 1024,
2731
+ 3072
2732
+ ],
2733
+ "effective_rank": 920.36767578125,
2734
+ "stable_rank": 83.51404571533203,
2735
+ "spectral_norm": 20.953210830688477,
2736
+ "frobenius_norm": 191.48306274414062,
2737
+ "mp_bound": 14.838865915472999,
2738
+ "n_above_mp": 6,
2739
+ "n_total": 1024,
2740
+ "signal_ratio": 0.005859375,
2741
+ "alpha": 0.347956924436965,
2742
+ "alpha_r2": 0.9030461026469062,
2743
+ "condition_number": 30.88874626159668,
2744
+ "top_10_sv": [
2745
+ 20.953210830688477,
2746
+ 18.181076049804688,
2747
+ 16.9401798248291,
2748
+ 16.4212703704834,
2749
+ 16.156402587890625,
2750
+ 15.156576156616211,
2751
+ 14.803506851196289,
2752
+ 14.501842498779297,
2753
+ 14.308381080627441,
2754
+ 14.216387748718262
2755
+ ]
2756
+ },
2757
+ "transformer.h.23.attn.c_proj.weight": {
2758
+ "shape": [
2759
+ 1024,
2760
+ 1024
2761
+ ],
2762
+ "effective_rank": 725.8294067382812,
2763
+ "stable_rank": 7.097742080688477,
2764
+ "spectral_norm": 54.51199722290039,
2765
+ "frobenius_norm": 145.22862243652344,
2766
+ "mp_bound": 4.808560848236084,
2767
+ "n_above_mp": 205,
2768
+ "n_total": 1024,
2769
+ "signal_ratio": 0.2001953125,
2770
+ "alpha": 0.5165891829463887,
2771
+ "alpha_r2": 0.7805439618628243,
2772
+ "condition_number": 81928.5625,
2773
+ "top_10_sv": [
2774
+ 54.51199722290039,
2775
+ 51.59781265258789,
2776
+ 31.993318557739258,
2777
+ 28.781885147094727,
2778
+ 20.638469696044922,
2779
+ 19.69414520263672,
2780
+ 17.257286071777344,
2781
+ 11.122438430786133,
2782
+ 9.82822036743164,
2783
+ 8.54098129272461
2784
+ ]
2785
+ },
2786
+ "transformer.h.23.mlp.c_fc.weight": {
2787
+ "shape": [
2788
+ 1024,
2789
+ 4096
2790
+ ],
2791
+ "effective_rank": 949.3385009765625,
2792
+ "stable_rank": 32.35031509399414,
2793
+ "spectral_norm": 39.07307052612305,
2794
+ "frobenius_norm": 222.2372283935547,
2795
+ "mp_bound": 20.35976243019104,
2796
+ "n_above_mp": 2,
2797
+ "n_total": 1024,
2798
+ "signal_ratio": 0.001953125,
2799
+ "alpha": 0.2880342497839811,
2800
+ "alpha_r2": 0.9166713464789396,
2801
+ "condition_number": 156.26080322265625,
2802
+ "top_10_sv": [
2803
+ 39.07307052612305,
2804
+ 21.5715389251709,
2805
+ 18.421354293823242,
2806
+ 18.168561935424805,
2807
+ 16.698549270629883,
2808
+ 16.427534103393555,
2809
+ 15.287932395935059,
2810
+ 14.903624534606934,
2811
+ 14.39078140258789,
2812
+ 14.156229019165039
2813
+ ]
2814
+ },
2815
+ "transformer.h.23.mlp.c_proj.weight": {
2816
+ "shape": [
2817
+ 4096,
2818
+ 1024
2819
+ ],
2820
+ "effective_rank": 952.0796508789062,
2821
+ "stable_rank": 23.627531051635742,
2822
+ "spectral_norm": 65.28954315185547,
2823
+ "frobenius_norm": 317.3604431152344,
2824
+ "mp_bound": 29.342853784561157,
2825
+ "n_above_mp": 3,
2826
+ "n_total": 1024,
2827
+ "signal_ratio": 0.0029296875,
2828
+ "alpha": 0.2737939337158801,
2829
+ "alpha_r2": 0.9043915028204278,
2830
+ "condition_number": 46.49357223510742,
2831
+ "top_10_sv": [
2832
+ 65.28954315185547,
2833
+ 45.984188079833984,
2834
+ 36.61665725708008,
2835
+ 27.076080322265625,
2836
+ 23.387649536132812,
2837
+ 21.129486083984375,
2838
+ 20.738719940185547,
2839
+ 20.499759674072266,
2840
+ 20.096046447753906,
2841
+ 19.36311912536621
2842
+ ]
2843
+ }
2844
+ }
scripts/spectral_output/prisma/weight_spectra_attention_K.png ADDED

Git LFS Details

  • SHA256: 4362c4c591649646c7ee371d716cc5019ad928842fd8477ac8fefada00d7dc38
  • Pointer size: 131 Bytes
  • Size of remote file: 280 kB
scripts/spectral_output/prisma/weight_spectra_attention_O.png ADDED

Git LFS Details

  • SHA256: 793409603d5bef8de97ecfaf7b41926a63d19a2b714f656145947ce2e25bcac1
  • Pointer size: 131 Bytes
  • Size of remote file: 197 kB
scripts/spectral_output/prisma/weight_spectra_attention_Q.png ADDED

Git LFS Details

  • SHA256: 1a4095e10d24664eb401edcf4f3217b1bcdfbf3a935e8b0011998a2a76d1e077
  • Pointer size: 131 Bytes
  • Size of remote file: 179 kB
scripts/spectral_output/prisma/weight_spectra_attention_V.png ADDED

Git LFS Details

  • SHA256: eecac39bf1fc98c000973ec3a9d308518a247e175f18c178dc09e633b57bc0bf
  • Pointer size: 131 Bytes
  • Size of remote file: 336 kB
scripts/spectral_output/prisma/weight_spectra_embedding.png ADDED

Git LFS Details

  • SHA256: d7b76c13f4b59fda5ef2fc897b2cbcf5ba306aa64642e3feb9d2c25965f0e45d
  • Pointer size: 130 Bytes
  • Size of remote file: 73.8 kB
scripts/spectral_output/prisma/weight_spectra_ffn_W1.png ADDED

Git LFS Details

  • SHA256: 80fc8e1baedbd8b12101cc2a77bb465764dd06637cf87bb6602c84a757cf34d4
  • Pointer size: 131 Bytes
  • Size of remote file: 157 kB
scripts/spectral_output/prisma/weight_spectra_ffn_W2.png ADDED

Git LFS Details

  • SHA256: fd787b3f628cf52884610c579913133866ba1409648f8436540faa830a67ec24
  • Pointer size: 131 Bytes
  • Size of remote file: 253 kB
scripts/spectral_output/prisma/weight_spectra_ffn_gate_W3.png ADDED

Git LFS Details

  • SHA256: e2c6437c3ee0966357dfed09708145777637b9eff706e12c8a53e419a8f028ae
  • Pointer size: 131 Bytes
  • Size of remote file: 154 kB
scripts/spectral_output/prisma/weight_spectra_ffn_gate_W4.png ADDED

Git LFS Details

  • SHA256: a4dfd384233ac5739e28780e6f97ecc5239edb9360b83ed08fbe9e56d8045c8a
  • Pointer size: 131 Bytes
  • Size of remote file: 158 kB
scripts/spectral_output/prisma/weight_spectra_other.png ADDED

Git LFS Details

  • SHA256: 8a5dac4541f470dddf6f2786c9454dec23b57ffc487cfe561d9d74c934e84af9
  • Pointer size: 130 Bytes
  • Size of remote file: 71.8 kB