AbstractPhil commited on
Commit
1b9a2c2
Β·
verified Β·
1 Parent(s): 1d62b3b

Create cell_4_proper_experiment_3.txt

Browse files
Files changed (1) hide show
  1. cell_4_proper_experiment_3.txt +260 -0
cell_4_proper_experiment_3.txt ADDED
@@ -0,0 +1,260 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ Loading Freckles v40 + CIFAR-10...
2
+
3
+ ======================================================================
4
+ 1. FULL ROUND-TRIP β€” Per-patch reconstruction error
5
+ ======================================================================
6
+
7
+ Collecting per-patch reconstruction errors...
8
+
9
+
10
+ Reconstructing: 0%| | 0/157 [00:00<?, ?it/s]
11
+ Reconstructing: 3%|β–Ž | 4/157 [00:00<00:03, 38.85it/s]
12
+ Reconstructing: 8%|β–Š | 13/157 [00:00<00:02, 68.11it/s]
13
+ Reconstructing: 14%|β–ˆβ– | 22/157 [00:00<00:01, 77.91it/s]
14
+ Reconstructing: 20%|β–ˆβ–‰ | 31/157 [00:00<00:01, 81.98it/s]
15
+ Reconstructing: 26%|β–ˆβ–ˆβ–Œ | 41/157 [00:00<00:01, 85.54it/s]
16
+ Reconstructing: 32%|β–ˆβ–ˆβ–ˆβ– | 51/157 [00:00<00:01, 87.75it/s]
17
+ Reconstructing: 39%|β–ˆβ–ˆβ–ˆβ–‰ | 61/157 [00:00<00:01, 89.17it/s]
18
+ Reconstructing: 45%|β–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 71/157 [00:00<00:00, 89.94it/s]
19
+ Reconstructing: 52%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 81/157 [00:00<00:00, 90.52it/s]
20
+ Reconstructing: 58%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 91/157 [00:01<00:00, 90.89it/s]
21
+ Reconstructing: 64%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 101/157 [00:01<00:00, 91.03it/s]
22
+ Reconstructing: 71%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 111/157 [00:01<00:00, 91.05it/s]
23
+ Reconstructing: 77%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 121/157 [00:01<00:00, 91.14it/s]
24
+ Reconstructing: 83%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 131/157 [00:01<00:00, 91.22it/s]
25
+ Reconstructing: 90%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 141/157 [00:01<00:00, 91.26it/s]
26
+ Reconstructing: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 157/157 [00:01<00:00, 86.32it/s]
27
+ Collected 10000 images, 2000 individual maps
28
+
29
+ ======================================================================
30
+ 1a. SPATIAL STRUCTURE β€” Does recon error vary across patches?
31
+ ======================================================================
32
+
33
+ Per-image spatial CV of reconstruction error:
34
+ Mean CV: 0.3972
35
+ Median CV: 0.3989
36
+ Min CV: 0.0865
37
+ Max CV: 0.7128
38
+ VERDICT: HAS SPATIAL STRUCTURE
39
+
40
+ ======================================================================
41
+ 1b. PER-CLASS RECONSTRUCTION ERROR
42
+ ======================================================================
43
+
44
+ Class Mean MSE Std MSE Max patch
45
+ ------------------------------------------
46
+ airplane 0.000000 0.000000 0.000000
47
+ auto 0.000000 0.000000 0.000000
48
+ bird 0.000000 0.000000 0.000000
49
+ cat 0.000000 0.000000 0.000000
50
+ deer 0.000000 0.000000 0.000000
51
+ dog 0.000000 0.000000 0.000000
52
+ frog 0.000000 0.000000 0.000000
53
+ horse 0.000000 0.000000 0.000000
54
+ ship 0.000000 0.000000 0.000000
55
+ truck 0.000000 0.000000 0.000000
56
+
57
+ Mean inter-class cosine similarity: 0.996998
58
+ Min inter-class cosine similarity: 0.991408
59
+ VERDICT: SIMILAR PATTERNS
60
+
61
+ ======================================================================
62
+ 2. CENTER vs EDGE β€” Where does reconstruction fail?
63
+ ======================================================================
64
+
65
+ Class Center Edge Corner E/C ratio
66
+ ------------------------------------------------
67
+ airplane 0.000000 0.000000 0.000000 0.9007
68
+ auto 0.000000 0.000000 0.000000 0.9717
69
+ bird 0.000000 0.000000 0.000000 0.9379
70
+ cat 0.000000 0.000000 0.000000 0.9448
71
+ deer 0.000000 0.000000 0.000000 0.9685
72
+ dog 0.000000 0.000000 0.000000 1.0470
73
+ frog 0.000000 0.000000 0.000000 0.9538
74
+ horse 0.000000 0.000000 0.000000 0.9497
75
+ ship 0.000000 0.000000 0.000000 1.0124
76
+ truck 0.000000 0.000000 0.000000 0.9136
77
+
78
+ ======================================================================
79
+ 3. PER-MODE RECONSTRUCTION β€” Ablating SVD modes
80
+ ======================================================================
81
+
82
+ Reconstructing with individual modes...
83
+
84
+ Per-mode energy fraction (how much each mode contributes):
85
+
86
+ Class Mode0 Mode1 Mode2 Mode3 FullMSE
87
+ --------------------------------------------------
88
+ airplane 0.4242 0.3352 0.1705 0.0701 0.000000
89
+ auto 0.4234 0.3359 0.1704 0.0703 0.000000
90
+ bird 0.4237 0.3361 0.1700 0.0703 0.000000
91
+ cat 0.4232 0.3363 0.1701 0.0704 0.000000
92
+ deer 0.4236 0.3363 0.1700 0.0701 0.000000
93
+ dog 0.4238 0.3358 0.1703 0.0701 0.000000
94
+ frog 0.4229 0.3367 0.1698 0.0706 0.000000
95
+ horse 0.4237 0.3358 0.1703 0.0702 0.000000
96
+ ship 0.4243 0.3353 0.1706 0.0699 0.000000
97
+ truck 0.4237 0.3358 0.1704 0.0701 0.000000
98
+
99
+ ======================================================================
100
+ 4. LINEAR PROBE β€” Reconstruction error maps as features
101
+ ======================================================================
102
+
103
+ Ridge probe comparison:
104
+
105
+ Recon error spatial map dims= 256 train=51.6% test=19.0%
106
+ Class Acc
107
+ ------------------
108
+ airplane 25.9% β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ
109
+ auto 19.4% β–ˆβ–ˆβ–ˆ
110
+ bird 9.1% β–ˆ
111
+ cat 12.2% β–ˆβ–ˆ
112
+ deer 7.9% β–ˆ
113
+ dog 31.4% β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ
114
+ frog 22.7% β–ˆβ–ˆβ–ˆβ–ˆ
115
+ horse 14.3% β–ˆβ–ˆ
116
+ ship 36.6% β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ
117
+ truck 15.6% β–ˆβ–ˆβ–ˆ
118
+
119
+ ======================================================================
120
+ 5. FULL CONDUIT β€” Release error + eigenvalues + friction
121
+ ======================================================================
122
+
123
+ Full conduit: 0%| | 0/157 [00:00<?, ?it/s]
124
+ Full conduit: 2%|▏ | 3/157 [00:00<00:06, 25.60it/s]
125
+ Full conduit: 6%|β–Œ | 9/157 [00:00<00:03, 42.69it/s]
126
+ Full conduit: 10%|β–‰ | 15/157 [00:00<00:02, 48.80it/s]
127
+ Full conduit: 13%|β–ˆβ–Ž | 21/157 [00:00<00:02, 51.60it/s]
128
+ Full conduit: 20%|β–ˆβ–ˆ | 32/157 [00:00<00:02, 47.18it/s]
129
+
130
+ Comparative linear probes:
131
+
132
+ Release error only dims= 256 train=51.6% test=16.0%
133
+ Class Acc
134
+ ------------------
135
+ airplane 27.8% β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ
136
+ auto 18.2% β–ˆβ–ˆβ–ˆ
137
+ bird 14.6% β–ˆβ–ˆ
138
+ cat 5.4% β–ˆ
139
+ deer 8.3% β–ˆ
140
+ dog 21.6% β–ˆβ–ˆβ–ˆβ–ˆ
141
+ frog 18.9% β–ˆβ–ˆβ–ˆ
142
+ horse 17.1% β–ˆβ–ˆβ–ˆ
143
+ ship 11.6% β–ˆβ–ˆ
144
+ truck 20.0% β–ˆβ–ˆβ–ˆβ–ˆ
145
+
146
+ Eigenvalues (S) only dims= 1024 train=91.6% test=20.7%
147
+ Class Acc
148
+ ------------------
149
+ airplane 16.7% β–ˆβ–ˆβ–ˆ
150
+ auto 24.2% β–ˆβ–ˆβ–ˆβ–ˆ
151
+ bird 18.8% β–ˆβ–ˆβ–ˆ
152
+ cat 21.6% β–ˆβ–ˆβ–ˆβ–ˆ
153
+ deer 14.6% β–ˆβ–ˆ
154
+ dog 16.2% β–ˆβ–ˆβ–ˆ
155
+ frog 24.3% β–ˆβ–ˆβ–ˆβ–ˆ
156
+ horse 17.1% β–ˆβ–ˆβ–ˆ
157
+ ship 32.6% β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ
158
+ truck 22.5% β–ˆβ–ˆβ–ˆβ–ˆ
159
+
160
+ Friction only dims= 1024 train=92.4% test=22.5%
161
+ Class Acc
162
+ ------------------
163
+ airplane 19.4% β–ˆβ–ˆβ–ˆ
164
+ auto 21.2% β–ˆβ–ˆβ–ˆβ–ˆ
165
+ bird 18.8% β–ˆβ–ˆβ–ˆ
166
+ cat 21.6% β–ˆβ–ˆβ–ˆβ–ˆ
167
+ deer 14.6% β–ˆβ–ˆ
168
+ dog 16.2% β–ˆβ–ˆβ–ˆ
169
+ frog 27.0% β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ
170
+ horse 22.0% β–ˆβ–ˆβ–ˆβ–ˆ
171
+ ship 34.9% β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ
172
+ truck 30.0% β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ
173
+
174
+ Combinations:
175
+
176
+ Release + Eigenvalues dims= 1280 train=98.4% test=18.5%
177
+ Class Acc
178
+ ------------------
179
+ airplane 2.8%
180
+ auto 21.2% β–ˆβ–ˆβ–ˆβ–ˆ
181
+ bird 18.8% β–ˆβ–ˆβ–ˆ
182
+ cat 2.7%
183
+ deer 18.8% β–ˆβ–ˆβ–ˆ
184
+ dog 16.2% β–ˆβ–ˆβ–ˆ
185
+ frog 21.6% β–ˆβ–ˆβ–ˆβ–ˆ
186
+ horse 26.8% β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ
187
+ ship 25.6% β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ
188
+ truck 27.5% β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ
189
+ Release + Friction dims= 1280 train=98.3% test=15.3%
190
+ Class Acc
191
+ ------------------
192
+ airplane 8.3% β–ˆ
193
+ auto 18.2% β–ˆβ–ˆβ–ˆ
194
+ bird 16.7% β–ˆβ–ˆβ–ˆ
195
+ cat 5.4% β–ˆ
196
+ deer 12.5% β–ˆβ–ˆ
197
+ dog 16.2% β–ˆβ–ˆβ–ˆ
198
+ frog 13.5% β–ˆβ–ˆ
199
+ horse 19.5% β–ˆβ–ˆβ–ˆ
200
+ ship 23.3% β–ˆβ–ˆβ–ˆβ–ˆ
201
+ truck 17.5% β–ˆβ–ˆβ–ˆ
202
+ Release + Eigenvalues + Friction dims= 2304 train=99.9% test=17.0%
203
+ Class Acc
204
+ ------------------
205
+ airplane 13.9% β–ˆβ–ˆ
206
+ auto 3.0%
207
+ bird 16.7% β–ˆβ–ˆβ–ˆ
208
+ cat 10.8% β–ˆβ–ˆ
209
+ deer 12.5% β–ˆβ–ˆ
210
+ dog 21.6% β–ˆβ–ˆβ–ˆβ–ˆ
211
+ frog 27.0% β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ
212
+ horse 22.0% β–ˆβ–ˆβ–ˆβ–ˆ
213
+ ship 23.3% β–ˆβ–ˆβ–ˆβ–ˆ
214
+ truck 17.5% β–ˆβ–ˆβ–ˆ
215
+
216
+ ======================================================================
217
+ 6. HIGH-ERROR PATCHES β€” Where does reconstruction fail?
218
+ ======================================================================
219
+
220
+ Top error positions per class (patch coordinates):
221
+ Class Top 3 positions (row, col) Error ratio
222
+ ----------------------------------------------------------------
223
+ airplane (10,8), (9,8), (10,7) 1.19x
224
+ auto (11,14), (11,7), (10,1) 1.12x
225
+ bird (8,9), (7,9), (10,8) 1.11x
226
+ cat (5,8), (5,7), (6,5) 1.08x
227
+ deer (5,7), (5,5), (7,6) 1.06x
228
+ dog (15,15), (14,15), (9,15) 1.07x
229
+ frog (5,4), (4,7), (4,6) 1.07x
230
+ horse (10,9), (9,7), (9,9) 1.11x
231
+ ship (14,13), (14,12), (15,9) 1.20x
232
+ truck (10,14), (10,13), (10,1) 1.16x
233
+
234
+ Overall error map:
235
+ Mean: 0.000000
236
+ Std: 0.000000
237
+ Hot patches (>2Οƒ): 0/256
238
+
239
+ ======================================================================
240
+ THEOREM 3: RELEASE FIDELITY β€” SUMMARY
241
+ ======================================================================
242
+
243
+ SPATIAL STRUCTURE:
244
+ Recon error spatial CV: 0.3972
245
+ (Friction spatial CV was: 0.0137)
246
+
247
+ CLASSIFICATION (ridge probe, test accuracy):
248
+ Chance: 10.0%
249
+ Friction maps: 24.3% (from Cell 3)
250
+ Eigenvalue (S) maps: 21.0% (from Cell 3)
251
+ Release error maps: 19.0%
252
+ Release + Eigenvalues: 18.5%
253
+ Release + Friction: 15.3%
254
+ FULL CONDUIT (all three): 17.0%
255
+
256
+ THE QUESTION ANSWERED:
257
+ Does the release signal carry class-discriminative information
258
+ that eigenvalues and friction do not?
259
+ Lift from release over eigenvalues: -4.7pp
260
+ Lift from full conduit over eigenvalues: -3.7pp