geolip-conduit-experiments / cell_4_proper_experiment_3.txt
AbstractPhil's picture
Create cell_4_proper_experiment_3.txt
1b9a2c2 verified
Loading Freckles v40 + CIFAR-10...
======================================================================
1. FULL ROUND-TRIP β€” Per-patch reconstruction error
======================================================================
Collecting per-patch reconstruction errors...
Reconstructing: 0%| | 0/157 [00:00<?, ?it/s]
Reconstructing: 3%|β–Ž | 4/157 [00:00<00:03, 38.85it/s]
Reconstructing: 8%|β–Š | 13/157 [00:00<00:02, 68.11it/s]
Reconstructing: 14%|β–ˆβ– | 22/157 [00:00<00:01, 77.91it/s]
Reconstructing: 20%|β–ˆβ–‰ | 31/157 [00:00<00:01, 81.98it/s]
Reconstructing: 26%|β–ˆβ–ˆβ–Œ | 41/157 [00:00<00:01, 85.54it/s]
Reconstructing: 32%|β–ˆβ–ˆβ–ˆβ– | 51/157 [00:00<00:01, 87.75it/s]
Reconstructing: 39%|β–ˆβ–ˆβ–ˆβ–‰ | 61/157 [00:00<00:01, 89.17it/s]
Reconstructing: 45%|β–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 71/157 [00:00<00:00, 89.94it/s]
Reconstructing: 52%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 81/157 [00:00<00:00, 90.52it/s]
Reconstructing: 58%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 91/157 [00:01<00:00, 90.89it/s]
Reconstructing: 64%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 101/157 [00:01<00:00, 91.03it/s]
Reconstructing: 71%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 111/157 [00:01<00:00, 91.05it/s]
Reconstructing: 77%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 121/157 [00:01<00:00, 91.14it/s]
Reconstructing: 83%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 131/157 [00:01<00:00, 91.22it/s]
Reconstructing: 90%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 141/157 [00:01<00:00, 91.26it/s]
Reconstructing: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 157/157 [00:01<00:00, 86.32it/s]
Collected 10000 images, 2000 individual maps
======================================================================
1a. SPATIAL STRUCTURE β€” Does recon error vary across patches?
======================================================================
Per-image spatial CV of reconstruction error:
Mean CV: 0.3972
Median CV: 0.3989
Min CV: 0.0865
Max CV: 0.7128
VERDICT: HAS SPATIAL STRUCTURE
======================================================================
1b. PER-CLASS RECONSTRUCTION ERROR
======================================================================
Class Mean MSE Std MSE Max patch
------------------------------------------
airplane 0.000000 0.000000 0.000000
auto 0.000000 0.000000 0.000000
bird 0.000000 0.000000 0.000000
cat 0.000000 0.000000 0.000000
deer 0.000000 0.000000 0.000000
dog 0.000000 0.000000 0.000000
frog 0.000000 0.000000 0.000000
horse 0.000000 0.000000 0.000000
ship 0.000000 0.000000 0.000000
truck 0.000000 0.000000 0.000000
Mean inter-class cosine similarity: 0.996998
Min inter-class cosine similarity: 0.991408
VERDICT: SIMILAR PATTERNS
======================================================================
2. CENTER vs EDGE β€” Where does reconstruction fail?
======================================================================
Class Center Edge Corner E/C ratio
------------------------------------------------
airplane 0.000000 0.000000 0.000000 0.9007
auto 0.000000 0.000000 0.000000 0.9717
bird 0.000000 0.000000 0.000000 0.9379
cat 0.000000 0.000000 0.000000 0.9448
deer 0.000000 0.000000 0.000000 0.9685
dog 0.000000 0.000000 0.000000 1.0470
frog 0.000000 0.000000 0.000000 0.9538
horse 0.000000 0.000000 0.000000 0.9497
ship 0.000000 0.000000 0.000000 1.0124
truck 0.000000 0.000000 0.000000 0.9136
======================================================================
3. PER-MODE RECONSTRUCTION β€” Ablating SVD modes
======================================================================
Reconstructing with individual modes...
Per-mode energy fraction (how much each mode contributes):
Class Mode0 Mode1 Mode2 Mode3 FullMSE
--------------------------------------------------
airplane 0.4242 0.3352 0.1705 0.0701 0.000000
auto 0.4234 0.3359 0.1704 0.0703 0.000000
bird 0.4237 0.3361 0.1700 0.0703 0.000000
cat 0.4232 0.3363 0.1701 0.0704 0.000000
deer 0.4236 0.3363 0.1700 0.0701 0.000000
dog 0.4238 0.3358 0.1703 0.0701 0.000000
frog 0.4229 0.3367 0.1698 0.0706 0.000000
horse 0.4237 0.3358 0.1703 0.0702 0.000000
ship 0.4243 0.3353 0.1706 0.0699 0.000000
truck 0.4237 0.3358 0.1704 0.0701 0.000000
======================================================================
4. LINEAR PROBE β€” Reconstruction error maps as features
======================================================================
Ridge probe comparison:
Recon error spatial map dims= 256 train=51.6% test=19.0%
Class Acc
------------------
airplane 25.9% β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ
auto 19.4% β–ˆβ–ˆβ–ˆ
bird 9.1% β–ˆ
cat 12.2% β–ˆβ–ˆ
deer 7.9% β–ˆ
dog 31.4% β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ
frog 22.7% β–ˆβ–ˆβ–ˆβ–ˆ
horse 14.3% β–ˆβ–ˆ
ship 36.6% β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ
truck 15.6% β–ˆβ–ˆβ–ˆ
======================================================================
5. FULL CONDUIT β€” Release error + eigenvalues + friction
======================================================================
Full conduit: 0%| | 0/157 [00:00<?, ?it/s]
Full conduit: 2%|▏ | 3/157 [00:00<00:06, 25.60it/s]
Full conduit: 6%|β–Œ | 9/157 [00:00<00:03, 42.69it/s]
Full conduit: 10%|β–‰ | 15/157 [00:00<00:02, 48.80it/s]
Full conduit: 13%|β–ˆβ–Ž | 21/157 [00:00<00:02, 51.60it/s]
Full conduit: 20%|β–ˆβ–ˆ | 32/157 [00:00<00:02, 47.18it/s]
Comparative linear probes:
Release error only dims= 256 train=51.6% test=16.0%
Class Acc
------------------
airplane 27.8% β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ
auto 18.2% β–ˆβ–ˆβ–ˆ
bird 14.6% β–ˆβ–ˆ
cat 5.4% β–ˆ
deer 8.3% β–ˆ
dog 21.6% β–ˆβ–ˆβ–ˆβ–ˆ
frog 18.9% β–ˆβ–ˆβ–ˆ
horse 17.1% β–ˆβ–ˆβ–ˆ
ship 11.6% β–ˆβ–ˆ
truck 20.0% β–ˆβ–ˆβ–ˆβ–ˆ
Eigenvalues (S) only dims= 1024 train=91.6% test=20.7%
Class Acc
------------------
airplane 16.7% β–ˆβ–ˆβ–ˆ
auto 24.2% β–ˆβ–ˆβ–ˆβ–ˆ
bird 18.8% β–ˆβ–ˆβ–ˆ
cat 21.6% β–ˆβ–ˆβ–ˆβ–ˆ
deer 14.6% β–ˆβ–ˆ
dog 16.2% β–ˆβ–ˆβ–ˆ
frog 24.3% β–ˆβ–ˆβ–ˆβ–ˆ
horse 17.1% β–ˆβ–ˆβ–ˆ
ship 32.6% β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ
truck 22.5% β–ˆβ–ˆβ–ˆβ–ˆ
Friction only dims= 1024 train=92.4% test=22.5%
Class Acc
------------------
airplane 19.4% β–ˆβ–ˆβ–ˆ
auto 21.2% β–ˆβ–ˆβ–ˆβ–ˆ
bird 18.8% β–ˆβ–ˆβ–ˆ
cat 21.6% β–ˆβ–ˆβ–ˆβ–ˆ
deer 14.6% β–ˆβ–ˆ
dog 16.2% β–ˆβ–ˆβ–ˆ
frog 27.0% β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ
horse 22.0% β–ˆβ–ˆβ–ˆβ–ˆ
ship 34.9% β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ
truck 30.0% β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ
Combinations:
Release + Eigenvalues dims= 1280 train=98.4% test=18.5%
Class Acc
------------------
airplane 2.8%
auto 21.2% β–ˆβ–ˆβ–ˆβ–ˆ
bird 18.8% β–ˆβ–ˆβ–ˆ
cat 2.7%
deer 18.8% β–ˆβ–ˆβ–ˆ
dog 16.2% β–ˆβ–ˆβ–ˆ
frog 21.6% β–ˆβ–ˆβ–ˆβ–ˆ
horse 26.8% β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ
ship 25.6% β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ
truck 27.5% β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ
Release + Friction dims= 1280 train=98.3% test=15.3%
Class Acc
------------------
airplane 8.3% β–ˆ
auto 18.2% β–ˆβ–ˆβ–ˆ
bird 16.7% β–ˆβ–ˆβ–ˆ
cat 5.4% β–ˆ
deer 12.5% β–ˆβ–ˆ
dog 16.2% β–ˆβ–ˆβ–ˆ
frog 13.5% β–ˆβ–ˆ
horse 19.5% β–ˆβ–ˆβ–ˆ
ship 23.3% β–ˆβ–ˆβ–ˆβ–ˆ
truck 17.5% β–ˆβ–ˆβ–ˆ
Release + Eigenvalues + Friction dims= 2304 train=99.9% test=17.0%
Class Acc
------------------
airplane 13.9% β–ˆβ–ˆ
auto 3.0%
bird 16.7% β–ˆβ–ˆβ–ˆ
cat 10.8% β–ˆβ–ˆ
deer 12.5% β–ˆβ–ˆ
dog 21.6% β–ˆβ–ˆβ–ˆβ–ˆ
frog 27.0% β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ
horse 22.0% β–ˆβ–ˆβ–ˆβ–ˆ
ship 23.3% β–ˆβ–ˆβ–ˆβ–ˆ
truck 17.5% β–ˆβ–ˆβ–ˆ
======================================================================
6. HIGH-ERROR PATCHES β€” Where does reconstruction fail?
======================================================================
Top error positions per class (patch coordinates):
Class Top 3 positions (row, col) Error ratio
----------------------------------------------------------------
airplane (10,8), (9,8), (10,7) 1.19x
auto (11,14), (11,7), (10,1) 1.12x
bird (8,9), (7,9), (10,8) 1.11x
cat (5,8), (5,7), (6,5) 1.08x
deer (5,7), (5,5), (7,6) 1.06x
dog (15,15), (14,15), (9,15) 1.07x
frog (5,4), (4,7), (4,6) 1.07x
horse (10,9), (9,7), (9,9) 1.11x
ship (14,13), (14,12), (15,9) 1.20x
truck (10,14), (10,13), (10,1) 1.16x
Overall error map:
Mean: 0.000000
Std: 0.000000
Hot patches (>2Οƒ): 0/256
======================================================================
THEOREM 3: RELEASE FIDELITY β€” SUMMARY
======================================================================
SPATIAL STRUCTURE:
Recon error spatial CV: 0.3972
(Friction spatial CV was: 0.0137)
CLASSIFICATION (ridge probe, test accuracy):
Chance: 10.0%
Friction maps: 24.3% (from Cell 3)
Eigenvalue (S) maps: 21.0% (from Cell 3)
Release error maps: 19.0%
Release + Eigenvalues: 18.5%
Release + Friction: 15.3%
FULL CONDUIT (all three): 17.0%
THE QUESTION ANSWERED:
Does the release signal carry class-discriminative information
that eigenvalues and friction do not?
Lift from release over eigenvalues: -4.7pp
Lift from full conduit over eigenvalues: -3.7pp