Spaces:

wehe1pwe
/

math-under-llm

Running

Alex W. commited on 9 days ago

Commit

9d65593

1 Parent(s): f38ed3a

Add `core/plotter.py` and `ui/tab_plot.py` to generate Nature/PRL-standard

figures for Wang's Five Laws directly from the SQLite database.
Update `app.py` to mount the new Tab 5.

---

Pure computation layer. No UI dependency, no DB dependency.
Takes a `pd.DataFrame` (from `db/reader.py`) and returns `matplotlib.Figure`.

- `plot_single_model()` — 4×3 grid (12 subplots), single model
- `plot_compare_models()` — 4×3 grid, Model A (solid) vs Model B (dashed)
- `save_figure()` — exports PNG (300 dpi) + PDF (vector) + SVG (vector)
- `fig_to_plotly()` — converts matplotlib Figure to Plotly for interactive preview
- `fig_to_png_bytes()` — in-memory PNG bytes for Gradio Image component

Gradio Tab 5 interface. Calls `core/plotter.py` and `db/reader.py` only.

- Single-model mode: select model → generate → preview → download
- Two-model comparison mode: select A & B → generate → preview → download
- Controls: modality filter, layer range, band toggle, delta toggle
- Download: PNG / PDF / SVG / ZIP (all formats bundled)

---

- Added `from ui.tab_plot import build_tab_plot`
- Added `build_tab_plot()` call inside `gr.Tabs()` block

---

```
Row 1 — Law 1 & 2 (singular value alignment)
[0,0] pearson_QK [0,1] ssr_QK [0,2] alpha_QK

Row 2 — Law 3 (condition numbers & max singular values)
[1,0] sigma_max_Q [1,1] sigma_max_K [1,2] cond_Q & cond_K (dual line)

Row 3 — Law 4 (output subspace, left singular vectors U)
[2,0] cosU_QK [2,1] cosU_QV [2,2] cosU_KV
+ horizontal dashed: random baseline 1/√d_head

Row 4 — Law 5 (input subspace, right singular vectors V)
[3,0] cosV_QK [3,1] cosV_QV [3,2] cosV_KV
+ horizontal dashed: random baseline 1/√d_model
```

---

**Color system (consistent across all figures)**
| Entity | Color | Hex |
|--------|-------|-----|
| Q | Blue | `#2166AC` |
| K | Red | `#D6604D` |
| V | Green | `#4DAC26` |
| QK pair | Purple | `#762A83` |
| QV pair | Teal | `#01665E` |
| KV pair | Orange | `#E08214` |
| Reference lines | Gray | `#555555` |

**Line encoding**
- Model A (base): solid line
- Model B (RL-tuned): dashed line
- Δ (B − A): gray fill

**Statistical band**
- Thick line = median across heads per layer
- Shaded region = 25%–75% quantile range across heads
- Narrow band = heads behave consistently = model is well-organized

**Shared Y-axis**
- Row 3 (cosU): all three subplots share the same Y range → effect size is comparable
- Row 4 (cosV): same treatment
- Rows 1 & 2: independent Y axes (different physical units)

**Annotations**
- Horizontal dashed lines: theoretical ideals (r=1, SSR=0, α=1) and random baselines
- Vertical dotted lines: global (K=V shared) layers — Gemma-4 specific
- Footer text: lists global layer indices when present

**Output specs**
- Canvas: 18 × 20 inches
- Resolution: 300 DPI (PNG), vector (PDF, SVG)
- Font: DejaVu Sans — Title 11pt / Axis label 10pt / Tick 9pt / Legend 9pt
- Spine: top and right spines removed (clean academic style)

---

Follows the existing three-layer pattern:

```
core/plotter.py ← pure computation, returns Figure objects
↑
ui/tab_plot.py ← UI only, calls plotter + db/reader
↑
app.py ← mounts Tab 5
```

`core/plotter.py` can be used standalone from the command line without
importing any Gradio or DB code:

```python
from core.plotter import plot_single_model, save_figure
import pandas as pd

df = pd.read_csv("my_layer_metrics.csv")
fig = plot_single_model(df, "MyModel", head_dim=128, d_model=5120)
save_figure(fig, "./output/my_model")
```

---

Smoke-tested with synthetic data (48 layers × 32 heads):
- `plot_single_model()` → PNG 2.5 MB, PDF 61 KB, SVG 323 KB ✅
- `plot_compare_models()` → PNG 2.1 MB, PDF 70 KB, SVG 393 KB ✅
- All 12 subplots render correctly with band, baselines, and global-layer markers ✅
- Syntax validated: `ast.parse()` on all three files ✅

---

- [ ] Singular value spectrum snapshot figure (3 layers, Law 1&2 direct visualization)
- [ ] SSR deep-layer trend figure (RL improvement per layer group)
- [ ] Wang Score horizontal bar chart (leaderboard visualization)
- [ ] Auto-refresh model dropdown in Tab 5 without page reload
- [ ] CLI script: `python -m core.plotter --model-id google/gemma-4-e2b`

Files changed (3) hide show

app.py +4 -2
core/plotter.py +485 -0
ui/tab_plot.py +412 -0

app.py CHANGED Viewed

@@ -10,10 +10,9 @@ from ui.tab_inspect import build_tab_inspect
 from ui.tab_analyze import build_tab_analyze
 from ui.tab_leaderboard import build_tab_leaderboard
 from ui.tab_database import build_tab_database
 # ── 启动时初始化数据库 ────────────────────────
-# 幂等操作，重复调用安全
-# /data 目录由 HF Space bucket 挂载，重启后数据不丢失
 init_db()
 # ─────────────────────────────────────────────
@@ -58,6 +57,9 @@ with gr.Blocks(
         # Tab4：数据库浏览
         build_tab_database()
     # ── Tab1 → Tab2 同步模型 ID 和 token ─────────
     inspect_model_id.change(
         fn=lambda x: x,

 from ui.tab_analyze import build_tab_analyze
 from ui.tab_leaderboard import build_tab_leaderboard
 from ui.tab_database import build_tab_database
+from ui.tab_plot import build_tab_plot
 # ── 启动时初始化数据库 ────────────────────────
 init_db()
 # ─────────────────────────────────────────────
         # Tab4：数据库浏览
         build_tab_database()
+        # Tab5：作图（论文级别）
+        build_tab_plot()
     # ── Tab1 → Tab2 同步模型 ID 和 token ─────────
     inspect_model_id.change(
         fn=lambda x: x,

core/plotter.py ADDED Viewed

	@@ -0,0 +1,485 @@

+# core/plotter.py
+"""
+Publication-quality figure generation for Wang's Five Laws.
+Standards: Nature / PRL / top-conference level.
+Canvas: 18×20 inches @ 300 DPI, Arial/Helvetica fonts.
+Color system:
+  Q-related  → blue  (#2166AC)
+  K-related  → red   (#D6604D)
+  V-related  → green (#4DAC26)
+  QK pair    → purple (#762A83)
+  QV pair    → cyan   (#01665E)
+  KV pair    → orange (#E08214)
+  Model A (base)   → solid line
+  Model B (RL)     → dashed line
+  Delta            → gray fill
+"""
+import numpy as np
+import pandas as pd
+import matplotlib
+matplotlib.use("Agg")
+import matplotlib.pyplot as plt
+import matplotlib.patches as mpatches
+from matplotlib.lines import Line2D
+import io
+import os
+# ── Font & style ──────────────────────────────────────────────────────────────
+plt.rcParams.update({
+    "font.family":        "DejaVu Sans",   # fallback; Arial not always present
+    "font.size":          9,
+    "axes.titlesize":     11,
+    "axes.labelsize":     10,
+    "xtick.labelsize":    9,
+    "ytick.labelsize":    9,
+    "legend.fontsize":    9,
+    "figure.dpi":         300,
+    "savefig.dpi":        300,
+    "axes.linewidth":     0.8,
+    "grid.linewidth":     0.4,
+    "lines.linewidth":    1.5,
+    "legend.framealpha":  0.85,
+    "legend.edgecolor":   "0.7",
+    "axes.spines.top":    False,
+    "axes.spines.right":  False,
+})
+# ── Color palette ─────────────────────────────────────────────────────────────
+C = {
+    "Q":   "#2166AC",   # blue
+    "K":   "#D6604D",   # red
+    "V":   "#4DAC26",   # green
+    "QK":  "#762A83",   # purple
+    "QV":  "#01665E",   # cyan/teal
+    "KV":  "#E08214",   # orange
+    "ref": "#555555",   # reference line (gray)
+    "band_alpha": 0.18,
+}
+BAND_COLORS = {
+    "Q":  "#2166AC",
+    "K":  "#D6604D",
+    "QK": "#762A83",
+    "QV": "#01665E",
+    "KV": "#E08214",
+}
+# ─────────────────────────────────────────────────────────────────────────────
+# Data helpers
+# ─────────────────────────────────────────────────────────────────────────────
+def _aggregate_by_layer(df: pd.DataFrame, col: str):
+    """
+    Group by layer, return (layers, median, q25, q75).
+    Excludes kv_shared=True rows for KV metrics to avoid theoretical-value bias.
+    """
+    kv_cols = {"ssr_KV", "pearson_KV", "cosU_KV", "cosV_KV", "alpha_KV"}
+    if col in kv_cols:
+        df = df[df["kv_shared"] == 0] if "kv_shared" in df.columns else df
+    grp = df.groupby("layer")[col]
+    layers = np.array(sorted(df["layer"].unique()))
+    med    = grp.median().reindex(layers).values
+    q25    = grp.quantile(0.25).reindex(layers).values
+    q75    = grp.quantile(0.75).reindex(layers).values
+    return layers, med, q25, q75
+def _global_layers(df: pd.DataFrame):
+    """Return list of layer indices where kv_shared==True (Gemma global layers)."""
+    if "kv_shared" not in df.columns:
+        return []
+    return sorted(df[df["kv_shared"] == 1]["layer"].unique().tolist())
+# ─────────────────────────────────────────────────────────────────────────────
+# Single-subplot drawing primitives
+# ─────────────────────────────────────────────────────────────────────────────
+def _draw_line(ax, layers, med, q25, q75, color, label, linestyle="-",
+               show_band=True, global_layers=None):
+    ax.plot(layers, med, color=color, linestyle=linestyle,
+            linewidth=1.8, label=label, zorder=3)
+    if show_band:
+        ax.fill_between(layers, q25, q75, color=color,
+                        alpha=C["band_alpha"], zorder=2)
+    if global_layers:
+        for gl in global_layers:
+            ax.axvline(gl, color="#AAAAAA", linewidth=0.7,
+                       linestyle=":", zorder=1)
+def _add_hline(ax, y, label=None, color=None):
+    color = color or C["ref"]
+    ax.axhline(y, color=color, linewidth=1.0, linestyle="--",
+               alpha=0.75, zorder=1, label=label)
+def _finalize_ax(ax, title, ylabel, xlabel="Layer index"):
+    ax.set_title(title, fontweight="bold", pad=4)
+    ax.set_ylabel(ylabel)
+    ax.set_xlabel(xlabel)
+    ax.grid(True, axis="y", alpha=0.35)
+    ax.legend(loc="best", handlelength=1.5)
+# ─────────────────────────────────────────────────────────────────────────────
+# The 12-panel 4×3 figure  (single model)
+# ─────────────────────────────────────────────────────────────────────────────
+def plot_single_model(
+    df:         pd.DataFrame,
+    model_name: str,
+    show_band:  bool = True,
+    head_dim:   int  = 128,
+    d_model:    int  = 5120,
+) -> plt.Figure:
+    """
+    4×3 grid, 12 subplots.
+    Row 1 — Law 1 & 2 (singular value metrics):
+      [0,0] pearson_QK   [0,1] ssr_QK      [0,2] alpha_QK
+    Row 2 — Law 3 (condition numbers & max singular values):
+      [1,0] sigma_max_Q  [1,1] sigma_max_K  [1,2] cond_Q & cond_K (dual line)
+    Row 3 — Law 4 (output subspace, left singular vectors U):
+      [2,0] cosU_QK      [2,1] cosU_QV      [2,2] cosU_KV
+      + random baseline 1/√d_head
+    Row 4 — Law 5 (input subspace, right singular vectors V):
+      [3,0] cosV_QK      [3,1] cosV_QV      [3,2] cosV_KV
+      + random baseline 1/√d_model
+    """
+    fig, axes = plt.subplots(4, 3, figsize=(18, 20))
+    fig.suptitle(
+        f"Wang's Five Laws — {model_name}",
+        fontsize=14, fontweight="bold", y=0.995
+    )
+    gl = _global_layers(df)
+    baseline_U = 1.0 / np.sqrt(head_dim)
+    baseline_V = 1.0 / np.sqrt(d_model)
+    # ── helper ───────────────────────────────────────────────────────────────
+    def draw(ax, col, color, label, linestyle="-"):
+        layers, med, q25, q75 = _aggregate_by_layer(df, col)
+        _draw_line(ax, layers, med, q25, q75, color, label,
+                   linestyle=linestyle, show_band=show_band,
+                   global_layers=gl)
+    # ── Row 0: Law 1 & 2 ─────────────────────────────────────────────────────
+    ax = axes[0, 0]
+    draw(ax, "pearson_QK", C["QK"], "Pearson r (Q–K)")
+    _add_hline(ax, 1.0, "Ideal = 1")
+    _finalize_ax(ax, "Law 1 — Spectral Linear Alignment",
+                 "Pearson r (Q, K spectra)")
+    ax = axes[0, 1]
+    draw(ax, "ssr_QK", C["QK"], "SSR (Q–K)")
+    _add_hline(ax, 0.0, "Ideal = 0")
+    _finalize_ax(ax, "Law 2 — Spectral Shape Fidelity",
+                 "SSR (Q–K normalized)")
+    ax = axes[0, 2]
+    draw(ax, "alpha_QK", C["QK"], "α (Q–K)")
+    _add_hline(ax, 1.0, "Ideal = 1")
+    _finalize_ax(ax, "Law 1+2 — Scale Factor α (Q–K)",
+                 "Scale factor α")
+    # ── Row 1: Law 3 ─────────────────────────────────────────────────────────
+    ax = axes[1, 0]
+    draw(ax, "sigma_max_Q", C["Q"], "σ_max (Q)")
+    _finalize_ax(ax, "Law 3 — Max Singular Value (Q)",
+                 "σ_max")
+    ax = axes[1, 1]
+    draw(ax, "sigma_max_K", C["K"], "σ_max (K)")
+    _finalize_ax(ax, "Law 3 — Max Singular Value (K)",
+                 "σ_max")
+    ax = axes[1, 2]
+    draw(ax, "cond_Q", C["Q"], "κ(Q)")
+    draw(ax, "cond_K", C["K"], "κ(K)")
+    _finalize_ax(ax, "Law 3 — Condition Number κ",
+                 "Condition number κ")
+    # ── Row 2: Law 4 ─────────────────────────────────────────────────────────
+    # Share y-axis across this row
+    axU = [axes[2, 0], axes[2, 1], axes[2, 2]]
+    u_data = {}
+    for col in ["cosU_QK", "cosU_QV", "cosU_KV"]:
+        _, med, q25, q75 = _aggregate_by_layer(df, col)
+        u_data[col] = (med, q25, q75)
+    all_u = np.concatenate([np.concatenate([v[1], v[2]]) for v in u_data.values()])
+    all_u = all_u[~np.isnan(all_u)]
+    if len(all_u) > 0:
+        u_ymin = max(0, np.nanmin(all_u) * 0.92)
+        u_ymax = np.nanmax(all_u) * 1.08
+    else:
+        u_ymin, u_ymax = 0, 0.15
+    for (col, color, title_suffix), ax in zip(
+        [("cosU_QK", C["QK"], "Q–K"),
+         ("cosU_QV", C["QV"], "Q–V"),
+         ("cosU_KV", C["KV"], "K–V")],
+        axU
+    ):
+        draw(ax, col, color, f"cosU ({title_suffix})")
+        _add_hline(ax, baseline_U,
+                   f"Random = 1/√d_h ≈ {baseline_U:.4f}")
+        ax.set_ylim(u_ymin, u_ymax)
+        _finalize_ax(ax, f"Law 4 — Output Subspace cosU ({title_suffix})",
+                     "Mean |cos| (left singular vectors)")
+    # ── Row 3: Law 5 ─────────────────────────────────────────────────────────
+    axV = [axes[3, 0], axes[3, 1], axes[3, 2]]
+    v_data = {}
+    for col in ["cosV_QK", "cosV_QV", "cosV_KV"]:
+        _, med, q25, q75 = _aggregate_by_layer(df, col)
+        v_data[col] = (med, q25, q75)
+    all_v = np.concatenate([np.concatenate([v[1], v[2]]) for v in v_data.values()])
+    all_v = all_v[~np.isnan(all_v)]
+    if len(all_v) > 0:
+        v_ymin = max(0, np.nanmin(all_v) * 0.92)
+        v_ymax = np.nanmax(all_v) * 1.08
+    else:
+        v_ymin, v_ymax = 0, 0.05
+    for (col, color, title_suffix), ax in zip(
+        [("cosV_QK", C["QK"], "Q–K"),
+         ("cosV_QV", C["QV"], "Q–V"),
+         ("cosV_KV", C["KV"], "K–V")],
+        axV
+    ):
+        draw(ax, col, color, f"cosV ({title_suffix})")
+        _add_hline(ax, baseline_V,
+                   f"Random = 1/√D ≈ {baseline_V:.4f}")
+        ax.set_ylim(v_ymin, v_ymax)
+        _finalize_ax(ax, f"Law 5 — Input Subspace cosV ({title_suffix})",
+                     "Mean |cos| (right singular vectors)")
+    # ── Global layer legend ───────────────────────────────────────────────────
+    if gl:
+        fig.text(
+            0.5, 0.001,
+            f"Vertical dotted lines mark global (K=V shared) layers: {gl}",
+            ha="center", fontsize=8, color="#666666"
+        )
+    fig.tight_layout(rect=[0, 0.01, 1, 0.995])
+    return fig
+# ─────────────────────────────────────────────────────────────────────────────
+# Two-model comparison figure  (same 4×3, dual lines + delta subpanels)
+# ─────────────────────────────────────────────────────────────────────────────
+def plot_compare_models(
+    df_a:        pd.DataFrame,
+    df_b:        pd.DataFrame,
+    name_a:      str,
+    name_b:      str,
+    show_band:   bool = True,
+    show_delta:  bool = True,
+    head_dim:    int  = 128,
+    d_model:     int  = 5120,
+) -> plt.Figure:
+    """
+    4×3 comparison grid.
+    Each subplot: Model A (solid) vs Model B (dashed).
+    Delta (B - A) shown as gray fill when show_delta=True.
+    """
+    fig, axes = plt.subplots(4, 3, figsize=(18, 20))
+    fig.suptitle(
+        f"Wang's Five Laws — {name_a}  vs  {name_b}",
+        fontsize=14, fontweight="bold", y=0.995
+    )
+    gl_a = _global_layers(df_a)
+    gl_b = _global_layers(df_b)
+    gl   = sorted(set(gl_a) | set(gl_b))
+    baseline_U = 1.0 / np.sqrt(head_dim)
+    baseline_V = 1.0 / np.sqrt(d_model)
+    def draw_pair(ax, col, color, label_a, label_b, hline=None, hline_label=None):
+        """Draw Model A (solid) and Model B (dashed) on the same axes."""
+        lay_a, med_a, q25_a, q75_a = _aggregate_by_layer(df_a, col)
+        lay_b, med_b, q25_b, q75_b = _aggregate_by_layer(df_b, col)
+        _draw_line(ax, lay_a, med_a, q25_a, q75_a, color, label_a,
+                   linestyle="-", show_band=show_band, global_layers=gl)
+        _draw_line(ax, lay_b, med_b, q25_b, q75_b, color, label_b,
+                   linestyle="--", show_band=show_band, global_layers=None)
+        # Delta fill
+        if show_delta:
+            common = np.intersect1d(lay_a, lay_b)
+            if len(common) > 1:
+                idx_a = np.isin(lay_a, common)
+                idx_b = np.isin(lay_b, common)
+                delta = med_b[idx_b] - med_a[idx_a]
+                pos   = np.maximum(delta, 0)
+                neg   = np.minimum(delta, 0)
+                ax.fill_between(common, 0, pos,
+                                color="#AAAAAA", alpha=0.25, zorder=0)
+                ax.fill_between(common, 0, neg,
+                                color="#AAAAAA", alpha=0.25, zorder=0)
+        if hline is not None:
+            _add_hline(ax, hline, hline_label)
+    # ── Row 0 ────────────────────────────────────────────────────────────────
+    ax = axes[0, 0]
+    draw_pair(ax, "pearson_QK", C["QK"],
+              f"{name_a} Pearson r", f"{name_b} Pearson r", hline=1.0, hline_label="Ideal=1")
+    _finalize_ax(ax, "Law 1 — Spectral Linear Alignment", "Pearson r (Q, K)")
+    ax = axes[0, 1]
+    draw_pair(ax, "ssr_QK", C["QK"],
+              f"{name_a} SSR", f"{name_b} SSR", hline=0.0, hline_label="Ideal=0")
+    _finalize_ax(ax, "Law 2 — Spectral Shape Fidelity", "SSR (Q–K)")
+    ax = axes[0, 2]
+    draw_pair(ax, "alpha_QK", C["QK"],
+              f"{name_a} α", f"{name_b} α", hline=1.0, hline_label="Ideal=1")
+    _finalize_ax(ax, "Law 1+2 — Scale Factor α (Q–K)", "Scale factor α")
+    # ── Row 1 ────────────────────────────────────────────────────────────────
+    ax = axes[1, 0]
+    draw_pair(ax, "sigma_max_Q", C["Q"],
+              f"{name_a} σ_max(Q)", f"{name_b} σ_max(Q)")
+    _finalize_ax(ax, "Law 3 — Max Singular Value (Q)", "σ_max")
+    ax = axes[1, 1]
+    draw_pair(ax, "sigma_max_K", C["K"],
+              f"{name_a} σ_max(K)", f"{name_b} σ_max(K)")
+    _finalize_ax(ax, "Law 3 — Max Singular Value (K)", "σ_max")
+    ax = axes[1, 2]
+    # cond: draw both Q and K for both models → 4 lines
+    lay_a, med_a, q25_a, q75_a = _aggregate_by_layer(df_a, "cond_Q")
+    lay_b, med_b, q25_b, q75_b = _aggregate_by_layer(df_b, "cond_Q")
+    _draw_line(ax, lay_a, med_a, q25_a, q75_a, C["Q"],
+               f"{name_a} κ(Q)", "-", show_band, gl)
+    _draw_line(ax, lay_b, med_b, q25_b, q75_b, C["Q"],
+               f"{name_b} κ(Q)", "--", show_band, None)
+    lay_a, med_a, q25_a, q75_a = _aggregate_by_layer(df_a, "cond_K")
+    lay_b, med_b, q25_b, q75_b = _aggregate_by_layer(df_b, "cond_K")
+    _draw_line(ax, lay_a, med_a, q25_a, q75_a, C["K"],
+               f"{name_a} κ(K)", "-", show_band, None)
+    _draw_line(ax, lay_b, med_b, q25_b, q75_b, C["K"],
+               f"{name_b} κ(K)", "--", show_band, None)
+    _finalize_ax(ax, "Law 3 — Condition Number κ", "Condition number κ")
+    # ── Row 2: Law 4 ─────────────────────────────────────────────────────────
+    u_cols = [("cosU_QK", C["QK"], "Q–K"),
+              ("cosU_QV", C["QV"], "Q–V"),
+              ("cosU_KV", C["KV"], "K–V")]
+    # Compute shared y range
+    u_vals = []
+    for col, _, _ in u_cols:
+        for df_ in [df_a, df_b]:
+            _, med, q25, q75 = _aggregate_by_layer(df_, col)
+            u_vals.extend(q25[~np.isnan(q25)].tolist())
+            u_vals.extend(q75[~np.isnan(q75)].tolist())
+    u_ymin = max(0, min(u_vals) * 0.92) if u_vals else 0
+    u_ymax = (max(u_vals) * 1.08) if u_vals else 0.15
+    for (col, color, suffix), ax in zip(u_cols, axes[2]):
+        draw_pair(ax, col, color,
+                  f"{name_a}", f"{name_b}",
+                  hline=baseline_U,
+                  hline_label=f"Random 1/√d_h ≈ {baseline_U:.4f}")
+        ax.set_ylim(u_ymin, u_ymax)
+        _finalize_ax(ax, f"Law 4 — cosU ({suffix})",
+                     "Mean |cos| (U)")
+    # ── Row 3: Law 5 ─────────────────────────────────────────────────────────
+    v_cols = [("cosV_QK", C["QK"], "Q–K"),
+              ("cosV_QV", C["QV"], "Q–V"),
+              ("cosV_KV", C["KV"], "K–V")]
+    v_vals = []
+    for col, _, _ in v_cols:
+        for df_ in [df_a, df_b]:
+            _, med, q25, q75 = _aggregate_by_layer(df_, col)
+            v_vals.extend(q25[~np.isnan(q25)].tolist())
+            v_vals.extend(q75[~np.isnan(q75)].tolist())
+    v_ymin = max(0, min(v_vals) * 0.92) if v_vals else 0
+    v_ymax = (max(v_vals) * 1.08) if v_vals else 0.05
+    for (col, color, suffix), ax in zip(v_cols, axes[3]):
+        draw_pair(ax, col, color,
+                  f"{name_a}", f"{name_b}",
+                  hline=baseline_V,
+                  hline_label=f"Random 1/√D ≈ {baseline_V:.4f}")
+        ax.set_ylim(v_ymin, v_ymax)
+        _finalize_ax(ax, f"Law 5 — cosV ({suffix})",
+                     "Mean |cos| (V)")
+    # ── Legend for line styles ────────────────────────────────────────────────
+    solid_patch  = Line2D([0], [0], color="#333333", linewidth=1.8,
+                          linestyle="-",  label=f"Solid = {name_a}")
+    dashed_patch = Line2D([0], [0], color="#333333", linewidth=1.8,
+                          linestyle="--", label=f"Dashed = {name_b}")
+    fig.legend(handles=[solid_patch, dashed_patch],
+               loc="lower center", ncol=2, fontsize=9,
+               bbox_to_anchor=(0.5, 0.001))
+    if gl:
+        fig.text(
+            0.5, 0.0045,
+            f"Vertical dotted lines mark global (K=V shared) layers: {gl}",
+            ha="center", fontsize=8, color="#666666"
+        )
+    fig.tight_layout(rect=[0, 0.015, 1, 0.995])
+    return fig
+# ─────────────────────────────────────────────────────────────────────────────
+# Export helpers
+# ─────────────────────────────────────────────────────────────────────────────
+def save_figure(fig: plt.Figure, base_path: str):
+    """
+    Save figure to PNG (300 dpi), PDF (vector), and SVG (vector).
+    base_path: path without extension, e.g. "/tmp/wang_laws_gemma"
+    Returns list of saved file paths.
+    """
+    paths = []
+    for fmt, kwargs in [
+        ("png", {"dpi": 300, "bbox_inches": "tight"}),
+        ("pdf", {"bbox_inches": "tight"}),
+        ("svg", {"bbox_inches": "tight"}),
+    ]:
+        p = f"{base_path}.{fmt}"
+        fig.savefig(p, format=fmt, **kwargs)
+        paths.append(p)
+    return paths
+def fig_to_png_bytes(fig: plt.Figure) -> bytes:
+    """Return PNG bytes for Gradio Image component."""
+    buf = io.BytesIO()
+    fig.savefig(buf, format="png", dpi=150, bbox_inches="tight")
+    buf.seek(0)
+    return buf.read()
+def fig_to_plotly(fig_mpl: plt.Figure):
+    """
+    Convert matplotlib Figure to a Plotly figure via mpl_to_plotly.
+    Requires plotly installed.  Falls back gracefully.
+    """
+    try:
+        import plotly.tools as tls
+        return tls.mpl_to_plotly(fig_mpl)
+    except Exception:
+        return None

ui/tab_plot.py ADDED Viewed

	@@ -0,0 +1,412 @@

+# ui/tab_plot.py
+"""
+Tab5: Plot — Publication-quality figure generation
+Data pulled from SQLite DB.
+Supports: single model (4×3) and two-model comparison (4×3).
+Export: PNG (300 dpi) / PDF / SVG.
+Engine: matplotlib (publication) + optional Plotly (interactive).
+"""
+import os
+import tempfile
+import zipfile
+import gradio as gr
+import pandas as pd
+import numpy as np
+from db.schema import init_db
+from db.reader import get_layer_metrics, get_analyzed_models
+from core.plotter import (
+    plot_single_model,
+    plot_compare_models,
+    save_figure,
+    fig_to_plotly,
+)
+# ── Output directory ──────────────────────────────────────────────────────────
+_OUT_DIR = "/tmp/wang_plots"
+os.makedirs(_OUT_DIR, exist_ok=True)
+# ─────────────────────────────────────────────────────────────────────────────
+# DB helpers
+# ─────────────────────────────────────────────────────────────────────────────
+def _get_model_choices() -> list[str]:
+    try:
+        conn = init_db()
+        df   = get_analyzed_models(conn)
+        if df.empty:
+            return []
+        return df["model_id"].tolist()
+    except Exception:
+        return []
+def _load_df(model_id: str, modality: str,
+             start_layer: int, end_layer: int) -> pd.DataFrame:
+    conn = init_db()
+    df = get_layer_metrics(
+        conn,
+        model_id    = model_id,
+        modality    = modality if modality != "all" else None,
+        layer_type  = None,
+        start_layer = int(start_layer),
+        end_layer   = int(end_layer),
+    )
+    return df
+def _infer_dims(df: pd.DataFrame) -> tuple[int, int]:
+    """Try to read head_dim and d_model from the dataframe."""
+    head_dim = 128
+    d_model  = 5120
+    if not df.empty:
+        if "head_dim" in df.columns:
+            v = df["head_dim"].dropna()
+            if len(v):
+                head_dim = int(v.median())
+        if "d_model" in df.columns:
+            v = df["d_model"].dropna()
+            if len(v):
+                d_model = int(v.median())
+    return head_dim, d_model
+def _short_name(model_id: str) -> str:
+    return model_id.split("/")[-1] if "/" in model_id else model_id
+def _safe_base_path(name: str) -> str:
+    safe = name.replace("/", "_").replace(" ", "_")
+    return os.path.join(_OUT_DIR, safe)
+# ─────────────────────────────────────────────────────────────────────────────
+# Main generation functions
+# ─────────────────────────────────────────────────────────────────────────────
+def generate_single(
+    model_id:    str,
+    modality:    str,
+    start_layer: int,
+    end_layer:   int,
+    show_band:   bool,
+    progress=gr.Progress()
+) -> tuple:
+    """
+    Returns: (status_str, png_path, [png_path, pdf_path, svg_path], plotly_fig)
+    """
+    if not model_id or not model_id.strip():
+        return "❌ Please select a model.", None, None, None
+    progress(0.1, desc="Loading data from DB...")
+    df = _load_df(model_id, modality, start_layer, end_layer)
+    if df.empty:
+        return (
+            f"❌ No data found for {model_id} "
+            f"(modality={modality}, layers {start_layer}~{end_layer}).\n"
+            f"Please run analysis first in Tab 2.",
+            None, None, None
+        )
+    progress(0.35, desc="Inferring dimensions...")
+    head_dim, d_model = _infer_dims(df)
+    n_layers  = df["layer"].nunique()
+    n_records = len(df)
+    progress(0.50, desc="Generating matplotlib figure...")
+    name = _short_name(model_id)
+    fig  = plot_single_model(
+        df, model_name=name,
+        show_band=show_band,
+        head_dim=head_dim,
+        d_model=d_model,
+    )
+    progress(0.75, desc="Saving PNG / PDF / SVG...")
+    base  = _safe_base_path(f"single_{name}_L{start_layer}-{end_layer}")
+    paths = save_figure(fig, base)
+    progress(0.90, desc="Generating Plotly preview...")
+    plotly_fig = fig_to_plotly(fig)
+    import matplotlib.pyplot as plt
+    plt.close(fig)
+    status = (
+        f"✅ {model_id}  |  modality={modality}  "
+        f"|  layers {start_layer}~{end_layer}  "
+        f"|  {n_layers} layers  {n_records} head-records\n"
+        f"   head_dim={head_dim}  d_model={d_model}\n"
+        f"   Saved: {', '.join(os.path.basename(p) for p in paths)}"
+    )
+    png_path = paths[0]
+    return status, png_path, paths, plotly_fig
+def generate_compare(
+    model_a:     str,
+    model_b:     str,
+    modality:    str,
+    start_layer: int,
+    end_layer:   int,
+    show_band:   bool,
+    show_delta:  bool,
+    progress=gr.Progress()
+) -> tuple:
+    if not model_a or not model_b:
+        return "❌ Please select both models.", None, None, None
+    if model_a == model_b:
+        return "❌ Please select two different models.", None, None, None
+    progress(0.10, desc="Loading Model A from DB...")
+    df_a = _load_df(model_a, modality, start_layer, end_layer)
+    progress(0.25, desc="Loading Model B from DB...")
+    df_b = _load_df(model_b, modality, start_layer, end_layer)
+    if df_a.empty:
+        return f"❌ No data for Model A ({model_a}).", None, None, None
+    if df_b.empty:
+        return f"❌ No data for Model B ({model_b}).", None, None, None
+    head_dim_a, d_model_a = _infer_dims(df_a)
+    head_dim_b, d_model_b = _infer_dims(df_b)
+    head_dim = int((head_dim_a + head_dim_b) / 2)
+    d_model  = int((d_model_a + d_model_b) / 2)
+    progress(0.50, desc="Generating comparison figure...")
+    name_a = _short_name(model_a)
+    name_b = _short_name(model_b)
+    fig = plot_compare_models(
+        df_a, df_b,
+        name_a=name_a, name_b=name_b,
+        show_band=show_band,
+        show_delta=show_delta,
+        head_dim=head_dim,
+        d_model=d_model,
+    )
+    progress(0.80, desc="Saving PNG / PDF / SVG...")
+    base  = _safe_base_path(f"compare_{name_a}_vs_{name_b}_L{start_layer}-{end_layer}")
+    paths = save_figure(fig, base)
+    progress(0.92, desc="Generating Plotly preview...")
+    plotly_fig = fig_to_plotly(fig)
+    import matplotlib.pyplot as plt
+    plt.close(fig)
+    status = (
+        f"✅ {name_a}  vs  {name_b}\n"
+        f"   modality={modality}  layers {start_layer}~{end_layer}\n"
+        f"   Model A: {len(df_a)} records  |  Model B: {len(df_b)} records\n"
+        f"   head_dim≈{head_dim}  d_model≈{d_model}\n"
+        f"   Saved: {', '.join(os.path.basename(p) for p in paths)}"
+    )
+    return status, paths[0], paths, plotly_fig
+def make_zip(file_paths: list) -> str | None:
+    """Bundle all exported files into a single ZIP for download."""
+    if not file_paths:
+        return None
+    zip_path = os.path.join(_OUT_DIR, "wang_laws_figures.zip")
+    with zipfile.ZipFile(zip_path, "w", zipfile.ZIP_DEFLATED) as zf:
+        for p in file_paths:
+            if p and os.path.exists(p):
+                zf.write(p, os.path.basename(p))
+    return zip_path
+# ─────────────────────────────────────────────────────────────────────────────
+# Tab5 UI
+# ─────────────────────────────────────────────────────────────────────────────
+def build_tab_plot():
+    with gr.Tab("📈 Plot"):
+        gr.Markdown("""
+        ## Wang's Five Laws — Publication-Quality Figures
+        Data is loaded directly from the SQLite database (Tab 2 must be run first).
+        **4×3 grid layout** (12 subplots, one figure):
+        | Row | Content | Laws |
+        |-----|---------|------|
+        | 1 | pearson_QK · SSR_QK · α_QK | Law 1 & 2 |
+        | 2 | σ_max(Q) · σ_max(K) · κ(Q) & κ(K) | Law 3 |
+        | 3 | cosU QK · QV · KV + random baseline | Law 4 |
+        | 4 | cosV QK · QV · KV + random baseline | Law 5 |
+        Export: **PNG 300 dpi** · **PDF (vector)** · **SVG (vector)**
+        """)
+        # ── Shared controls ───────────────────────────────────────────────────
+        with gr.Row():
+            modality_sel = gr.Dropdown(
+                label="Modality",
+                choices=["language", "vision", "audio", "all"],
+                value="language",
+                scale=1,
+            )
+            start_l = gr.Number(
+                label="Start Layer", value=0,  precision=0, scale=1
+            )
+            end_l = gr.Number(
+                label="End Layer",   value=47, precision=0, scale=1
+            )
+            show_band_chk = gr.Checkbox(
+                label="Show 25%–75% band (head consistency)",
+                value=True, scale=1
+            )
+        gr.Markdown("---")
+        # ══ Mode 1: Single model ══════════════════════════════════════════════
+        with gr.Accordion("📊 Single Model", open=True):
+            with gr.Row():
+                model_choices = _get_model_choices()
+                single_model = gr.Dropdown(
+                    label="Model",
+                    choices=model_choices,
+                    value=model_choices[0] if model_choices else None,
+                    allow_custom_value=True,
+                    scale=3,
+                    info="Refresh the page after analyzing new models to update this list."
+                )
+                single_btn = gr.Button(
+                    "🎨 Generate Figure", variant="primary", scale=1
+                )
+            single_status = gr.Textbox(
+                label="Status", lines=3, interactive=False
+            )
+            with gr.Tabs():
+                with gr.Tab("🖼️ Preview (PNG)"):
+                    single_img = gr.Image(
+                        label="Figure preview",
+                        type="filepath",
+                        height=600,
+                    )
+                with gr.Tab("📉 Interactive (Plotly)"):
+                    single_plotly = gr.Plot(label="Plotly interactive")
+            with gr.Row():
+                dl_single_png = gr.File(label="⬇ PNG (300 dpi)")
+                dl_single_pdf = gr.File(label="⬇ PDF (vector)")
+                dl_single_svg = gr.File(label="⬇ SVG (vector)")
+                dl_single_zip = gr.File(label="⬇ ZIP (all formats)")
+        gr.Markdown("---")
+        # ══ Mode 2: Two-model comparison ══════════════════════════════════════
+        with gr.Accordion("📊 Two-Model Comparison", open=False):
+            with gr.Row():
+                model_a = gr.Dropdown(
+                    label="Model A (solid line)",
+                    choices=model_choices,
+                    value=model_choices[0] if len(model_choices) > 0 else None,
+                    allow_custom_value=True,
+                    scale=2,
+                )
+                model_b = gr.Dropdown(
+                    label="Model B (dashed line)",
+                    choices=model_choices,
+                    value=model_choices[1] if len(model_choices) > 1 else None,
+                    allow_custom_value=True,
+                    scale=2,
+                )
+                show_delta_chk = gr.Checkbox(
+                    label="Show Δ (B − A) fill",
+                    value=True, scale=1
+                )
+                compare_btn = gr.Button(
+                    "🎨 Generate Comparison", variant="primary", scale=1
+                )
+            compare_status = gr.Textbox(
+                label="Status", lines=3, interactive=False
+            )
+            with gr.Tabs():
+                with gr.Tab("🖼️ Preview (PNG)"):
+                    compare_img = gr.Image(
+                        label="Comparison figure preview",
+                        type="filepath",
+                        height=600,
+                    )
+                with gr.Tab("📉 Interactive (Plotly)"):
+                    compare_plotly = gr.Plot(label="Plotly interactive")
+            with gr.Row():
+                dl_cmp_png = gr.File(label="⬇ PNG (300 dpi)")
+                dl_cmp_pdf = gr.File(label="⬇ PDF (vector)")
+                dl_cmp_svg = gr.File(label="⬇ SVG (vector)")
+                dl_cmp_zip = gr.File(label="⬇ ZIP (all formats)")
+        gr.Markdown("""
+        ---
+        **Tips**
+        - Band = 25%–75% quantile across attention heads per layer.
+          Narrow band → heads behave consistently → model is "well-organized".
+        - Vertical dotted lines mark **global layers** (K=V shared, e.g. Gemma-4).
+        - Dashed horizontal lines = theoretical ideals or random baselines.
+        - For Law 4 & 5 panels, Q–V and K–V cosU values **below** the random baseline
+          indicate **super-orthogonality** — a key signature of pretraining convergence.
+        """)
+        # ── Wire up single model ──────────────────────────────────────────────
+        _single_file_state = gr.State([])
+        def _run_single(model_id, modality, start, end, band, progress=gr.Progress()):
+            status, png, paths, plotly_fig = generate_single(
+                model_id, modality, int(start), int(end), band, progress
+            )
+            if paths is None:
+                return status, None, None, None, None, None, None, []
+            zip_p = make_zip(paths)
+            png_p = paths[0] if len(paths) > 0 else None
+            pdf_p = paths[1] if len(paths) > 1 else None
+            svg_p = paths[2] if len(paths) > 2 else None
+            return (status, png, plotly_fig,
+                    png_p, pdf_p, svg_p, zip_p, paths)
+        single_btn.click(
+            fn=_run_single,
+            inputs=[single_model, modality_sel, start_l, end_l, show_band_chk],
+            outputs=[
+                single_status, single_img, single_plotly,
+                dl_single_png, dl_single_pdf, dl_single_svg, dl_single_zip,
+                _single_file_state,
+            ]
+        )
+        # ── Wire up comparison ────────────────────────────────────────────────
+        _compare_file_state = gr.State([])
+        def _run_compare(ma, mb, modality, start, end, band, delta,
+                         progress=gr.Progress()):
+            status, png, paths, plotly_fig = generate_compare(
+                ma, mb, modality, int(start), int(end), band, delta, progress
+            )
+            if paths is None:
+                return status, None, None, None, None, None, None, []
+            zip_p = make_zip(paths)
+            png_p = paths[0] if len(paths) > 0 else None
+            pdf_p = paths[1] if len(paths) > 1 else None
+            svg_p = paths[2] if len(paths) > 2 else None
+            return (status, png, plotly_fig,
+                    png_p, pdf_p, svg_p, zip_p, paths)
+        compare_btn.click(
+            fn=_run_compare,
+            inputs=[model_a, model_b, modality_sel,
+                    start_l, end_l, show_band_chk, show_delta_chk],
+            outputs=[
+                compare_status, compare_img, compare_plotly,
+                dl_cmp_png, dl_cmp_pdf, dl_cmp_svg, dl_cmp_zip,
+                _compare_file_state,
+            ]
+        )