feat(v0.4): add 3 diagnostic recipes from session 29 cross-model panel
New TAF formulas (session 29 findings, 2026-04-28, n=22 LLM panel):
- §28 ν = −1/(2π) learned-imprint slope (DERIVED + empirical err 0.3%)
- §29 K = γ × log(N²·D) Chinchilla-attention invariant (CV=0.329)
- §30 sign(γ_text − γ_random) IH-formation discriminator
- §31 γ-cluster on famous constants (CodeLlama=1−1/φ, etc — n=4 intriguing)
New Python functions (python/taf_browser.py):
- gamma_random_predict(theta, T_eval, n_params_M) — F1 imprint formula
- imprint_purity(...) — diagnostic with ±0.18 CI
- compute_invariant_K(...) — F2 with z-score vs panel
- ih_phase_check(...) — F4 Δγ probe
- gamma_decompose_v2(...) — 6-axis with imprint + instruct
- famous_constant_proximity(...) — golden-ratio detector
New recipes:
- X-21 Imprint Purity Diagnostic (predicts γ_random, classifies cleanliness)
- X-22 Compute-Context Invariant (K-band membership check)
- X-23 IH-Phase Detector (Δγ probe + size-consistency check)
UI updates:
- Help modal expanded with v0.4 section in 4 languages (EN/ES/FR/ZH)
- Recipe count updated 5 → 8
- New help.recipe.x{21,22,23} keys + help.section.v04 + help.v04.{imprint,invariant,ih_probe,constants}
README adds:
- Diagnostic recipes block (X-21/X-22/X-23) under "What it does"
- "What's new in v0.4" section with formulas and use cases
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
- README.md +68 -9
- index.html +28 -1
- js/i18n.js +48 -4
- python/taf_browser.py +355 -0
README.md
@@ -33,7 +33,7 @@ language:
 
 **🌐 Live**: https://karlesmarin.github.io/tafagent
 **📦 Source**: https://github.com/karlesmarin/tafagent
-**📄 Paper**: [
+**📄 Paper**: [Predicting How Transformers Attend — Marin 2026](https://zenodo.org/records/19826343)
 
 ---
 
@@ -59,15 +59,21 @@ Drop in a model id (or paste any HuggingFace public model), get a
 falsifiable answer to "**will this work?**" — backed by the
 Thermodynamic Attention Framework (TAF) formulas:
 
+**Decision recipes**
 - *Will Llama-3-8B serve 32K context with NIAH retrieval?* → **X-2**
 - *Should I train a custom 7B model or pay for API access?* → **X-1**
 - *I have $5,000 — what model can I afford to train?* → **X-3**
 - *Cheapest GPU to serve Llama-70B at 100M tokens/day?* → **X-5**
 - *Soft KV decay or hard cutoff for compression?* → **X-19**
 
-
-
-
+**Diagnostic recipes** (NEW in v0.4 — session 29 findings, 2026-04-28)
+- *How much positional bias did training imprint on this model?* → **X-21**
+- *Does this model fit the empirical compute-context invariant band?* → **X-22**
+- *Is this checkpoint pre- or post-induction-head?* → **X-23**
+
+Each runs as a chain of TAF formulas (paper §17, §19, §20, §24, §26, §28–§30)
+rendered with a full audit trail. Every number is deterministic Python;
+nothing is hallucinated.
 
 ## Four ways to use it
 
@@ -152,9 +158,61 @@ paper (343 JSON files, ~5.5 MB). See `data/README.md` for the layout.
 - ~2 GB free RAM for the synthesis LLM
 - ~350 MB disk for model cache (one-time)
 
+## What's new in v0.4 (2026-04-28)
+
+Three new diagnostic recipes derived from cross-model panel analysis (n=22 LLMs):
+
+### X-21 — Imprint Purity Diagnostic
+Predicts γ on RANDOM-token input via the **learned-imprint formula**:
+
+```
+γ_random = γ_pade(θ, T) + ν · log_10(P / 14M)
+ν = −1/(2π) ≈ −0.1592   (DERIVED from RoPE rotation period)
+```
+
+Even on random tokens, weights apply a learned positional bias proportional
+to log(N_params). The slope ν is **fixed** (not fitted) — derivable from
+RoPE's 2π rotation period. Empirical validation: n=22 LLMs, p=0.022, |err|=0.3%.
+
+**Use case**: detect anomalous training, format conversion (e.g. OLMo native
+vs HF, Δγ=0.30), or fine-tuning drift by comparing predicted vs measured
+γ_random.
+
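To make the X-21 arithmetic concrete, here is a minimal sketch, assuming the §26.1 Padé form quoted in the X-21 audit chain, (2θ − T√2)/(2θ + T√2); the Llama-3-8B-style config values are illustrative, not measurements:

```python
import math

def gamma_pade(theta: float, T: float) -> float:
    # §26.1 Padé centroid as quoted in the X-21 audit chain: (2θ − T√2)/(2θ + T√2)
    return (2 * theta - T * math.sqrt(2)) / (2 * theta + T * math.sqrt(2))

NU = -1.0 / (2 * math.pi)   # ν = −1/(2π) ≈ −0.1592, derived rather than fitted
P0_M = 14.0                 # baseline pythia-14m, in millions of parameters

def gamma_random_predict(theta: float, T_eval: int, n_params_M: float) -> float:
    # §28.1: γ_random = γ_pade(θ, T) + ν · log_10(P / P_0)
    return gamma_pade(theta, T_eval) + NU * math.log10(n_params_M / P0_M)

# Illustrative Llama-3-8B-style config: θ=500000, T_eval=8192, P≈8030M
print(round(gamma_random_predict(500_000, 8192, 8030), 3))  # ≈ 0.538 (± 0.18, 95% CI)
```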
+### X-22 — Compute-Context Invariant
+Computes the empirical Chinchilla×attention invariant:
+
+```
+K = γ × log(N² · D)   where D = 20·N (Chinchilla compute-optimal)
+Empirical band: K ∈ [34, 68] (51.2 ± 16.8, CV=0.329, n=22)
+```
+
+K-outliers indicate scaling/training anomalies. Llama-3-8B with γ=1.045
+gives K=74.6 (z=1.39, high-K OUTLIER) — flags supra-Padé attention.
+
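The quoted Llama-3-8B number can be reproduced in a few lines — a sketch, assuming natural log (as in the module's `math.log`) and an approximate N = 8×10⁹:

```python
import math

def compute_K(gamma: float, n_params: float, d_tokens=None) -> float:
    # §29: K = γ · ln(N²·D); D defaults to the Chinchilla compute-optimal 20·N
    if d_tokens is None:
        d_tokens = 20 * n_params
    return gamma * math.log(n_params ** 2 * d_tokens)

N = 8.0e9                        # Llama-3-8B, approximate parameter count
K = compute_K(1.045, N)          # γ=1.045 as quoted above
z = (K - 51.2) / 16.8            # z-score against the n=22 panel band
print(f"K={K:.1f}  z={z:.2f}")   # K=74.6  z=1.39 → high-K outlier
```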
+### X-23 — IH-Phase Detector
+Uses the Δγ probe (cheaper than ICL benchmark):
+
+```
+sign(γ_text − γ_random) > 0 ⟺ post-induction-head formation
+```
+
+Pre-IH (P<400M, n=7):   ⟨Δγ⟩ = −0.19 ± 0.26
+Post-IH (P≥400M, n=15): ⟨Δγ⟩ = +0.03 ± 0.26
+
+**Use case**: monitor training trajectories without running ICL benchmarks;
+detect anomalous checkpoints.
+
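A minimal sketch of the probe itself; the two γ values below are hypothetical measurements, not panel data:

```python
def ih_phase(gamma_text: float, gamma_random: float) -> str:
    # §30: sign(γ_text − γ_random) discriminates the induction-head phase
    delta = gamma_text - gamma_random
    if delta > 0:
        return "post-IH"
    if delta < 0:
        return "pre-IH"
    return "ambiguous"

# Hypothetical checkpoint: γ_text=0.62, γ_random=0.54 → Δγ=+0.08
print(ih_phase(0.62, 0.54))  # post-IH
```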
+### Other v0.4 additions
+
+- `gamma_decompose_v2(...)` — 6-axis decomposition with the new imprint axis
+- `famous_constant_proximity(...)` — detects γ-cluster on famous constants
+  (e.g. CodeLlama-13b γ=0.382 ≈ 1−1/φ, the golden conjugate — see the sketch below)
+
+---
+
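A sketch of that nearest-constant check; the constant list follows the §31 docstring, while the tolerance and return shape are illustrative:

```python
import math

PHI = (1 + math.sqrt(5)) / 2
CONSTANTS = {
    "1−1/φ (golden conjugate)": 1 - 1 / PHI,
    "1/√2": 1 / math.sqrt(2),
    "1−1/√2": 1 - 1 / math.sqrt(2),
    "log_10(e)": math.log10(math.e),
}

def nearest_constant(gamma: float):
    # Return the closest famous constant and the absolute error
    name, val = min(CONSTANTS.items(), key=lambda kv: abs(gamma - kv[1]))
    return name, val, abs(gamma - val)

print(nearest_constant(0.3823))  # CodeLlama-13b: 1−1/φ ≈ 0.3820, err ≈ 0.0003
```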
 ## How you can help
 
-This tool is at v0.
+This tool is at v0.4. There's a long way to go.
 
 - **🐛 Report bugs**: https://github.com/karlesmarin/tafagent/issues
 - **🌐 Translate**: add a language to `js/i18n.js`, send a PR
@@ -171,12 +229,13 @@ This tool is at v0.3. There's a long way to go.
 If this tool helps you — paper or code:
 
 ```bibtex
-@article{
+@article{marin2026predicting,
 author = {Marin, Carles},
-title = {
-
+title = {Predicting How Transformers Attend:
+Analytic Power-Law Theory, Phase Transitions, and Practical Compression
+Tools},
 year = {2026},
-url = {https://
+url = {https://zenodo.org/records/19826343},
 }
 
 @misc{marin2026tafagent,
index.html
@@ -77,7 +77,7 @@
 <p data-i18n="help.modes.ask"><strong>💬 Ask plain English</strong>: free-form question, in-browser LLM picks the recipe. Best for casual exploration.</p>
 <p data-i18n="help.modes.recipe"><strong>📋 Recipe + form</strong>: manual selection, full parameter control. Best when you want exact control.</p>
 
-<h3 data-i18n="help.recipes.title">The
+<h3 data-i18n="help.recipes.title">The 8 recipes available</h3>
 
 <p data-i18n="help.recipe.x1.title"><strong>X-1 Custom training vs API</strong> — compares cost of training your own model vs paying for API access.</p>
 <div class="help-example" data-i18n="help.recipe.x1.example">
@@ -110,6 +110,33 @@
 Answer: USE SOFT DECAY / USE D_f CUTOFF / USE LITERATURE METHODS / USE HARD T_train.
 </div>
 
+<h3 style="margin-top: 1.5em;">— v0.4 (session 29 findings) —</h3>
+
+<p data-i18n="help.section.v04"><strong>What's new in v0.4</strong> (session 29 findings, 2026-04-28): three diagnostic recipes derived from cross-model panel analysis (n=22 LLMs).</p>
+
+<p data-i18n="help.recipe.x21.title"><strong>X-21 Imprint Purity Diagnostic</strong> — predicts γ on RANDOM tokens via ν=−1/(2π); how clean is the model's RoPE prediction?</p>
+<div class="help-example" data-i18n="help.recipe.x21.example">
+Try: <em>"How clean is the RoPE prediction on Llama-3-8B?"</em><br>
+Answer: predicted γ_random + purity diagnostic (CLEAN / OVER-IMPRINTED / UNDER-IMPRINTED).
+</div>
+<p data-i18n="help.v04.imprint" style="font-size: 0.9em; opacity: 0.85;"><strong>Learned-imprint slope ν = −1/(2π)</strong>: RoPE's 2π rotation period drives a positional bias on the weights, proportional to log(N_params). Even random tokens show this scaling. ν is DERIVED — not fitted (empirical err 0.3%).</p>
+
+<p data-i18n="help.recipe.x22.title"><strong>X-22 Compute-Context Invariant</strong> — does γ × log(N²·D) lie in the panel band 51.2 ± 16.8? Detects scaling/training anomalies.</p>
+<div class="help-example" data-i18n="help.recipe.x22.example">
+Try: <em>"Does Mistral-7B fit the compute-context invariant?"</em><br>
+Answer: K = γ·log(N²·D), z-score, IN-BAND or OUTLIER.
+</div>
+<p data-i18n="help.v04.invariant" style="font-size: 0.9em; opacity: 0.85;"><strong>Chinchilla-attention invariant K</strong>: γ × log(N²·D) ≈ 51.2 ± 16.8 (CV=0.329). Connects compute scaling and the attention exponent into a single dimensionless number.</p>
+
+<p data-i18n="help.recipe.x23.title"><strong>X-23 IH-Phase Detector</strong> — pre- or post-induction-head? Cheap probe via sign(γ_text − γ_random).</p>
+<div class="help-example" data-i18n="help.recipe.x23.example">
+Try: <em>"Is Qwen2.5-7B post-induction-head?"</em><br>
+Answer: CONFIRMED PRE-IH / CONFIRMED POST-IH / ANOMALY (with size-vs-Δγ consistency check).
+</div>
+<p data-i18n="help.v04.ih_probe" style="font-size: 0.9em; opacity: 0.85;"><strong>Δγ as IH probe</strong>: sign(γ_text − γ_random) > 0 ⟺ post-induction-head. Cheaper than running an in-context-learning benchmark.</p>
+
+<p data-i18n="help.v04.constants" style="font-size: 0.9em; opacity: 0.85;"><strong>γ-cluster on famous constants</strong> (intriguing, n=4): CodeLlama-13b γ=0.382 ≈ 1−1/φ (golden conjugate, err 0.0003); pythia-1.4b γ=0.705 ≈ 1/√2; Llama-2-7b γ=0.287 ≈ 1−1/√2; Mistral-Nemo γ=0.428 ≈ log_10(e). Caveat: could be coincidence.</p>
+
 <h3 data-i18n="help.add_models.title">Adding new models (3 ways)</h3>
 <ul>
 <li data-i18n="help.add_models.preset"><strong>Preset list</strong>: 11 popular models curated. Just select from dropdown.</li>
js/i18n.js
@@ -170,7 +170,7 @@ export const TRANSLATIONS = {
 "help.modes.compare": "<strong>🆚 Compare</strong>: 2-3 models side-by-side on same recipe. Best when choosing between candidates.",
 "help.modes.ask": "<strong>💬 Ask plain English</strong>: free-form question, in-browser LLM picks the recipe. Best for casual exploration.",
 "help.modes.recipe": "<strong>📋 Recipe + form</strong>: manual selection, full parameter control. Best when you want exact control.",
-"help.recipes.title": "The
+"help.recipes.title": "The 8 recipes available",
 "help.recipe.x1.title": "<strong>X-1 Custom training vs API</strong> — compares cost of training your own model vs paying for API access.",
 "help.recipe.x1.example": "Try: <em>\"Should I train an 8B custom model or use GPT-4o for 50M tokens/month?\"</em><br>Answer types: YES (custom) / NO (API) with break-even months.",
 "help.recipe.x2.title": "<strong>X-2 Long Context Viability</strong> — predicts if a model serves a target context length reliably.",
@@ -180,7 +180,18 @@ export const TRANSLATIONS = {
 "help.recipe.x5.title": "<strong>X-5 Hardware selection</strong> — which GPU should I use to serve at target throughput?",
 "help.recipe.x5.example": "Try: <em>\"Cheapest hardware to serve Llama-3-8B at 10M tokens/day\"</em><br>Answer: best GPU + $/Mtok + capacity vs target.",
 "help.recipe.x19.title": "<strong>X-19 KV Compression decision</strong> — should I use soft decay, hard cutoff, or literature methods?",
+"help.recipe.x21.title": "<strong>X-21 Imprint Purity Diagnostic</strong> — predicts γ on RANDOM tokens via ν=−1/(2π); how clean is the model's RoPE prediction?",
+"help.recipe.x22.title": "<strong>X-22 Compute-Context Invariant</strong> — does γ × log(N²·D) lie in the panel band 51.2 ± 16.8? Detects scaling/training anomalies.",
+"help.recipe.x23.title": "<strong>X-23 IH-Phase Detector</strong> — pre- or post-induction-head? Cheap probe via sign(γ_text − γ_random).",
 "help.recipe.x19.example": "Try: <em>\"How to compress KV cache for Qwen2.5-7B at 32K?\"</em><br>Answer: USE SOFT DECAY / USE D_f CUTOFF / USE LITERATURE METHODS / USE HARD T_train.",
+"help.recipe.x21.example": "Try: <em>\"How clean is the RoPE prediction on Llama-3-8B?\"</em><br>Answer: predicted γ_random + purity diagnostic (CLEAN / OVER-IMPRINTED / UNDER-IMPRINTED).",
+"help.recipe.x22.example": "Try: <em>\"Does Mistral-7B fit the compute-context invariant?\"</em><br>Answer: K = γ·log(N²·D), z-score, IN-BAND or OUTLIER.",
+"help.recipe.x23.example": "Try: <em>\"Is Qwen2.5-7B post-induction-head?\"</em><br>Answer: CONFIRMED PRE-IH / CONFIRMED POST-IH / ANOMALY (with size-vs-Δγ consistency check).",
+"help.section.v04": "<strong>What's new in v0.4</strong> (session 29 findings, 2026-04-28): three diagnostic recipes derived from cross-model panel analysis (n=22 LLMs).",
+"help.v04.imprint": "<strong>Learned-imprint slope ν = −1/(2π)</strong>: RoPE's 2π rotation period drives a positional bias on the weights, proportional to log(N_params). Even random tokens show this scaling. ν is DERIVED — not fitted (empirical err 0.3%).",
+"help.v04.invariant": "<strong>Chinchilla-attention invariant K</strong>: γ × log(N²·D) ≈ 51.2 ± 16.8 (CV=0.329). Connects compute scaling and the attention exponent into a single dimensionless number.",
+"help.v04.ih_probe": "<strong>Δγ as IH probe</strong>: sign(γ_text − γ_random) > 0 ⟺ post-induction-head. Cheaper than running an in-context-learning benchmark.",
+"help.v04.constants": "<strong>γ-cluster on famous constants</strong> (intriguing, n=4): CodeLlama-13b γ=0.382 ≈ 1−1/φ (golden conjugate, err 0.0003); pythia-1.4b γ=0.705 ≈ 1/√2; Llama-2-7b γ=0.287 ≈ 1−1/√2; Mistral-Nemo γ=0.428 ≈ log_10(e). Caveat: could be coincidence.",
 "help.param.theta": "<strong>θ (rope_theta)</strong>: RoPE base frequency. Higher = more long-range capacity. Typical: 10000 (early), 500000 (Llama-3), 1000000 (Qwen2.5).",
 "help.param.T_train": "<strong>T_train</strong>: max context the model was trained on. From <code>max_position_embeddings</code>.",
 "help.param.T_eval": "<strong>T_eval</strong>: <em>your target</em> inference context length. The key knob.",
@@ -368,7 +379,7 @@ export const TRANSLATIONS = {
 "help.modes.compare": "<strong>🆚 Comparar</strong>: 2-3 modelos lado a lado en la misma receta. Mejor al elegir entre candidatos.",
 "help.modes.ask": "<strong>💬 Pregunta libre</strong>: pregunta en lenguaje natural, el LLM del navegador elige la receta. Mejor para exploración casual.",
 "help.modes.recipe": "<strong>📋 Receta + formulario</strong>: selección manual, control total de parámetros. Mejor cuando quieres control exacto.",
-"help.recipes.title": "Las
+"help.recipes.title": "Las 8 recetas disponibles",
 "help.recipe.x1.title": "<strong>X-1 Entrenamiento custom vs API</strong> — compara coste de entrenar tu propio modelo vs pagar API.",
 "help.recipe.x1.example": "Prueba: <em>\"¿Entrenar 8B custom o usar GPT-4o para 50M tokens/mes?\"</em><br>Respuestas: SÍ (custom) / NO (API) con meses para break-even.",
 "help.recipe.x2.title": "<strong>X-2 Viabilidad contexto largo</strong> — predice si un modelo sirve longitud objetivo de manera fiable.",
@@ -378,6 +389,17 @@ export const TRANSLATIONS = {
 "help.recipe.x5.title": "<strong>X-5 Selección hardware</strong> — ¿qué GPU usar para servir al throughput objetivo?",
 "help.recipe.x5.example": "Prueba: <em>\"Hardware más barato para servir Llama-3-8B a 10M tokens/día\"</em><br>Respuesta: mejor GPU + $/Mtok + capacidad vs objetivo.",
 "help.recipe.x19.title": "<strong>X-19 Decisión compresión KV</strong> — ¿usar soft decay, hard cutoff, o métodos de literatura?",
+"help.recipe.x21.title": "<strong>X-21 Diagnóstico Pureza Imprint</strong> — predice γ sobre tokens RANDOM via ν=−1/(2π); ¿cuán limpia es la predicción RoPE del modelo?",
+"help.recipe.x22.title": "<strong>X-22 Invariante Compute-Context</strong> — ¿γ × log(N²·D) está en banda 51.2 ± 16.8? Detecta anomalías de scaling/training.",
+"help.recipe.x23.title": "<strong>X-23 Detector Fase IH</strong> — ¿pre- o post-induction-head? Probe barato via sign(γ_text − γ_random).",
+"help.recipe.x21.example": "Prueba: <em>«¿Cuán limpia es la predicción RoPE en Llama-3-8B?»</em><br>Respuesta: γ_random predicho + diagnóstico (CLEAN / OVER-IMPRINTED / UNDER-IMPRINTED).",
+"help.recipe.x22.example": "Prueba: <em>«¿Mistral-7B entra en el invariante compute-context?»</em><br>Respuesta: K = γ·log(N²·D), z-score, IN-BAND u OUTLIER.",
+"help.recipe.x23.example": "Prueba: <em>«¿Qwen2.5-7B es post-induction-head?»</em><br>Respuesta: CONFIRMED PRE-IH / CONFIRMED POST-IH / ANOMALY (chequeo consistencia tamaño vs Δγ).",
+"help.section.v04": "<strong>Novedades v0.4</strong> (hallazgos sesión 29 del 2026-04-28): tres recetas diagnósticas derivadas del análisis panel cross-model (n=22 LLMs).",
+"help.v04.imprint": "<strong>Slope imprint aprendido ν = −1/(2π)</strong>: el periodo de rotación RoPE 2π provoca un sesgo posicional en los pesos, proporcional a log(N_params). Incluso tokens random muestran este scaling. ν es DERIVADO — no ajustado (err empírico 0.3%).",
+"help.v04.invariant": "<strong>Invariante Chinchilla-atención K</strong>: γ × log(N²·D) ≈ 51.2 ± 16.8 (CV=0.329). Conecta compute scaling y exponente de atención en un solo número adimensional.",
+"help.v04.ih_probe": "<strong>Δγ como probe IH</strong>: sign(γ_text − γ_random) > 0 ⟺ post-induction-head. Más barato que correr un benchmark in-context-learning.",
+"help.v04.constants": "<strong>γ-cluster en constantes famosas</strong> (intrigante, n=4): CodeLlama-13b γ=0.382 ≈ 1−1/φ (conjugado áureo, err 0.0003); pythia-1.4b γ=0.705 ≈ 1/√2; Llama-2-7b γ=0.287 ≈ 1−1/√2; Mistral-Nemo γ=0.428 ≈ log_10(e). Caveat: podría ser coincidencia.",
 "help.recipe.x19.example": "Prueba: <em>\"¿Cómo comprimir caché KV para Qwen2.5-7B a 32K?\"</em><br>Respuesta: USE SOFT DECAY / USE D_f CUTOFF / USE LITERATURE METHODS / USE HARD T_train.",
 "help.param.theta": "<strong>θ (rope_theta)</strong>: frecuencia base RoPE. Mayor = más capacidad de largo alcance. Típico: 10000 (modelos antiguos), 500000 (Llama-3), 1000000 (Qwen2.5).",
 "help.param.T_train": "<strong>T_train</strong>: contexto máximo que vio el modelo durante entrenamiento. De <code>max_position_embeddings</code>.",
@@ -565,7 +587,7 @@ export const TRANSLATIONS = {
 "help.modes.compare": "<strong>🆚 Comparer</strong>: 2-3 modèles côte à côte sur la même recette. Mieux pour choisir entre candidats.",
 "help.modes.ask": "<strong>💬 Question libre</strong>: question en langage naturel, le LLM du navigateur choisit la recette. Mieux pour exploration casuelle.",
 "help.modes.recipe": "<strong>📋 Recette + formulaire</strong>: sélection manuelle, contrôle total des paramètres. Mieux quand vous voulez un contrôle exact.",
-"help.recipes.title": "Les
+"help.recipes.title": "Les 8 recettes disponibles",
 "help.recipe.x1.title": "<strong>X-1 Entraînement custom vs API</strong> — compare le coût d'entraîner votre propre modèle vs payer l'accès API.",
 "help.recipe.x1.example": "Essayez: <em>« Dois-je entraîner un 8B custom ou utiliser GPT-4o pour 50M tokens/mois ? »</em><br>Réponses: OUI (custom) / NON (API) avec mois pour break-even.",
 "help.recipe.x2.title": "<strong>X-2 Viabilité contexte long</strong> — prédit si un modèle sert une longueur cible de manière fiable.",
@@ -575,7 +597,18 @@ export const TRANSLATIONS = {
 "help.recipe.x5.title": "<strong>X-5 Sélection hardware</strong> — quel GPU utiliser pour servir au throughput cible ?",
 "help.recipe.x5.example": "Essayez: <em>« Hardware le moins cher pour servir Llama-3-8B à 10M tokens/jour »</em><br>Réponse: meilleur GPU + $/Mtok + capacité vs cible.",
 "help.recipe.x19.title": "<strong>X-19 Décision compression KV</strong> — utiliser soft decay, hard cutoff, ou méthodes de littérature ?",
+"help.recipe.x21.title": "<strong>X-21 Diagnostic Pureté Imprint</strong> — prédit γ sur tokens RANDOM via ν=−1/(2π); à quel point la prédiction RoPE du modèle est-elle propre ?",
+"help.recipe.x22.title": "<strong>X-22 Invariant Compute-Context</strong> — γ × log(N²·D) est-il dans la bande 51.2 ± 16.8 ? Détecte anomalies de scaling/training.",
+"help.recipe.x23.title": "<strong>X-23 Détecteur Phase IH</strong> — pré- ou post-induction-head ? Probe peu coûteux via sign(γ_text − γ_random).",
 "help.recipe.x19.example": "Essayez: <em>« Comment compresser le cache KV pour Qwen2.5-7B à 32K ? »</em><br>Réponse: USE SOFT DECAY / USE D_f CUTOFF / USE LITERATURE METHODS / USE HARD T_train.",
+"help.recipe.x21.example": "Essayez: <em>« Quelle est la pureté de la prédiction RoPE sur Llama-3-8B ? »</em><br>Réponse: γ_random prédit + diagnostic (CLEAN / OVER-IMPRINTED / UNDER-IMPRINTED).",
+"help.recipe.x22.example": "Essayez: <em>« Mistral-7B entre-t-il dans l'invariant compute-context ? »</em><br>Réponse: K = γ·log(N²·D), z-score, IN-BAND ou OUTLIER.",
+"help.recipe.x23.example": "Essayez: <em>« Qwen2.5-7B est-il post-induction-head ? »</em><br>Réponse: CONFIRMED PRE-IH / CONFIRMED POST-IH / ANOMALY.",
+"help.section.v04": "<strong>Nouveautés v0.4</strong> (résultats session 29, 2026-04-28) : trois recettes de diagnostic dérivées de l'analyse panel cross-model (n=22 LLMs).",
+"help.v04.imprint": "<strong>Pente d'imprint apprise ν = −1/(2π)</strong> : la période de rotation RoPE 2π entraîne un biais positionnel dans les poids, proportionnel à log(N_params). Même les tokens aléatoires montrent ce scaling. ν est DÉRIVÉ — non ajusté (erreur empirique 0,3 %).",
+"help.v04.invariant": "<strong>Invariant Chinchilla-attention K</strong> : γ × log(N²·D) ≈ 51.2 ± 16.8 (CV=0.329). Connecte le scaling de compute et l'exposant d'attention en un seul nombre sans dimension.",
+"help.v04.ih_probe": "<strong>Δγ comme probe IH</strong> : sign(γ_text − γ_random) > 0 ⟺ post-induction-head. Moins coûteux que de lancer un benchmark in-context-learning.",
+"help.v04.constants": "<strong>γ-cluster sur constantes célèbres</strong> (intriguant, n=4) : CodeLlama-13b γ=0.382 ≈ 1−1/φ (conjugué doré, err 0,0003) ; pythia-1.4b γ=0.705 ≈ 1/√2 ; Llama-2-7b γ=0.287 ≈ 1−1/√2 ; Mistral-Nemo γ=0.428 ≈ log_10(e). Caveat : peut être coïncidence.",
 "help.param.theta": "<strong>θ (rope_theta)</strong>: fréquence de base RoPE. Plus haut = plus de capacité longue portée. Typique: 10000 (anciens), 500000 (Llama-3), 1000000 (Qwen2.5).",
 "help.param.T_train": "<strong>T_train</strong>: contexte max vu par le modèle pendant l'entraînement. De <code>max_position_embeddings</code>.",
 "help.param.T_eval": "<strong>T_eval</strong>: <em>votre</em> longueur de contexte cible en inférence. Le bouton clé.",
@@ -762,7 +795,7 @@ export const TRANSLATIONS = {
 "help.modes.compare": "<strong>🆚 比较</strong>: 2-3 个模型在同一配方上并排。最适合在候选者之间选择。",
 "help.modes.ask": "<strong>💬 自由提问</strong>: 自然语言问题,浏览器 LLM 选择配方。最适合随意探索。",
 "help.modes.recipe": "<strong>📋 配方 + 表单</strong>: 手动选择,完全控制参数。最适合需要精确控制时。",
-"help.recipes.title": "可用的
+"help.recipes.title": "可用的 8 个配方",
 "help.recipe.x1.title": "<strong>X-1 自定义训练 vs API</strong> — 比较训练自己模型的成本与付费使用 API 的成本。",
 "help.recipe.x1.example": "尝试: <em>\"我应该训练 8B 自定义模型还是使用 GPT-4o 处理每月 50M tokens?\"</em><br>答案: 是 (自定义) / 否 (API),含损益平衡月数。",
 "help.recipe.x2.title": "<strong>X-2 长上下文可行性</strong> — 预测模型是否能可靠地服务目标上下文长度。",
@@ -772,7 +805,18 @@ export const TRANSLATIONS = {
 "help.recipe.x5.title": "<strong>X-5 硬件选择</strong> — 应该使用哪个 GPU 以达到目标吞吐量?",
 "help.recipe.x5.example": "尝试: <em>\"以每天 1000 万 tokens 提供 Llama-3-8B 的最便宜硬件\"</em><br>答案: 最佳 GPU + $/Mtok + 容量 vs 目标。",
 "help.recipe.x19.title": "<strong>X-19 KV 压缩决策</strong> — 应该使用 soft decay、hard cutoff 还是文献方法?",
+"help.recipe.x21.title": "<strong>X-21 Imprint 纯度诊断</strong> — 通过 ν=−1/(2π) 预测 RANDOM token 上的 γ;模型的 RoPE 预测有多干净?",
+"help.recipe.x22.title": "<strong>X-22 Compute-Context 不变量</strong> — γ × log(N²·D) 是否落在 51.2 ± 16.8 区间内?检测 scaling/training 异常。",
+"help.recipe.x23.title": "<strong>X-23 IH-Phase 检测器</strong> — 前- 还是后-induction-head?通过 sign(γ_text − γ_random) 进行廉价探测。",
 "help.recipe.x19.example": "尝试: <em>\"如何为 Qwen2.5-7B 在 32K 压缩 KV 缓存?\"</em><br>答案: USE SOFT DECAY / USE D_f CUTOFF / USE LITERATURE METHODS / USE HARD T_train.",
+"help.recipe.x21.example": "尝试: <em>\"Llama-3-8B 上的 RoPE 预测有多干净?\"</em><br>答案: 预测的 γ_random + 诊断 (CLEAN / OVER-IMPRINTED / UNDER-IMPRINTED)。",
+"help.recipe.x22.example": "尝试: <em>\"Mistral-7B 是否符合 compute-context 不变量?\"</em><br>答案: K = γ·log(N²·D)、z-score、IN-BAND 或 OUTLIER。",
+"help.recipe.x23.example": "尝试: <em>\"Qwen2.5-7B 是后-induction-head 吗?\"</em><br>答案: CONFIRMED PRE-IH / CONFIRMED POST-IH / ANOMALY。",
+"help.section.v04": "<strong>v0.4 新增</strong> (第 29 次研究会话, 2026-04-28): 来自 cross-model panel 分析 (n=22 LLMs) 的三个诊断 recipes。",
+"help.v04.imprint": "<strong>学习印记斜率 ν = −1/(2π)</strong>: RoPE 旋转周期 2π 在权重上引发位置偏置, 与 log(N_params) 成正比。即使 random token 也显示此 scaling。ν 是 DERIVED — 非拟合 (经验误差 0.3%)。",
+"help.v04.invariant": "<strong>Chinchilla-attention 不变量 K</strong>: γ × log(N²·D) ≈ 51.2 ± 16.8 (CV=0.329)。将 compute scaling 和 attention 指数连接为单一无量纲数。",
+"help.v04.ih_probe": "<strong>Δγ 作为 IH 探测</strong>: sign(γ_text − γ_random) > 0 ⟺ post-induction-head。比运行 in-context-learning 基准更便宜。",
+"help.v04.constants": "<strong>γ 簇落在著名常数上</strong> (有趣, n=4): CodeLlama-13b γ=0.382 ≈ 1−1/φ (黄金共轭, err 0.0003); pythia-1.4b γ=0.705 ≈ 1/√2; Llama-2-7b γ=0.287 ≈ 1−1/√2; Mistral-Nemo γ=0.428 ≈ log_10(e)。Caveat: 可能是巧合。",
 "help.param.theta": "<strong>θ (rope_theta)</strong>: RoPE 基础频率。越高 = 长程能力越强。典型: 10000 (早期),500000 (Llama-3),1000000 (Qwen2.5)。",
 "help.param.T_train": "<strong>T_train</strong>: 模型训练时的最大上下文。来自 <code>max_position_embeddings</code>。",
 "help.param.T_eval": "<strong>T_eval</strong>: <em>您的</em> 目标推理上下文长度。关键旋钮。",
python/taf_browser.py
@@ -99,6 +99,170 @@ def kv_soft_decay_regime(theta: float, gamma: float, T_train: int) -> str:
     return "use-hard-cutoff"
 
 
+# ════════════════════════════════════════════════════════════════════════════
+# §28 — Session 29 (2026-04-28): learned-imprint, F2 Chinchilla, Δγ-IH probe
+# ════════════════════════════════════════════════════════════════════════════
+NU_IMPRINT = -1.0 / (2 * math.pi)  # §28 — learned-imprint slope (DERIVED, n=22, err 0.3%)
+P_0_IMPRINT_M = 14.0               # baseline pythia-14m (smallest panel reference)
+
+
+def gamma_random_predict(theta: float, T_eval: int, n_params_M: float) -> float:
+    """§28.1 — Predicted γ on RANDOM-token input.
+
+    γ_random = γ_pade(θ,T) + ν · log_10(P / P_0), ν = -1/(2π) ≈ -0.1592.
+    Empirical: n=22 LLMs (session 29). Random-input γ scales with model size
+    despite RoPE-Padé predicting only (θ,T) dependence — the weights imprint
+    a learned positional bias proportional to log(N_params).
+
+    Predicted CI ≈ ±0.18 (95%).
+    """
+    g_pade = gamma_pade(theta, T_eval)
+    return g_pade + NU_IMPRINT * math.log10(max(n_params_M, 1e-3) / P_0_IMPRINT_M)
+
+
+def imprint_purity(gamma_random_obs: float, theta: float, T_eval: int,
+                   n_params_M: float) -> dict:
+    """§28.2 — Diagnostic: how clean is the model's RoPE-Padé prediction?
+
+    Compares observed γ_random to predicted (γ_pade + ν·log_10(P/P_0)).
+    Negative residual ⇒ extra-strong training imprint (less clean).
+    Positive ⇒ weaker-than-expected imprint (cleaner / less trained).
+    """
+    g_pred = gamma_random_predict(theta, T_eval, n_params_M)
+    g_pade_only = gamma_pade(theta, T_eval)
+    residual = gamma_random_obs - g_pred
+    return {
+        "gamma_random_obs": gamma_random_obs,
+        "gamma_random_pred": g_pred,
+        "gamma_pade_only": g_pade_only,
+        "imprint_predicted": g_pred - g_pade_only,
+        "imprint_residual": residual,
+        "purity": "clean (within CI)" if abs(residual) < 0.18 else
+                  ("over-imprinted" if residual < 0 else "under-imprinted"),
+        "ci_95_half_width": 0.18,
+    }
+
+
+def compute_invariant_K(gamma: float, n_params_M: float,
+                        D_tokens: float = None) -> dict:
+    """§29 — F2 Chinchilla compute-context invariant.
+
+    K = γ × log(N²·D), D = 20·N (Chinchilla compute-optimal) if not given.
+    Empirical: K ≈ 51.2 ± 16.8 (CV=0.329, n=22). In-distribution if K∈[34, 68].
+    """
+    N = n_params_M * 1e6
+    if D_tokens is None:
+        D_tokens = 20 * N
+    K = gamma * math.log(N * N * D_tokens)
+    panel_mean, panel_std = 51.2, 16.8
+    z = (K - panel_mean) / panel_std
+    return {
+        "K": K,
+        "panel_mean": panel_mean,
+        "panel_std": panel_std,
+        "z_score": z,
+        "in_distribution": abs(z) <= 1.0,
+        "interpretation": "in-band" if abs(z) <= 1.0 else
+                          ("high-K outlier" if z > 0 else "low-K outlier"),
+    }
+
+
+def ih_phase_check(gamma_text: float, gamma_random: float,
+                   n_params_M: float = None) -> dict:
+    """§30 — IH-formation phase discriminator.
+
+    sign(γ_text − γ_random) > 0 ⟺ post-IH (text concentrates more than random).
+    Pre-IH (P<400M, n=7):   ⟨Δγ⟩ = -0.19 ± 0.26
+    Post-IH (P≥400M, n=15): ⟨Δγ⟩ = +0.03 ± 0.26
+    """
+    delta = gamma_text - gamma_random
+    phase_observed = "post-IH" if delta > 0 else ("pre-IH" if delta < 0 else "ambiguous")
+    phase_expected = None
+    if n_params_M is not None:
+        phase_expected = "post-IH" if n_params_M * 1e6 >= 4e8 else "pre-IH"
+    consistent = (phase_expected is None) or (phase_observed == phase_expected)
+    return {
+        "delta_gamma": delta,
+        "phase_observed": phase_observed,
+        "phase_expected_by_size": phase_expected,
+        "consistent": consistent,
+        "panel_pre_IH_mean": -0.19,
+        "panel_post_IH_mean": +0.03,
+        "panel_std": 0.26,
+    }
+
+
+def gamma_decompose_v2(gamma_pade_val: float, n_params_M: float,
+                       has_GQA: bool = False, has_SWA: bool = False,
+                       corpus: str = "text", is_instruct: bool = False) -> dict:
+    """§28.3 — 6-axis decomposition (session 29 update with imprint axis).
+
+    γ_obs = γ_pade
+          + ν·log_10(P/P_0)·𝟙[corpus=random]   ← NEW imprint axis (DERIVED)
+          + Δ_corpus(text-rand)
+          + δ_arch(GQA, SWA)
+          + δ_circuit(IH phase)
+          + δ_train(steps, RLHF, instruct)
+          + ε
+    The imprint axis activates only on RANDOM input; TEXT input is dominated
+    by the corpus term.
+    """
+    delta_imprint = NU_IMPRINT * math.log10(max(n_params_M, 1e-3) / P_0_IMPRINT_M) \
+        if corpus == "random" else 0.0
+    delta_GQA = +0.11 if has_GQA else 0.0
+    delta_SWA = -0.21 if has_SWA else 0.0
+    delta_post_IH = -0.15 if n_params_M >= 400 else 0.0
+    delta_instruct = -0.10 if is_instruct else 0.0  # F9 tentative (n=3, p=0.06)
+    return {
+        "pade_centroid": gamma_pade_val,
+        "delta_imprint": delta_imprint,
+        "delta_GQA": delta_GQA,
+        "delta_SWA": delta_SWA,
+        "delta_post_IH": delta_post_IH,
+        "delta_instruct": delta_instruct,
+        "gamma_corrected": gamma_pade_val + delta_imprint + delta_GQA
+                           + delta_SWA + delta_post_IH + delta_instruct,
+        "corpus": corpus,
+        "axes": ["pade", "imprint", "GQA", "SWA", "IH", "instruct"],
+    }
+
+
+def famous_constant_proximity(gamma: float, tolerance: float = 0.01) -> dict:
+    """§31 — Detect proximity to famous constants in the γ-cluster (session 29).
+
+    Empirical hits (n=4 in panel):
+        CodeLlama-13b γ=0.3823 ≈ 1−1/φ     = 0.3820 (golden conjugate)
+        pythia-1.4b   γ=0.7051 ≈ 1/√2      = 0.7071
+        Llama-2-7b    γ=0.2871 ≈ 1−1/√2    = 0.2929
+        Mistral-Nemo  γ=0.4284 ≈ log_10(e) = 0.4343
+    Returns the nearest constants within tolerance (the hit list may be empty).
+    """
+    phi = (1 + math.sqrt(5)) / 2
+    constants = {
+        "1−1/φ (golden conjugate)": 1 - 1/phi,
+        "1/√2": 1 / math.sqrt(2),
+        "1−1/√2": 1 - 1/math.sqrt(2),
+        "log_10(e)": math.log10(math.e),
+        "1/π": 1 / math.pi,
+        "2/π": 2 / math.pi,
+        "1/φ": 1 / phi,
+        "ln(2)": math.log(2),
+        "z*_Cayley = (√17−3)/2": (math.sqrt(17) - 3) / 2,
+    }
+    hits = []
+    for name, val in constants.items():
+        err = abs(gamma - val)
+        if err <= tolerance:
+            hits.append({"constant": name, "value": val, "error": err})
+    hits.sort(key=lambda h: h["error"])
+    return {
+        "gamma": gamma,
+        "tolerance": tolerance,
+        "n_hits": len(hits),
+        "hits": hits[:3],
+        "caveat": "n=4 hits in panel; could be coincidence (continuous distribution)",
+    }
+
+
 # ════════════════════════════════════════════════════════════════════════════
 # §17 — Pre-training viability formulas
 # ════════════════════════════════════════════════════════════════════════════
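A quick self-consistency check on the §28 helpers above — a sketch, assuming the module's own `gamma_pade` and the functions just added are importable; the θ/T/P values are illustrative:

```python
# Hypothetical session: the imprint axis reported by gamma_decompose_v2 on
# corpus="random" must equal the shift that gamma_random_predict adds to γ_Padé.
g_pade = gamma_pade(500_000, 8192)                   # Llama-3-style θ, T
dec = gamma_decompose_v2(g_pade, n_params_M=8030, corpus="random")
pred = gamma_random_predict(500_000, 8192, 8030)
assert abs((g_pade + dec["delta_imprint"]) - pred) < 1e-9
```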
@@ -584,6 +748,172 @@ def run_recipe_x19(theta, T_train, T_eval, n_attention_heads, n_kv_heads,
     return _wrap("X-19", "KV compression decision", locals(), chain, verdict, reason, mit)
 
 
+# ─────────────────────────────────────────────────────────────────────
+# X-21 — Imprint Purity Diagnostic (session 29 — uses §28 ν=−1/(2π))
+# ─────────────────────────────────────────────────────────────────────
+def run_recipe_x21(theta, T_train, n_attention_heads, n_kv_heads,
+                   d_head, n_layers, n_params, T_eval=None,
+                   gamma_random_obs=None, **_unused):
+    """X-21: how clean is the model's RoPE-Padé prediction?
+
+    Predicts γ on RANDOM-token input via the learned-imprint formula:
+        γ_random = γ_pade(θ,T) + ν·log_10(P/14M),  ν = −1/(2π) ≈ −0.1592
+    If the user provides an observed γ_random, returns a purity diagnostic.
+    """
+    chain = []
+    if T_eval is None:
+        T_eval = T_train
+
+    # Step 1: γ_Padé baseline
+    g_pade = gamma_pade(theta, T_eval)
+    chain.append(_step(1, "§26.1", "γ_Padé", "(2θ-T√2)/(2θ+T√2)",
+                       {"theta": theta, "T_eval": T_eval}, g_pade,
+                       _phase_label(g_pade)))
+
+    # Step 2: predicted imprint shift
+    n_params_M = n_params / 1e6
+    imprint_shift = NU_IMPRINT * math.log10(max(n_params_M, 1e-3) / P_0_IMPRINT_M)
+    chain.append(_step(2, "§28.1", "Imprint shift", "ν·log_10(P/P_0), ν=−1/(2π)",
+                       {"P_M": n_params_M, "P_0_M": P_0_IMPRINT_M, "nu": NU_IMPRINT},
+                       imprint_shift,
+                       "Bigger model → stronger imprint (more negative shift)."))
+
+    # Step 3: predicted γ_random
+    g_pred = g_pade + imprint_shift
+    chain.append(_step(3, "§28.1", "γ_random predicted", "γ_pade + ν·log_10(P/P_0)",
+                       {"gamma_pade": g_pade, "imprint": imprint_shift}, g_pred,
+                       f"Predicted γ_random = {g_pred:.4f} ± 0.18 (95% CI)"))
+
+    # Step 4: purity diagnostic if an observed value is provided
+    if gamma_random_obs is not None:
+        purity = imprint_purity(gamma_random_obs, theta, T_eval, n_params_M)
+        chain.append(_step(4, "§28.2", "Imprint purity",
+                           "obs − pred (purity = within ±0.18)",
+                           {"gamma_random_obs": gamma_random_obs,
+                            "gamma_random_pred": g_pred},
+                           purity["imprint_residual"], purity["purity"]))
+        verdict = "CLEAN" if abs(purity["imprint_residual"]) < 0.18 else \
+            ("OVER-IMPRINTED" if purity["imprint_residual"] < 0 else "UNDER-IMPRINTED")
+        reason = (f"Residual γ_random_obs − γ_pred = {purity['imprint_residual']:+.4f}. "
+                  f"95% CI is ±0.18.")
+        mit = ("Models far from prediction may have anomalous training (e.g. heavy "
+               "fine-tuning, format conversion). Compare to the native checkpoint.")
+    else:
+        verdict = "PREDICTION ONLY"
+        reason = (f"Predicted γ_random = {g_pred:.4f}. Provide gamma_random_obs to "
+                  f"check purity (measure on RANDOM token sequences, e.g. via the E4 protocol).")
+        mit = ("To measure: run a 150-prompt forward pass on RANDOM-token sequences "
+               "across distances d=10..1000 and fit a power law. "
+               "(See https://github.com/karlesmarin/tafagent for the E4 protocol.)")
+
+    return _wrap("X-21", "Imprint Purity Diagnostic", locals(), chain,
+                 verdict, reason, mit)
+
+
+# ─────────────────────────────────────────────────────────────────────
+# X-22 — Compute-Context Invariant Check (session 29 — F2 Chinchilla)
+# ─────────────────────────────────────────────────────────────────────
+def run_recipe_x22(theta, T_train, n_params, gamma_obs, D_tokens=None,
+                   T_eval=None, **_unused):
+    """X-22: does the model lie in the empirical Chinchilla invariant band?
+
+    K = γ × log(N²·D), D = 20·N if not given.
+    Empirical: K ≈ 51.2 ± 16.8 (CV=0.329, n=22 panel).
+    """
+    chain = []
+    if T_eval is None:
+        T_eval = T_train
+
+    n_params_M = n_params / 1e6
+    if D_tokens is None:
+        D_tokens = 20 * n_params  # Chinchilla compute-optimal
+
+    # Step 1: K computation
+    inv = compute_invariant_K(gamma_obs, n_params_M, D_tokens)
+    chain.append(_step(1, "§29", "K = γ·log(N²·D)", "γ × ln(N²·D)",
+                       {"gamma": gamma_obs, "N": n_params, "D": D_tokens},
+                       inv["K"],
+                       f"K = {inv['K']:.2f} (panel mean {inv['panel_mean']:.1f} ± "
+                       f"{inv['panel_std']:.1f})"))
+
+    # Step 2: z-score interpretation
+    chain.append(_step(2, "§29", "z-score vs panel", "(K − μ)/σ",
+                       {"K": inv["K"], "mean": inv["panel_mean"],
+                        "std": inv["panel_std"]},
+                       inv["z_score"],
+                       inv["interpretation"]))
+
+    # Step 3: γ_pade comparison (anomaly test)
+    g_pade = gamma_pade(theta, T_eval)
+    pade_diff = gamma_obs - g_pade
+    chain.append(_step(3, "§26.1", "γ deviation from Padé", "γ_obs − γ_pade",
+                       {"gamma_obs": gamma_obs, "gamma_pade": g_pade}, pade_diff,
+                       "negative = anomaly (sub-Padé); positive = supra-Padé"))
+
+    if inv["in_distribution"]:
+        verdict = "IN-BAND"
+        reason = f"K = {inv['K']:.2f} within ±1σ of panel mean {inv['panel_mean']:.1f}."
+        mit = "Model conforms to the compute-context invariant. No action needed."
+    else:
+        verdict = "OUTLIER"
+        reason = (f"K = {inv['K']:.2f} ({inv['interpretation']}). "
+                  f"|z| = {abs(inv['z_score']):.2f} > 1.")
+        mit = ("High-K (over-concentrating attention for the given compute) or low-K "
+               "(under-using compute for attention concentration). Check tokenizer, "
+               "training recipe, fine-tuning history.")
+
+    return _wrap("X-22", "Compute-Context Invariant", locals(), chain,
+                 verdict, reason, mit)
+
+
+# ─────────────────────────────────────────────────────────────────────
+# X-23 — IH-Phase Detector (session 29 — F4 Δγ probe)
+# ─────────────────────────────────────────────────────────────────────
+def run_recipe_x23(n_params, gamma_text=None, gamma_random=None, **_unused):
+    """X-23: is this checkpoint pre- or post-induction-head formation?
+
+    Discriminator: sign(γ_text − γ_random) > 0 ⟺ post-IH.
+    Cheaper than an ICL benchmark for monitoring training trajectories.
+    """
+    chain = []
+    n_params_M = n_params / 1e6
+
+    # Step 1: size-based prediction
+    expected = "post-IH" if n_params >= 4e8 else "pre-IH"
+    chain.append(_step(1, "§30", "Size-based phase prediction",
+                       "P ≥ 400M ⇒ post-IH",
+                       {"n_params_M": n_params_M, "threshold_M": 400}, expected))
+
+    # Step 2: γ-based discrimination if both gammas are given
+    if gamma_text is not None and gamma_random is not None:
+        check = ih_phase_check(gamma_text, gamma_random, n_params_M)
+        chain.append(_step(2, "§30", "Δγ discriminator", "sign(γ_text − γ_random)",
+                           {"gamma_text": gamma_text, "gamma_random": gamma_random},
+                           check["delta_gamma"],
+                           f"observed phase: {check['phase_observed']}"))
+
+        if check["consistent"]:
+            verdict = f"CONFIRMED {check['phase_observed'].upper()}"
+            reason = (f"Δγ = {check['delta_gamma']:+.3f} sign matches the size prediction "
+                      f"({expected}).")
+            mit = "Phase confirmed. Use this checkpoint for downstream tasks accordingly."
+        else:
+            verdict = "ANOMALY"
+            reason = (f"Δγ = {check['delta_gamma']:+.3f} suggests {check['phase_observed']}, "
+                      f"but size predicts {expected}. Investigate.")
+            mit = ("Possible causes: incomplete training, anomalous fine-tuning, "
+                   "format conversion, tokenizer corruption (cf. F5 OLMo Δγ=0.30).")
+    else:
+        verdict = f"PREDICTED {expected.upper()}"
+        reason = (f"Only size given: P = {n_params_M:.0f}M. "
+                  f"Provide gamma_text + gamma_random to verify via the Δγ probe.")
+        mit = ("Run the E4 protocol with corpus=mongo and corpus=random; "
+               "compare γ values.")
+
+    return _wrap("X-23", "IH-Phase Detector", locals(), chain,
+                 verdict, reason, mit)
+
+
 # ════════════════════════════════════════════════════════════════════════════
 # Helpers
 # ════════════════════════════════════════════════════════════════════════════
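A hypothetical invocation of the new X-23 recipe; the γ values are made up, and the `verdict` key is assumed from the `_wrap` output shape used by the other recipes:

```python
result = run_recipe_x23(n_params=7_240_000_000,   # e.g. a 7B checkpoint
                        gamma_text=0.62,          # measured on natural text
                        gamma_random=0.54)        # measured on random tokens
print(result["verdict"])  # CONFIRMED POST-IH — Δγ=+0.08 agrees with P ≥ 400M
```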
@@ -669,6 +999,31 @@ RECIPES = {
     "category": "kv-compression",
     "uses_sections": ["§26", "§19"],
     },
+    "X-21": {
+        "name": "Imprint Purity Diagnostic",
+        "description": "How clean is the model's RoPE-Padé prediction? Predicts γ on RANDOM-token input via ν=−1/(2π).",
+        "fn": run_recipe_x21,
+        "params": ["theta", "T_train", "n_attention_heads", "n_kv_heads",
+                   "d_head", "n_layers", "n_params", "T_eval", "gamma_random_obs"],
+        "category": "diagnostic",
+        "uses_sections": ["§26", "§28"],
+    },
+    "X-22": {
+        "name": "Compute-Context Invariant",
+        "description": "Does γ × log(N²·D) lie in the panel band 51.2 ± 16.8? Detects training/scaling anomalies.",
+        "fn": run_recipe_x22,
+        "params": ["theta", "T_train", "n_params", "gamma_obs", "D_tokens", "T_eval"],
+        "category": "diagnostic",
+        "uses_sections": ["§26", "§29"],
+    },
+    "X-23": {
+        "name": "IH-Phase Detector",
+        "description": "Is this model pre- or post-induction-head? Cheap probe via sign(γ_text − γ_random).",
+        "fn": run_recipe_x23,
+        "params": ["n_params", "gamma_text", "gamma_random"],
+        "category": "diagnostic",
+        "uses_sections": ["§30"],
+    },
 }
 
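Dispatch works the same as for the existing recipes; a sketch, again assuming `_wrap` exposes a `verdict` key:

```python
recipe = RECIPES["X-22"]
out = recipe["fn"](theta=500_000, T_train=8192, n_params=8.0e9, gamma_obs=1.045)
print(out["verdict"])  # OUTLIER — K≈74.6, z≈1.39, matching the README example
```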