# 🌐 TAF Agent — Public Registry > Community-curated archive of TAF (Thermodynamic Attention Framework) analyses > for transformer LLMs. Submitted by users of [TAF Agent](https://karlesmarin.github.io/tafagent). This repository **stores no code**. It exists purely as a **public Issues board** where users of the TAF Agent web tool submit their model analyses for the community to verify, refute, comment on, or reuse. --- ## How it works 1. A user runs the [TAF Agent](https://karlesmarin.github.io/tafagent) on a model 2. They click **📤 Submit to registry** 3. A new GitHub Issue opens with the analysis pre-filled in this repo 4. The user reviews, optionally adds a comment, and clicks Submit 5. The analysis becomes a permanent public record --- ## Browsing - 📂 [All issues](https://github.com/karlesmarin/tafagent-registry/issues) — every submission ever made - 🟢 [Verified](https://github.com/karlesmarin/tafagent-registry/issues?q=label%3Averified) — marked as independently verified - 🔴 [Refuted](https://github.com/karlesmarin/tafagent-registry/issues?q=label%3Arefuted) — empirical measurement contradicts the prediction - 🔍 Search by **input hash** to find existing analyses for the same config: e.g. `#8d29feb8` finds all analyses for the same model+T_eval+arch params --- ## The hash system (deduplication) Every TAF analysis is hashed from its **canonical inputs**. Identical inputs (same model, same T_eval, same flags) always produce the same 8-character hex hash. Different inputs produce different hashes. This means: - **Searching `#a1b2c3d4`** finds all submissions for the exact same config - **Independent verification** of an existing analysis = comment on the existing issue (not a new one) - **Refutation** = reply with empirical evidence, the maintainers will add the `refuted` label - **No duplicate spam**: contributors are nudged to search before submitting --- ## What submissions look like Each issue follows the title pattern: ``` [TAF Profile] Meta-Llama-3-8B @ T=32000 #8d29feb8 [TAF X-2] Meta-Llama-3-8B → YES #a1b2c3d4 [TAF Compare] X-2 × 3 models #c5d6e7f8 ``` Body contains the verdict, key numbers, and a collapsible JSON of the full analysis chain. See any [recent issue](https://github.com/karlesmarin/tafagent-registry/issues) for examples. --- ## Contributing ### To submit an analysis Just run the [TAF Agent](https://karlesmarin.github.io/tafagent) and click **📤 Submit to registry**. The form pre-fills everything. ### To verify an existing analysis 1. Find an issue (search by hash if you know one, or browse) 2. Run the same analysis yourself 3. If your result matches → comment "✅ Verified — [evidence link / setup details]" 4. A maintainer will add the `verified` label ### To refute a prediction 1. Find an issue with a verdict you disagree with 2. Run the **actual measurement** (not just TAF prediction) — e.g. for Long-Context (X-2), run NIAH evaluation on real GPU 3. Comment with: - Your measurement value + std - Hardware + software setup (vLLM version, GPU, etc.) - Repro recipe (script or command) 4. A maintainer will add the `refuted` label and link to your evidence Refutations are first-class citizens here. The TAF framework is designed to be falsifiable — if a prediction is wrong, we want to know. ### To propose a new recipe Open an issue with title `[Proposal] X-NN — ` describing: - The practical question the recipe answers - The chain of formulas it would use - An example use case If the recipe is feasible, the maintainer adds it to the [TAF Agent codebase](https://github.com/karlesmarin/tafagent) and labels your issue `recipe-proposed`. ### To add a model preset Open an issue with title `[Preset] ` listing: - `rope_theta`, `max_position_embeddings`, `num_attention_heads`, `num_key_value_heads`, `head_dim`, `num_hidden_layers`, `n_params`, `has_SWA` - A link to the model's HuggingFace page These get bundled into the next release of TAF Agent. --- ## Labels - `verified` — analysis independently confirmed by another user - `refuted` — empirical measurement contradicts TAF prediction - `recipe-proposed` — request for a new TAF recipe - `preset-proposed` — request for a new model preset - `discussion` — ongoing community discussion (no consensus yet) - `question` — clarification request - `frontier` — recently published model (< 1 month old) being evaluated --- ## What we DON'T accept - Closed/proprietary model analyses without permission to share publicly - API keys, tokens, or credentials of any kind - Commercial advertisements or unrelated content - Submissions without input hash in title (suggests not from the official tool) --- ## Code of conduct - Be technical and specific. Disagreements are about the math, not people. - Refutations require evidence. Opinions don't count, measurements do. - Cite your sources (paper sections, GitHub commits, vendor docs). - Assume good faith. Most "wrong" submissions are misunderstandings, not bad actors. --- ## License Submissions are released under [CC0 (public domain dedication)](https://creativecommons.org/publicdomain/zero/1.0/) unless otherwise noted by the contributor. The TAF Agent code itself is [Apache-2.0](https://github.com/karlesmarin/tafagent/blob/main/LICENSE). --- ## Related - 🔬 [TAF Agent web tool](https://karlesmarin.github.io/tafagent) — the diagnostic itself - 📦 [TAF Agent source](https://github.com/karlesmarin/tafagent) — open source - 📄 [Underlying paper](https://zenodo.org/records/20314038) — Marin 2026, *Predicting How Transformers Attend* --- *Maintained by Carles Marin and the TAF community.*