mindbomber
/

aana

@@ -63,6 +63,42 @@ S = (f_theta, E_phi, R, Pi_psi, G)
 The goal is not to claim perfect alignment. The goal is to make deployment-time
 correctability, evidence, gating, and auditability explicit.
 ## Try AANA
 Use the public Hugging Face Space as the quickest way to try the AANA gate with

 The goal is not to claim perfect alignment. The goal is to make deployment-time
 correctability, evidence, gating, and auditability explicit.
+## Head-to-Head Finding
+Across two public agent/tool-call sources, the strongest repeated signal is:
+> AANA improves agent action reliability by combining structured pre-tool-call
+> contracts, verifier gates, and evidence-recovery loops. In these diagnostics,
+> AANA preserves unsafe-action recall while recovering more safe actions than
+> permissive agents, single classifiers, prompt-only guards, LLM judges, or
+> static contract gates.
+Summary:
+| Source | Architecture | Accuracy | Unsafe recall | Safe allow | FP | FN |
+| --- | --- | ---: | ---: | ---: | ---: | ---: |
+| Qwen traces | Permissive agent | `50.00%` | `0.00%` | `100.00%` | `0` | `180` |
+| Qwen traces | Single classifier | `50.00%` | `100.00%` | `0.00%` | `180` | `0` |
+| Qwen traces | Prompt-only guardrail | `81.67%` | `96.67%` | `66.67%` | `60` | `6` |
+| Qwen traces | LLM-as-judge | `73.33%` | `100.00%` | `46.67%` | `96` | `0` |
+| Qwen traces | Contract gate, no recovery | `92.78%` | `100.00%` | `85.56%` | `26` | `0` |
+| Qwen traces | AANA with recovery | `100.00%` | `100.00%` | `100.00%` | `0` | `0` |
+| Hermes traces | Permissive agent | `50.00%` | `0.00%` | `100.00%` | `0` | `180` |
+| Hermes traces | Single classifier | `50.00%` | `100.00%` | `0.00%` | `180` | `0` |
+| Hermes traces | Prompt-only guardrail | `93.06%` | `97.22%` | `88.89%` | `20` | `5` |
+| Hermes traces | LLM-as-judge | `85.28%` | `99.44%` | `71.11%` | `52` | `1` |
+| Hermes traces | Contract gate, no recovery | `92.22%` | `100.00%` | `84.44%` | `28` | `0` |
+| Hermes traces | AANA with recovery | `100.00%` | `100.00%` | `100.00%` | `0` | `0` |
+Evidence tiers matter. PIIMB is an official external benchmark submission.
+The Qwen and Hermes head-to-heads use public datasets with reproducible
+transforms and policy-derived labels, not human-reviewed safety labels. Local
+blind action-gate runs are useful development ablations but weaker external
+validity evidence.
+Public summary:
+https://mindbomber.github.io/Alignment-Aware-Neural-Architecture--AANA-/aana-head-to-head-findings.md
 ## Try AANA
 Use the public Hugging Face Space as the quickest way to try the AANA gate with