molecular-shadows-h4-v10

Direct observable regressor for fermionic shadow spectroscopy on linear-H4 / STO-3G. Predicts time-evolved expectation values of 120 Majorana observables (\Gamma_\mu(t) = e^{iH(R)t}\Gamma_\mu e^{-iH(R)t}) as a function of equal nearest-neighbor bond length (R) and time (t), to feed downstream shadow-spectroscopy post-processing.

Heads-up. v10 H4 has uneven accuracy across the PES β€” strong at (R \geq 1.5) Γ… but degraded at (R < 1.0) Γ… where low-lying singlet avoided crossings drive non-analytic eigenvector rotation. Use this model with awareness of the short-R regime; see "Known limitations" below.

Architecture (v10)

(R, t) + HF orbital energies Ξ΅(R)
      β”‚
      β”œβ”€β”€ freq_net(Ξ΅(R))  β†’ 256 learnable Fourier frequencies Ο‰_k(R)
      β”‚
      β”œβ”€β”€ Fourier features [sin(Ο‰_k t), cos(Ο‰_k t)]  (256 Γ— 2 = 512 features)
      β”‚
      └── Trunk MLP: input [R, fourier]  β†’ 6 layers Γ— 768 hidden β†’ 120 outputs
Hyperparameter Value
n_observables 120 (k=1 Majorana operators on 8 spin-orbitals)
n_fourier 256
trunk depth Γ— width 6 Γ— 768
freq_net depth Γ— width 3 Γ— 128
n_orb_features 4 (HF spatial-orbital energies of H4/STO-3G)
conditioned_frequencies True
adaptive_bandwidth False (v10)
activation GELU
Parameter count ~14 M

Held-out evaluation

50 held-out geometries on the dense (R \in [0.5, 3.0]) Γ… grid (251 total).

R bin (Γ…) pearson_mean (approx)
0.5–1.0 ~0.40 (short-R weak)
1.0–1.5 ~0.60
1.5–2.0 ~0.90
2.0–3.0 >0.95

Aggregate: mean Pearson 0.834 / median 0.978 across all 50 held-out geometries. See eval_results.json for per-R numbers.

Inputs / outputs

  • Input. (R, t) β€” equal nearest-neighbor bond length in Γ… (linear chain geometry: H atoms at 0, R, 2R, 3R) and propagation time in a.u.
  • Output. Length-120 vector of expectation values (\langle\psi_0(R)|\Gamma_\mu(t)|\psi_0(R)\rangle) for k=1 Majorana observables on H4/STO-3G's 8 spin-orbital JW encoding.
  • Valid range. (R \in [0.5, 3.0]) Γ…, (t \in [0, 300]) a.u. Recommended high-confidence range: (R \geq 1.5) Γ….

Quickstart

from inference import MolecularShadowsRegressor
import numpy as np

m = MolecularShadowsRegressor.from_hub(
    "aniketdesh/molecular-shadows-h4-v10",
    revision="v10",
    token="hf_...",
)

t_grid = np.linspace(0, 300, 1500)
y = m.predict_trajectory(R=1.8, t_grid=t_grid)   # (1500, 120)

Known limitations

Short-R (R < 1.0 Γ…) accuracy is structurally weaker. Linear H4 has a multi-reference singlet manifold whose eigenvectors rotate near-discontinuously through avoided crossings as the chain compresses. The current freq_net correctly tracks energy-gap motion (eigenvalues are smooth), but the trunk struggles to encode the rapid amplitude rotation that lives in the eigenvector sector. H2 v10 (single-reference, no avoided crossings) confirms the recipe is sound on simpler chemistry β€” the bottleneck is H4-specific.

Resource-experiment guidance: trust v10 H4 most strongly for (R \geq 1.5) Γ…; treat short-R predictions as exploratory. A v11 release with adaptive Fourier bandwidth is in development to address part of this.

Files in this repo

File Purpose
regressor.pt torch payload (state_dict + config + R/t grids)
observable_regressor.py architecture
inference.py loader
orbital_energies.npz R-grid + HF orbital-energy table
eval_results.json per-R held-out metrics (50 geoms)
eval_summary.json aggregate
history.json training curves
README.md this file

Versioning

  • v10 (current): HF orbital-energy freq_net + dense R-grid + 6Γ—768 trunk. Mean Pearson 0.834 (Rβ‰₯1.5 Γ… strong, R<1.0 Γ… weak).
  • Future versions will be pushed as new commits with new tags. Pin via revision="v10" to preserve loading across architecture changes.

Citation

Method: matchgate-shadow spectroscopy following arXiv:2212.11036 and matchgate-shadow theory.

License

MIT.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Paper for aniketdesh/molecular-shadows-h4-v10