Update model card
Browse files
README.md
CHANGED
|
@@ -48,9 +48,9 @@ interactions, distorted equilibrium geometries, and elementary
|
|
| 48 |
reactions, as well as a small amount of publicly available high-accuracy
|
| 49 |
data.
|
| 50 |
|
| 51 |
-
We demonstrate departure from the historical trade-off between
|
| 52 |
-
and efficiency is enabled by learning non-local representations
|
| 53 |
-
electronic structure directly from data, bypassing the need for
|
| 54 |
increasingly costly hand-engineered features. The Skala-1.1 functional
|
| 55 |
surpasses state-of-the-art hybrid functionals in accuracy across the
|
| 56 |
main-group chemistry benchmark set GMTKN55, which covers general
|
|
@@ -158,45 +158,45 @@ The following data is included in our training set:
|
|
| 158 |
Collection](https://arxiv.org/abs/2506.14492v5) (MSR-ACC).
|
| 159 |
Additionally the MSR-ACC subsets for larger TAEs (up to 9
|
| 160 |
non-hydrogen atoms), conformers, ionization potentials, electron
|
| 161 |
-
affinities, proton affinities, reaction paths and distorted
|
| 162 |
-
equilibrium structures were included. The labels for
|
| 163 |
are obtained with the W1w method and are part of the currently
|
| 164 |
unpublished subsets of the MSR-ACC.
|
| 165 |
|
| 166 |
**Atomic Data**
|
| 167 |
-
: Total energies, electron affinities and ionization potentials (up
|
| 168 |
-
triple ionization) for atoms, from H to Ar (excluding Li and Be
|
| 169 |
-
|
| 170 |
-
|
| 171 |
-
quadruple zeta (QZ) and pentuple zeta (5Z)
|
| 172 |
-
|
| 173 |
-
|
| 174 |
-
|
| 175 |
-
|
| 176 |
-
|
| 177 |
-
|
| 178 |
-
|
| 179 |
-
performed using a two-point extrapolation.
|
| 180 |
|
| 181 |
**Transition metal properties**
|
| 182 |
: Additional data for transition metal atoms and dimers, including
|
| 183 |
-
ionization potentials, spin splittings and dissociation energies.
|
| 184 |
The reference energies were obtained from literature.
|
| 185 |
|
| 186 |
**NCI-Atlas**
|
|
|
|
| 187 |
: Five datasets from the [NCI-Atlas collection of non-covalent
|
| 188 |
interactions](http://www.nciatlas.org/):
|
| 189 |
|
| 190 |
-
|
| 191 |
-
|
| 192 |
-
|
| 193 |
-
|
| 194 |
-
|
| 195 |
-
|
| 196 |
-
|
| 197 |
-
|
| 198 |
-
|
| 199 |
-
|
| 200 |
|
| 201 |
**GDB9**
|
| 202 |
: The graph data base with up to non-hydrogen atoms computed at
|
|
@@ -204,7 +204,7 @@ The following data is included in our training set:
|
|
| 204 |
|
| 205 |
**BH9**
|
| 206 |
: Reactions and barrier heights from [Prasad et. al 2021][prasad2021]
|
| 207 |
-
The data set was
|
| 208 |
non-hydrogen atoms.
|
| 209 |
|
| 210 |
**NCIBLIND**
|
|
@@ -226,10 +226,10 @@ The following data is included in our training set:
|
|
| 226 |
: Containing atomization energies of carbon
|
| 227 |
clusters from [Karton et al. 2009][karton2009].
|
| 228 |
|
| 229 |
-
For all training data
|
| 230 |
-
|
| 231 |
-
|
| 232 |
-
|
| 233 |
|
| 234 |
### Training procedure
|
| 235 |
|
|
@@ -237,42 +237,41 @@ version of the PySCF software package.
|
|
| 237 |
|
| 238 |
The training datapoints are preprocessed as follows.
|
| 239 |
|
| 240 |
-
- For each molecule the density and derived meta-GGA features are
|
| 241 |
-
computed
|
| 242 |
-
|
| 243 |
-
|
| 244 |
-
- Density fitting was not applied
|
| 245 |
-
- The density features were evaluated on an atom
|
| 246 |
grid of level 1.
|
| 247 |
-
- The radial
|
| 248 |
-
Gauss-
|
| 249 |
-
with Treutler
|
| 250 |
- The space-partitioning was performed with Becke partition and
|
| 251 |
Treutler-Ahlrichs radii adjustment, Stratmann-Scuseria-Frisch (SSF)
|
| 252 |
partition scheme, and Laqua-Kussmann-Ochsenfeld (LKO) partition
|
| 253 |
scheme.
|
| 254 |
- The angular grid points were pruned using the NWChem scheme.
|
| 255 |
-
- No density
|
| 256 |
-
|
| 257 |
|
| 258 |
#### Training hyperparameters
|
| 259 |
|
| 260 |
The training hyperparameter settings are detailed in the supplementary
|
| 261 |
-
of [Accurate and scalable exchange-correlation with deep
|
| 262 |
-
et al. 2025](https://arxiv.org/abs/2506.
|
| 263 |
-
includes the code to evaluate the
|
| 264 |
-
code.
|
| 265 |
|
| 266 |
#### Speeds, sizes, times
|
| 267 |
|
| 268 |
-
The training of
|
| 269 |
-
|
| 270 |
-
steps on a [ND A100 v4 series
|
| 271 |
VM](https://learn.microsoft.com/en-us/azure/virtual-machines/sizes/gpu-accelerated/ndasra100v4-series?tabs=sizebasic)
|
| 272 |
-
with 8 NVIDIA A100
|
| 273 |
-
|
| 274 |
|
| 275 |
-
The model checkpoints have ~
|
| 276 |
|
| 277 |
## Evaluation
|
| 278 |
|
|
@@ -290,33 +289,31 @@ We have evaluated our functional on several different benchmark sets:
|
|
| 290 |
CuAgAu83 from [Chan 2019][chan2019],
|
| 291 |
DAPd from [Author et. al 2020][dapd2020],
|
| 292 |
3d4dIPSS, TMB11, and TMD10 from [Liang et. al 2025][liang2025]
|
| 293 |
-
3. GMTKN55. A diverse and highly accurate dataset of general main
|
| 294 |
-
thermochemistry, kinetics and noncovalent
|
| 295 |
interactions from [Goerigk et. al 2017][goerigk2017]
|
| 296 |
4. Geometry optimization datasets: (a) CCse21, equilibrium structures,
|
| 297 |
-
bond lengths and bond angles from [Piccardo et. al 2015][piccardo2015]
|
| 298 |
-
(b) HMGB11, equilibrium structures
|
| 299 |
-
lengths from [Grimme et. al 2015][grimme2015]
|
| 300 |
-
(c) LMGB35, equilibrium structures
|
| 301 |
-
|
| 302 |
(d) W4-11-GEOM, equilibrium structures, bond
|
| 303 |
-
lengths and bond angles.
|
| 304 |
-
5. The
|
| 305 |
-
6. Conformer search benchmark dataset of 22 molecules spanning
|
| 306 |
-
|
| 307 |
[Grimme et al. 2019][grimme2019]
|
| 308 |
|
| 309 |
-
|
| 310 |
-
|
| 311 |
-
|
| 312 |
-
|
| 313 |
-
|
| 314 |
-
|
| 315 |
-
|
| 316 |
-
|
| 317 |
-
|
| 318 |
-
SCF with our model and compares its scaling behavior with respect to
|
| 319 |
-
system size with the scaling of traditional functionals.
|
| 320 |
|
| 321 |
The metrics for the different benchmark sets are:
|
| 322 |
|
|
@@ -327,13 +324,13 @@ The metrics for the different benchmark sets are:
|
|
| 327 |
of reaction r as calculated by a high-accuracy method from the W4
|
| 328 |
family (CCSDT(Q)/CBS to CCSDTQ56/CBS), and $\Delta E_r^\theta$ is
|
| 329 |
the prediction of the reaction energy difference using SCF
|
| 330 |
-
calculations with our functional
|
| 331 |
2. Weighted total mean absolute deviations 2 (WTMAD-2) in kcal/mol for
|
| 332 |
the GMTKN55 benchmark set
|
| 333 |
$\text{WTMAD-2} = \frac1{\sum^{55}_{i=1} N_i} \sum_{i=1}^{55} N_i \frac{56.84\text{ kcal/mol}}{\overline{|\Delta E|}_i} \text{MAE}_i$
|
| 334 |
Here $N_i$ is the number of reactions in subset *i*,
|
| 335 |
$\overline{|\Delta E|}_i$ is the average energy difference in subset
|
| 336 |
-
*i* in kcal/mol and $\text{MAE}_i$ is the mean absolute error in
|
| 337 |
kcal/mol for subset *i*.
|
| 338 |
3. For the geometry benchmark sets that report bond lengths, we measure
|
| 339 |
the absolute error in bond lengths in Angstrom, averaged over the
|
|
@@ -341,22 +338,19 @@ The metrics for the different benchmark sets are:
|
|
| 341 |
dataset. For the benchmark that also contains bond angles, we report
|
| 342 |
the absolute error of the angles, averaged over the number of bonds
|
| 343 |
and equilibrium structures in the dataset.
|
| 344 |
-
4.
|
| 345 |
-
|
| 346 |
-
|
| 347 |
-
|
| 348 |
-
|
| 349 |
-
$\mu_i^
|
| 350 |
-
|
| 351 |
-
|
| 352 |
-
|
| 353 |
-
|
| 354 |
-
|
| 355 |
-
|
| 356 |
-
|
| 357 |
-
$\text{Error}_i = \frac{|\vec{\mu}_i^\theta - \vec{\mu}_i^\text{ref}|}{max(\mu_i^\text{ref}, 1D)} \times 100\%$.
|
| 358 |
-
Using these errors we compute the RMSE as follows:
|
| 359 |
-
$\text{RMSE} = \sqrt{\frac{1}{N} \sum_{i=1}^N \text{Error}_i^2}$
|
| 360 |
5. We fit a power law of the form
|
| 361 |
$C(M) = \left(\frac{n(M)}{A}\right)^k$ to the 22 data points of the
|
| 362 |
test set where *C(M)* and *n(M)* are the computational cost and
|
|
@@ -366,50 +360,32 @@ The metrics for the different benchmark sets are:
|
|
| 366 |
|
| 367 |
### Evaluation results
|
| 368 |
|
| 369 |
-
|
| 370 |
-
|
| 371 |
-
|
| 372 |
-
|
| 373 |
-
benchmark set GMTKN55, which covers general-main group thermochemistry,
|
| 374 |
-
kinetics and noncovalent interactions, our model makes predictions
|
| 375 |
-
around 2.72 kcal/mol. This accuracy is better than state-of-the-art
|
| 376 |
range-separated hybrid functionals while only requiring runtimes typical
|
| 377 |
of semi-local DFT.
|
| 378 |
|
| 379 |
-
On the geometry optimization benchmarks
|
| 380 |
-
|
| 381 |
-
|
| 382 |
-
|
| 383 |
-
moment prediction with respect to reference values is better than
|
| 384 |
state-of-the-art range-separated hybrid functionals.
|
| 385 |
|
| 386 |
-
Finally,
|
| 387 |
-
asymptotic scaling behavior of a
|
| 388 |
-
prefactor of 3
|
| 389 |
|
| 390 |
## License
|
| 391 |
|
| 392 |
-
|
| 393 |
-
|
| 394 |
-
|
| 395 |
-
|
| 396 |
-
|
| 397 |
-
|
| 398 |
-
|
| 399 |
-
> to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
|
| 400 |
-
> copies of the Software, and to permit persons to whom the Software is
|
| 401 |
-
> furnished to do so, subject to the following conditions:
|
| 402 |
-
>
|
| 403 |
-
> The above copyright notice and this permission notice shall be included in all
|
| 404 |
-
> copies or substantial portions of the Software.
|
| 405 |
-
>
|
| 406 |
-
> THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
|
| 407 |
-
> IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
|
| 408 |
-
> FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
|
| 409 |
-
> AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
|
| 410 |
-
> LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
|
| 411 |
-
> OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
|
| 412 |
-
> SOFTWARE.
|
| 413 |
|
| 414 |
## Citation
|
| 415 |
|
|
@@ -421,7 +397,7 @@ version number as follows:
|
|
| 421 |
``` bibtex
|
| 422 |
@misc{luise2025,
|
| 423 |
title={Accurate and scalable exchange-correlation with deep learning},
|
| 424 |
-
author={Giulia Luise and Chin-Wei Huang and Thijs Vogels and Derk P. Kooi and Sebastian Ehlert and Stephanie Lanius and Klaas J. H. Giesbertz and Amir Karton and Deniz Gunceler and Megan Stanley and Wessel P. Bruinsma and Lin Huang and Xinran Wei and José Garrido Torres and Abylay Katbashev and Rodrigo Chavez Zavaleta and Bálint Máté and Sékou-Oumar Kaba and Roberto Sordillo and Yingrong Chen and David B. Williams-Young and Christopher M. Bishop and Jan Hermann and Rianne van den Berg and Paola Gori-Giorgi},
|
| 425 |
year={2025},
|
| 426 |
eprint={2506.14665},
|
| 427 |
archivePrefix={arXiv},
|
|
|
|
| 48 |
reactions, as well as a small amount of publicly available high-accuracy
|
| 49 |
data.
|
| 50 |
|
| 51 |
+
We demonstrate that departure from the historical trade-off between
|
| 52 |
+
accuracy and efficiency is enabled by learning non-local representations
|
| 53 |
+
of electronic structure directly from data, bypassing the need for
|
| 54 |
increasingly costly hand-engineered features. The Skala-1.1 functional
|
| 55 |
surpasses state-of-the-art hybrid functionals in accuracy across the
|
| 56 |
main-group chemistry benchmark set GMTKN55, which covers general
|
|
|
|
| 158 |
Collection](https://arxiv.org/abs/2506.14492v5) (MSR-ACC).
|
| 159 |
Additionally the MSR-ACC subsets for larger TAEs (up to 9
|
| 160 |
non-hydrogen atoms), conformers, ionization potentials, electron
|
| 161 |
+
affinities, proton affinities, reaction paths, and distorted
|
| 162 |
+
equilibrium structures were included. The labels for these data sets
|
| 163 |
are obtained with the W1w method and are part of the currently
|
| 164 |
unpublished subsets of the MSR-ACC.
|
| 165 |
|
| 166 |
**Atomic Data**
|
| 167 |
+
: Total energies, electron affinities, and ionization potentials (up
|
| 168 |
+
to triple ionization) for atoms, from H to Ar (excluding Li and Be
|
| 169 |
+
due to basis-set constraints). This data was produced in-house with
|
| 170 |
+
CCSD(T) by extrapolating to the complete basis set limit from
|
| 171 |
+
quadruple zeta (QZ) and pentuple zeta (5Z) calculations. The basis
|
| 172 |
+
sets used for H and He were aug-cc-pV(Q+d)Z and aug-cc-pV(5+d),
|
| 173 |
+
while for the remaining elements B--Ar the basis sets were
|
| 174 |
+
aug-cc-pCVQZ and aug-cc-pCV5Z. All basis sets were obtained from the
|
| 175 |
+
[Basis Set Exchange (BSE)](https://www.basissetexchange.org/).
|
| 176 |
+
Extrapolation of the correlation energy was performed by fitting a
|
| 177 |
+
$Z^{-3}$ expression, while the Hartree--Fock energy was extrapolated
|
| 178 |
+
using the two-point scheme of [Karton 2006][karton2006].
|
|
|
|
| 179 |
|
| 180 |
**Transition metal properties**
|
| 181 |
: Additional data for transition metal atoms and dimers, including
|
| 182 |
+
ionization potentials, spin splittings, and dissociation energies.
|
| 183 |
The reference energies were obtained from literature.
|
| 184 |
|
| 185 |
**NCI-Atlas**
|
| 186 |
+
|
| 187 |
: Five datasets from the [NCI-Atlas collection of non-covalent
|
| 188 |
interactions](http://www.nciatlas.org/):
|
| 189 |
|
| 190 |
+
- [D442x10](http://www.nciatlas.org/D442x10.html), dissociation
|
| 191 |
+
curves for dispersion-bound van der Waals complexes
|
| 192 |
+
- [SH250x10](http://www.nciatlas.org/SH250.html), dissociation
|
| 193 |
+
curves for sigma-hole-bound van der Waals complexes
|
| 194 |
+
- [R739x5](http://www.nciatlas.org/R739.html), compressed van der
|
| 195 |
+
Waals complexes
|
| 196 |
+
- [HB300SPXx10](http://www.nciatlas.org/HB300SPX.html), dissociation
|
| 197 |
+
curves for hydrogen-bound van der Waals complexes
|
| 198 |
+
- [IHB100x10](http://www.nciatlas.org/IHB100.html), dissociation
|
| 199 |
+
curves for ionic hydrogen-bound van der Waals complexes
|
| 200 |
|
| 201 |
**GDB9**
|
| 202 |
: The graph data base with up to non-hydrogen atoms computed at
|
|
|
|
| 204 |
|
| 205 |
**BH9**
|
| 206 |
: Reactions and barrier heights from [Prasad et. al 2021][prasad2021]
|
| 207 |
+
The data set was filtered for systems with up to ten
|
| 208 |
non-hydrogen atoms.
|
| 209 |
|
| 210 |
**NCIBLIND**
|
|
|
|
| 226 |
: Containing atomization energies of carbon
|
| 227 |
clusters from [Karton et al. 2009][karton2009].
|
| 228 |
|
| 229 |
+
For all training data, input density and derived meta-GGA features were
|
| 230 |
+
computed from density matrices of converged B3LYP SCF calculations
|
| 231 |
+
(def2-QZVP and ma-def2-QZVP basis sets) using a modified version of
|
| 232 |
+
PySCF.
|
| 233 |
|
| 234 |
### Training procedure
|
| 235 |
|
|
|
|
| 237 |
|
| 238 |
The training datapoints are preprocessed as follows.
|
| 239 |
|
| 240 |
+
- For each molecule, the density and derived meta-GGA features are
|
| 241 |
+
computed from the density matrix of a converged B3LYP SCF calculation
|
| 242 |
+
using a def2-QZVP or ma-def2-QZVP basis set in a modified version of
|
| 243 |
+
PySCF.
|
| 244 |
+
- Density fitting was not applied.
|
| 245 |
+
- The density features were evaluated on an atom-centered integration
|
| 246 |
grid of level 1.
|
| 247 |
+
- The radial quadrature was performed with Treutler-Ahlrichs,
|
| 248 |
+
Gauss-Chebyshev, Delley, or Mura-Knowles schemes based on Bragg atomic
|
| 249 |
+
radii with Treutler-based radii adjustment.
|
| 250 |
- The space-partitioning was performed with Becke partition and
|
| 251 |
Treutler-Ahlrichs radii adjustment, Stratmann-Scuseria-Frisch (SSF)
|
| 252 |
partition scheme, and Laqua-Kussmann-Ochsenfeld (LKO) partition
|
| 253 |
scheme.
|
| 254 |
- The angular grid points were pruned using the NWChem scheme.
|
| 255 |
+
- No density-based cutoff was applied; all grid points were retained for
|
| 256 |
+
training.
|
| 257 |
|
| 258 |
#### Training hyperparameters
|
| 259 |
|
| 260 |
The training hyperparameter settings are detailed in the supplementary
|
| 261 |
+
material of [Accurate and scalable exchange-correlation with deep
|
| 262 |
+
learning, Luise et al. 2025](https://arxiv.org/abs/2506.14492). This
|
| 263 |
+
repository only includes the code to evaluate the provided checkpoints,
|
| 264 |
+
not the training code.
|
| 265 |
|
| 266 |
#### Speeds, sizes, times
|
| 267 |
|
| 268 |
+
The training of the functional on the dataset described above took
|
| 269 |
+
approximately 48 hours for 1M steps on an [ND A100 v4 series
|
|
|
|
| 270 |
VM](https://learn.microsoft.com/en-us/azure/virtual-machines/sizes/gpu-accelerated/ndasra100v4-series?tabs=sizebasic)
|
| 271 |
+
with 8 NVIDIA A100 GPUs (80 GB each), 96 CPU cores, 880 GB RAM, and a 6
|
| 272 |
+
TB disk.
|
| 273 |
|
| 274 |
+
The model checkpoints have ~385k trainable parameters.
|
| 275 |
|
| 276 |
## Evaluation
|
| 277 |
|
|
|
|
| 289 |
CuAgAu83 from [Chan 2019][chan2019],
|
| 290 |
DAPd from [Author et. al 2020][dapd2020],
|
| 291 |
3d4dIPSS, TMB11, and TMD10 from [Liang et. al 2025][liang2025]
|
| 292 |
+
3. GMTKN55. A diverse and highly accurate dataset of general main-group
|
| 293 |
+
thermochemistry, kinetics, and noncovalent
|
| 294 |
interactions from [Goerigk et. al 2017][goerigk2017]
|
| 295 |
4. Geometry optimization datasets: (a) CCse21, equilibrium structures,
|
| 296 |
+
bond lengths, and bond angles from [Piccardo et. al 2015][piccardo2015];
|
| 297 |
+
(b) HMGB11, equilibrium structures and bond
|
| 298 |
+
lengths from [Grimme et. al 2015][grimme2015];
|
| 299 |
+
(c) LMGB35, equilibrium structures and bond lengths
|
| 300 |
+
from [Grimme et. al 2015][grimme2015]; and
|
| 301 |
(d) W4-11-GEOM, equilibrium structures, bond
|
| 302 |
+
lengths, and bond angles.
|
| 303 |
+
5. The dipole benchmark dataset from [Hait et al. 2018][hait2018]
|
| 304 |
+
6. Conformer search benchmark dataset of 22 molecules spanning 24 to
|
| 305 |
+
176 atoms, used for cost-scaling analysis, from
|
| 306 |
[Grimme et al. 2019][grimme2019]
|
| 307 |
|
| 308 |
+
These six benchmark types serve to measure different performance aspects
|
| 309 |
+
of the functional. Benchmarks 1 and 2 focus on the accuracy of predicted
|
| 310 |
+
reaction energies. Benchmark 3 evaluates general main-group
|
| 311 |
+
thermochemistry, kinetics, and noncovalent interactions. Benchmark 4
|
| 312 |
+
evaluates geometry optimization and convergence to reference equilibrium
|
| 313 |
+
structures. Benchmark 5 measures dipole moments, providing a proxy for
|
| 314 |
+
the quality of the self-consistent electron density produced by the SCF
|
| 315 |
+
procedure. Finally, benchmark 6 assesses computational cost scaling with
|
| 316 |
+
respect to system size.
|
|
|
|
|
|
|
| 317 |
|
| 318 |
The metrics for the different benchmark sets are:
|
| 319 |
|
|
|
|
| 324 |
of reaction r as calculated by a high-accuracy method from the W4
|
| 325 |
family (CCSDT(Q)/CBS to CCSDTQ56/CBS), and $\Delta E_r^\theta$ is
|
| 326 |
the prediction of the reaction energy difference using SCF
|
| 327 |
+
calculations with our functional.
|
| 328 |
2. Weighted total mean absolute deviations 2 (WTMAD-2) in kcal/mol for
|
| 329 |
the GMTKN55 benchmark set
|
| 330 |
$\text{WTMAD-2} = \frac1{\sum^{55}_{i=1} N_i} \sum_{i=1}^{55} N_i \frac{56.84\text{ kcal/mol}}{\overline{|\Delta E|}_i} \text{MAE}_i$
|
| 331 |
Here $N_i$ is the number of reactions in subset *i*,
|
| 332 |
$\overline{|\Delta E|}_i$ is the average energy difference in subset
|
| 333 |
+
*i* in kcal/mol, and $\text{MAE}_i$ is the mean absolute error in
|
| 334 |
kcal/mol for subset *i*.
|
| 335 |
3. For the geometry benchmark sets that report bond lengths, we measure
|
| 336 |
the absolute error in bond lengths in Angstrom, averaged over the
|
|
|
|
| 338 |
dataset. For the benchmark that also contains bond angles, we report
|
| 339 |
the absolute error of the angles, averaged over the number of bonds
|
| 340 |
and equilibrium structures in the dataset.
|
| 341 |
+
4. For the dipole benchmark, we follow the metrics defined in
|
| 342 |
+
[Hait et al. 2018][hait2018]. For molecules
|
| 343 |
+
(indexed by *i*) for which only the reference magnitude of the
|
| 344 |
+
dipole moment $\mu_i^{\text{ref}} = |{\vec\mu}_i^{\text{ref}}|$ is
|
| 345 |
+
provided, the error is defined as
|
| 346 |
+
$\text{Error}_i = \frac{\mu_i^\theta - \mu_i^\text{ref}}{\max(\mu_i^\text{ref}, 1D)} \times 100\%$,
|
| 347 |
+
where $\mu_i^{\theta} = |{\vec\mu}_i^{\theta}|$ is the predicted
|
| 348 |
+
magnitude and *D* denotes the unit of Debye. For molecules for which
|
| 349 |
+
the reference dipole vector $\vec{\mu}_i^\text{ref}$ is also
|
| 350 |
+
available, we instead compute
|
| 351 |
+
$\text{Error}_i = \frac{|\vec{\mu}_i^\theta - \vec{\mu}_i^\text{ref}|}{\max(\mu_i^\text{ref}, 1D)} \times 100\%$.
|
| 352 |
+
The RMSE is then
|
| 353 |
+
$\text{RMSE} = \sqrt{\frac{1}{N} \sum_{i=1}^N \text{Error}_i^2}$.
|
|
|
|
|
|
|
|
|
|
| 354 |
5. We fit a power law of the form
|
| 355 |
$C(M) = \left(\frac{n(M)}{A}\right)^k$ to the 22 data points of the
|
| 356 |
test set where *C(M)* and *n(M)* are the computational cost and
|
|
|
|
| 360 |
|
| 361 |
### Evaluation results
|
| 362 |
|
| 363 |
+
On W4-17, the Skala-1.1 functional predicts atomization energies at
|
| 364 |
+
chemical accuracy (~1 kcal/mol MAE). On GMTKN55, which covers general
|
| 365 |
+
main-group thermochemistry, kinetics, and noncovalent interactions, it
|
| 366 |
+
achieves a WTMAD-2 of 2.72 kcal/mol, surpassing state-of-the-art
|
|
|
|
|
|
|
|
|
|
| 367 |
range-separated hybrid functionals while only requiring runtimes typical
|
| 368 |
of semi-local DFT.
|
| 369 |
|
| 370 |
+
On the geometry optimization benchmarks, the functional converges to
|
| 371 |
+
reference equilibrium structures with errors comparable to a
|
| 372 |
+
range-separated hybrid functional. On the dipole prediction benchmark,
|
| 373 |
+
the error in dipole moment predictions is better than that of
|
|
|
|
| 374 |
state-of-the-art range-separated hybrid functionals.
|
| 375 |
|
| 376 |
+
Finally, the scaling results show that the Skala-1.1 functional exhibits
|
| 377 |
+
the asymptotic scaling behavior of a meta-GGA functional, with an
|
| 378 |
+
approximate prefactor of 3 relative to r2SCAN.
|
| 379 |
|
| 380 |
## License
|
| 381 |
|
| 382 |
+
:::: dropdown
|
| 383 |
+
MIT License
|
| 384 |
+
|
| 385 |
+
::: {.literalinclude lines="3-"}
|
| 386 |
+
../../LICENSE.txt
|
| 387 |
+
:::
|
| 388 |
+
::::
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 389 |
|
| 390 |
## Citation
|
| 391 |
|
|
|
|
| 397 |
``` bibtex
|
| 398 |
@misc{luise2025,
|
| 399 |
title={Accurate and scalable exchange-correlation with deep learning},
|
| 400 |
+
author={Giulia Luise and Chin-Wei Huang and Thijs Vogels and Derk P. Kooi and Sebastian Ehlert and Stephanie Lanius and Klaas J. H. Giesbertz and Amir Karton and Deniz Gunceler and Stefano Battaglia and Gregor N. C. Simm and P. Bernát Szabó and Megan Stanley and Wessel P. Bruinsma and Lin Huang and Xinran Wei and José Garrido Torres and Abylay Katbashev and Rodrigo Chavez Zavaleta and Bálint Máté and Sékou-Oumar Kaba and Roberto Sordillo and Yingrong Chen and David B. Williams-Young and Christopher M. Bishop and Jan Hermann and Rianne van den Berg and Paola Gori-Giorgi},
|
| 401 |
year={2025},
|
| 402 |
eprint={2506.14665},
|
| 403 |
archivePrefix={arXiv},
|