awvwgk commited on
Commit
7e396e2
·
verified ·
1 Parent(s): fcdbc33

Update model card

Browse files
Files changed (1) hide show
  1. README.md +108 -132
README.md CHANGED
@@ -48,9 +48,9 @@ interactions, distorted equilibrium geometries, and elementary
48
  reactions, as well as a small amount of publicly available high-accuracy
49
  data.
50
 
51
- We demonstrate departure from the historical trade-off between accuracy
52
- and efficiency is enabled by learning non-local representations of
53
- electronic structure directly from data, bypassing the need for
54
  increasingly costly hand-engineered features. The Skala-1.1 functional
55
  surpasses state-of-the-art hybrid functionals in accuracy across the
56
  main-group chemistry benchmark set GMTKN55, which covers general
@@ -158,45 +158,45 @@ The following data is included in our training set:
158
  Collection](https://arxiv.org/abs/2506.14492v5) (MSR-ACC).
159
  Additionally the MSR-ACC subsets for larger TAEs (up to 9
160
  non-hydrogen atoms), conformers, ionization potentials, electron
161
- affinities, proton affinities, reaction paths and distorted
162
- equilibrium structures were included. The labels for this data sets
163
  are obtained with the W1w method and are part of the currently
164
  unpublished subsets of the MSR-ACC.
165
 
166
  **Atomic Data**
167
- : Total energies, electron affinities and ionization potentials (up to
168
- triple ionization) for atoms, from H to Ar (excluding Li and Be
169
- because of basis set constraints).This data was produced in-house
170
- with CCSD(T) by extrapolating to the complete basis set limit from
171
- quadruple zeta (QZ) and pentuple zeta (5Z) basis set
172
- calculations.The basis sets used for H and He were aug-cc-pV(Q+d)Z,
173
- aug-cc-pV(5+d), while for the remaining elements B-Ar the basis sets
174
- used were aug-cc-pCVQZ and aug-cc-pCV5Z. All basis sets were obtained
175
- from the [Basis Set Exchange
176
- (BSE)](https://www.basissetexchange.org/). Extrapolation of the
177
- correlation energy was performed by fitting a simple Z\^(-3)
178
- expression, while extrapolation of the Hartree-Fock energy was
179
- performed using a two-point extrapolation.
180
 
181
  **Transition metal properties**
182
  : Additional data for transition metal atoms and dimers, including
183
- ionization potentials, spin splittings and dissociation energies.
184
  The reference energies were obtained from literature.
185
 
186
  **NCI-Atlas**
 
187
  : Five datasets from the [NCI-Atlas collection of non-covalent
188
  interactions](http://www.nciatlas.org/):
189
 
190
- - [D442x10](http://www.nciatlas.org/D442x10.html), dissociation
191
- curves for dispersion bound van-der-Waals complexes
192
- - [SH250x10](http://www.nciatlas.org/SH250.html), dissociation
193
- curves for sigma-hole bound van-der-Waals complexes
194
- - [R739x5](http://www.nciatlas.org/R739.html), compressed
195
- van-der-Waals complexes
196
- - [HB300SPXx10](http://www.nciatlas.org/HB300SPX.html), dissociation
197
- curves for hydrogen bound van-der-Waals complexes
198
- - [IHB100x10](http://www.nciatlas.org/IHB100.html), dissociation
199
- curves for ionic hydrogen bound van-der-Waals complexes
200
 
201
  **GDB9**
202
  : The graph data base with up to non-hydrogen atoms computed at
@@ -204,7 +204,7 @@ The following data is included in our training set:
204
 
205
  **BH9**
206
  : Reactions and barrier heights from [Prasad et. al 2021][prasad2021]
207
- The data set was filted for systems with up to ten
208
  non-hydrogen atoms.
209
 
210
  **NCIBLIND**
@@ -226,10 +226,10 @@ The following data is included in our training set:
226
  : Containing atomization energies of carbon
227
  clusters from [Karton et al. 2009][karton2009].
228
 
229
- For all training data we have created input density and derived meta-GGA
230
- features using density matrices of converged SCF calculations with the
231
- B3LYP functional (def2-QZVP and ma-def2-QZVP basis set) using a modified
232
- version of the PySCF software package.
233
 
234
  ### Training procedure
235
 
@@ -237,42 +237,41 @@ version of the PySCF software package.
237
 
238
  The training datapoints are preprocessed as follows.
239
 
240
- - For each molecule the density and derived meta-GGA features are
241
- computed based on the density matrix of converged SCF calculations
242
- with the B3LYP functional using a def2-QZVP or ma-def2-QZVP basis set
243
- using a modified version of the PySCF software package.
244
- - Density fitting was not applied for the SCF calculation.
245
- - The density features were evaluated on an atom centered integration
246
  grid of level 1.
247
- - The radial integral was performed with the Treutler-Ahlrichs,
248
- Gauss-Chebychev, Delley, or Mura-Knowles based on Bragg atomic radii
249
- with Treutler based radii adjustment.
250
  - The space-partitioning was performed with Becke partition and
251
  Treutler-Ahlrichs radii adjustment, Stratmann-Scuseria-Frisch (SSF)
252
  partition scheme, and Laqua-Kussmann-Ochsenfeld (LKO) partition
253
  scheme.
254
  - The angular grid points were pruned using the NWChem scheme.
255
- - No density based cutoff was applied and all grid points were retained
256
- for training.
257
 
258
  #### Training hyperparameters
259
 
260
  The training hyperparameter settings are detailed in the supplementary
261
- of [Accurate and scalable exchange-correlation with deep learning, Luise
262
- et al. 2025](https://arxiv.org/abs/2506.14492v6). This repository only
263
- includes the code to evaluate the checkpoints provided, not the training
264
- code.
265
 
266
  #### Speeds, sizes, times
267
 
268
- The training of our functional using the training dataset as detailed in
269
- the section \"Training data\" took approximately 48h for 1M training
270
- steps on a [ND A100 v4 series
271
  VM](https://learn.microsoft.com/en-us/azure/virtual-machines/sizes/gpu-accelerated/ndasra100v4-series?tabs=sizebasic)
272
- with 8 NVIDIA A100 GPU with 80 GB memory each, 96 CPU cores, 880 GB RAM,
273
- and a 6 TB disk.
274
 
275
- The model checkpoints have ~385,217 trainable parameters.
276
 
277
  ## Evaluation
278
 
@@ -290,33 +289,31 @@ We have evaluated our functional on several different benchmark sets:
290
  CuAgAu83 from [Chan 2019][chan2019],
291
  DAPd from [Author et. al 2020][dapd2020],
292
  3d4dIPSS, TMB11, and TMD10 from [Liang et. al 2025][liang2025]
293
- 3. GMTKN55. A diverse and highly accurate dataset of general main group
294
- thermochemistry, kinetics and noncovalent
295
  interactions from [Goerigk et. al 2017][goerigk2017]
296
  4. Geometry optimization datasets: (a) CCse21, equilibrium structures,
297
- bond lengths and bond angles from [Piccardo et. al 2015][piccardo2015]
298
- (b) HMGB11, equilibrium structures, bond
299
- lengths from [Grimme et. al 2015][grimme2015]
300
- (c) LMGB35, equilibrium structures, bond lengths,
301
- and from [Grimme et. al 2015][grimme2015]
302
  (d) W4-11-GEOM, equilibrium structures, bond
303
- lengths and bond angles.
304
- 5. The Dipole benchmark dataset from [Hait et al. 2018][hait2018]
305
- 6. Conformer search benchmark dataset of 22 molecules spanning
306
- molecular size from 24 to 176 atoms for cost scaling from
307
  [Grimme et al. 2019][grimme2019]
308
 
309
- The evaluation of our model using the 5 different types of benchmarks as
310
- defined above serve to measure different performance aspects of our
311
- functional. For example, 1 and 2 focus on the accuracy of predicted
312
- reaction energies, and 3 focuses on the ability of our functional to
313
- perform geometry optimization and to converge to the right equilibrium
314
- molecular structure. Furthermore, 4 measures the dipole moment of the
315
- molecules in the benchmark set, which provides a measure for the quality
316
- of the self-consistent electron density that a converged SCF procedure
317
- produces with our model. Finally, 5 determines the speed of employing
318
- SCF with our model and compares its scaling behavior with respect to
319
- system size with the scaling of traditional functionals.
320
 
321
  The metrics for the different benchmark sets are:
322
 
@@ -327,13 +324,13 @@ The metrics for the different benchmark sets are:
327
  of reaction r as calculated by a high-accuracy method from the W4
328
  family (CCSDT(Q)/CBS to CCSDTQ56/CBS), and $\Delta E_r^\theta$ is
329
  the prediction of the reaction energy difference using SCF
330
- calculations with our functional, and
331
  2. Weighted total mean absolute deviations 2 (WTMAD-2) in kcal/mol for
332
  the GMTKN55 benchmark set
333
  $\text{WTMAD-2} = \frac1{\sum^{55}_{i=1} N_i} \sum_{i=1}^{55} N_i \frac{56.84\text{ kcal/mol}}{\overline{|\Delta E|}_i} \text{MAE}_i$
334
  Here $N_i$ is the number of reactions in subset *i*,
335
  $\overline{|\Delta E|}_i$ is the average energy difference in subset
336
- *i* in kcal/mol and $\text{MAE}_i$ is the mean absolute error in
337
  kcal/mol for subset *i*.
338
  3. For the geometry benchmark sets that report bond lengths, we measure
339
  the absolute error in bond lengths in Angstrom, averaged over the
@@ -341,22 +338,19 @@ The metrics for the different benchmark sets are:
341
  dataset. For the benchmark that also contains bond angles, we report
342
  the absolute error of the angles, averaged over the number of bonds
343
  and equilibrium structures in the dataset.
344
- 4. We follow the metrics defined in `hait2018`{.interpreted-text
345
- role="footcite"}. We measure the Root Mean Squared Error (RMSE) of
346
- the dipole moment with respect to reference values provided by the
347
- benchmark dataset. For those molecules (indexed with *i*) for which
348
- only the reference magnitude of the dipole moment
349
- $\mu_i^{\text{ref}} = |{\vec\mu}_i^{\text{ref}}|$ is provided, we
350
- measure the RMSE of the predicted magnitude of the dipole moment
351
- $\mu_i^{\theta} = |{\vec\mu}_i^{\theta}|$ is available, the error is
352
- defined as
353
- $\text{Error}_i = \frac{\mu_i^\theta - \mu_i^\text{ref}}{\max(\mu_i^\text{ref}, 1D)} \times 100\%$.
354
- Here *D* denotes the unit of Debye. For those molecules for which
355
- the reference value of the dipole vector $\vec{\mu}_i^\text{ref}$ is
356
- also available we instead compute
357
- $\text{Error}_i = \frac{|\vec{\mu}_i^\theta - \vec{\mu}_i^\text{ref}|}{max(\mu_i^\text{ref}, 1D)} \times 100\%$.
358
- Using these errors we compute the RMSE as follows:
359
- $\text{RMSE} = \sqrt{\frac{1}{N} \sum_{i=1}^N \text{Error}_i^2}$
360
  5. We fit a power law of the form
361
  $C(M) = \left(\frac{n(M)}{A}\right)^k$ to the 22 data points of the
362
  test set where *C(M)* and *n(M)* are the computational cost and
@@ -366,50 +360,32 @@ The metrics for the different benchmark sets are:
366
 
367
  ### Evaluation results
368
 
369
- We demonstrate that the combination of a large-scale high-accuracy
370
- dataset combined with our deep learning architecture produces the Skala
371
- functional that predicts atomization energies at chemical accuracy (1
372
- kcal/mol), as measured on the public benchmark set W4-17. On the public
373
- benchmark set GMTKN55, which covers general-main group thermochemistry,
374
- kinetics and noncovalent interactions, our model makes predictions
375
- around 2.72 kcal/mol. This accuracy is better than state-of-the-art
376
  range-separated hybrid functionals while only requiring runtimes typical
377
  of semi-local DFT.
378
 
379
- On the geometry optimization benchmarks we demonstrate that we can
380
- converge to the reference equilibrium structure with an error that is
381
- comparable to a range-separated hybrid functional. On the dipole
382
- prediction benchmark test we demonstrate that the error of our dipole
383
- moment prediction with respect to reference values is better than
384
  state-of-the-art range-separated hybrid functionals.
385
 
386
- Finally, our scaling results demonstrate that our functional shows the
387
- asymptotic scaling behavior of a metaGGA functional, with an approximate
388
- prefactor of 3 compared to the r2SCAN.
389
 
390
  ## License
391
 
392
- > MIT License
393
- >
394
- > Copyright (c) Microsoft Corporation.
395
- >
396
- > Permission is hereby granted, free of charge, to any person obtaining a copy
397
- > of this software and associated documentation files (the "Software"), to deal
398
- > in the Software without restriction, including without limitation the rights
399
- > to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
400
- > copies of the Software, and to permit persons to whom the Software is
401
- > furnished to do so, subject to the following conditions:
402
- >
403
- > The above copyright notice and this permission notice shall be included in all
404
- > copies or substantial portions of the Software.
405
- >
406
- > THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
407
- > IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
408
- > FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
409
- > AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
410
- > LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
411
- > OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
412
- > SOFTWARE.
413
 
414
  ## Citation
415
 
@@ -421,7 +397,7 @@ version number as follows:
421
  ``` bibtex
422
  @misc{luise2025,
423
  title={Accurate and scalable exchange-correlation with deep learning},
424
- author={Giulia Luise and Chin-Wei Huang and Thijs Vogels and Derk P. Kooi and Sebastian Ehlert and Stephanie Lanius and Klaas J. H. Giesbertz and Amir Karton and Deniz Gunceler and Megan Stanley and Wessel P. Bruinsma and Lin Huang and Xinran Wei and José Garrido Torres and Abylay Katbashev and Rodrigo Chavez Zavaleta and Bálint Máté and Sékou-Oumar Kaba and Roberto Sordillo and Yingrong Chen and David B. Williams-Young and Christopher M. Bishop and Jan Hermann and Rianne van den Berg and Paola Gori-Giorgi},
425
  year={2025},
426
  eprint={2506.14665},
427
  archivePrefix={arXiv},
 
48
  reactions, as well as a small amount of publicly available high-accuracy
49
  data.
50
 
51
+ We demonstrate that departure from the historical trade-off between
52
+ accuracy and efficiency is enabled by learning non-local representations
53
+ of electronic structure directly from data, bypassing the need for
54
  increasingly costly hand-engineered features. The Skala-1.1 functional
55
  surpasses state-of-the-art hybrid functionals in accuracy across the
56
  main-group chemistry benchmark set GMTKN55, which covers general
 
158
  Collection](https://arxiv.org/abs/2506.14492v5) (MSR-ACC).
159
  Additionally the MSR-ACC subsets for larger TAEs (up to 9
160
  non-hydrogen atoms), conformers, ionization potentials, electron
161
+ affinities, proton affinities, reaction paths, and distorted
162
+ equilibrium structures were included. The labels for these data sets
163
  are obtained with the W1w method and are part of the currently
164
  unpublished subsets of the MSR-ACC.
165
 
166
  **Atomic Data**
167
+ : Total energies, electron affinities, and ionization potentials (up
168
+ to triple ionization) for atoms, from H to Ar (excluding Li and Be
169
+ due to basis-set constraints). This data was produced in-house with
170
+ CCSD(T) by extrapolating to the complete basis set limit from
171
+ quadruple zeta (QZ) and pentuple zeta (5Z) calculations. The basis
172
+ sets used for H and He were aug-cc-pV(Q+d)Z and aug-cc-pV(5+d),
173
+ while for the remaining elements B--Ar the basis sets were
174
+ aug-cc-pCVQZ and aug-cc-pCV5Z. All basis sets were obtained from the
175
+ [Basis Set Exchange (BSE)](https://www.basissetexchange.org/).
176
+ Extrapolation of the correlation energy was performed by fitting a
177
+ $Z^{-3}$ expression, while the Hartree--Fock energy was extrapolated
178
+ using the two-point scheme of [Karton 2006][karton2006].
 
179
 
180
  **Transition metal properties**
181
  : Additional data for transition metal atoms and dimers, including
182
+ ionization potentials, spin splittings, and dissociation energies.
183
  The reference energies were obtained from literature.
184
 
185
  **NCI-Atlas**
186
+
187
  : Five datasets from the [NCI-Atlas collection of non-covalent
188
  interactions](http://www.nciatlas.org/):
189
 
190
+ - [D442x10](http://www.nciatlas.org/D442x10.html), dissociation
191
+ curves for dispersion-bound van der Waals complexes
192
+ - [SH250x10](http://www.nciatlas.org/SH250.html), dissociation
193
+ curves for sigma-hole-bound van der Waals complexes
194
+ - [R739x5](http://www.nciatlas.org/R739.html), compressed van der
195
+ Waals complexes
196
+ - [HB300SPXx10](http://www.nciatlas.org/HB300SPX.html), dissociation
197
+ curves for hydrogen-bound van der Waals complexes
198
+ - [IHB100x10](http://www.nciatlas.org/IHB100.html), dissociation
199
+ curves for ionic hydrogen-bound van der Waals complexes
200
 
201
  **GDB9**
202
  : The graph data base with up to non-hydrogen atoms computed at
 
204
 
205
  **BH9**
206
  : Reactions and barrier heights from [Prasad et. al 2021][prasad2021]
207
+ The data set was filtered for systems with up to ten
208
  non-hydrogen atoms.
209
 
210
  **NCIBLIND**
 
226
  : Containing atomization energies of carbon
227
  clusters from [Karton et al. 2009][karton2009].
228
 
229
+ For all training data, input density and derived meta-GGA features were
230
+ computed from density matrices of converged B3LYP SCF calculations
231
+ (def2-QZVP and ma-def2-QZVP basis sets) using a modified version of
232
+ PySCF.
233
 
234
  ### Training procedure
235
 
 
237
 
238
  The training datapoints are preprocessed as follows.
239
 
240
+ - For each molecule, the density and derived meta-GGA features are
241
+ computed from the density matrix of a converged B3LYP SCF calculation
242
+ using a def2-QZVP or ma-def2-QZVP basis set in a modified version of
243
+ PySCF.
244
+ - Density fitting was not applied.
245
+ - The density features were evaluated on an atom-centered integration
246
  grid of level 1.
247
+ - The radial quadrature was performed with Treutler-Ahlrichs,
248
+ Gauss-Chebyshev, Delley, or Mura-Knowles schemes based on Bragg atomic
249
+ radii with Treutler-based radii adjustment.
250
  - The space-partitioning was performed with Becke partition and
251
  Treutler-Ahlrichs radii adjustment, Stratmann-Scuseria-Frisch (SSF)
252
  partition scheme, and Laqua-Kussmann-Ochsenfeld (LKO) partition
253
  scheme.
254
  - The angular grid points were pruned using the NWChem scheme.
255
+ - No density-based cutoff was applied; all grid points were retained for
256
+ training.
257
 
258
  #### Training hyperparameters
259
 
260
  The training hyperparameter settings are detailed in the supplementary
261
+ material of [Accurate and scalable exchange-correlation with deep
262
+ learning, Luise et al. 2025](https://arxiv.org/abs/2506.14492). This
263
+ repository only includes the code to evaluate the provided checkpoints,
264
+ not the training code.
265
 
266
  #### Speeds, sizes, times
267
 
268
+ The training of the functional on the dataset described above took
269
+ approximately 48 hours for 1M steps on an [ND A100 v4 series
 
270
  VM](https://learn.microsoft.com/en-us/azure/virtual-machines/sizes/gpu-accelerated/ndasra100v4-series?tabs=sizebasic)
271
+ with 8 NVIDIA A100 GPUs (80 GB each), 96 CPU cores, 880 GB RAM, and a 6
272
+ TB disk.
273
 
274
+ The model checkpoints have ~385k trainable parameters.
275
 
276
  ## Evaluation
277
 
 
289
  CuAgAu83 from [Chan 2019][chan2019],
290
  DAPd from [Author et. al 2020][dapd2020],
291
  3d4dIPSS, TMB11, and TMD10 from [Liang et. al 2025][liang2025]
292
+ 3. GMTKN55. A diverse and highly accurate dataset of general main-group
293
+ thermochemistry, kinetics, and noncovalent
294
  interactions from [Goerigk et. al 2017][goerigk2017]
295
  4. Geometry optimization datasets: (a) CCse21, equilibrium structures,
296
+ bond lengths, and bond angles from [Piccardo et. al 2015][piccardo2015];
297
+ (b) HMGB11, equilibrium structures and bond
298
+ lengths from [Grimme et. al 2015][grimme2015];
299
+ (c) LMGB35, equilibrium structures and bond lengths
300
+ from [Grimme et. al 2015][grimme2015]; and
301
  (d) W4-11-GEOM, equilibrium structures, bond
302
+ lengths, and bond angles.
303
+ 5. The dipole benchmark dataset from [Hait et al. 2018][hait2018]
304
+ 6. Conformer search benchmark dataset of 22 molecules spanning 24 to
305
+ 176 atoms, used for cost-scaling analysis, from
306
  [Grimme et al. 2019][grimme2019]
307
 
308
+ These six benchmark types serve to measure different performance aspects
309
+ of the functional. Benchmarks 1 and 2 focus on the accuracy of predicted
310
+ reaction energies. Benchmark 3 evaluates general main-group
311
+ thermochemistry, kinetics, and noncovalent interactions. Benchmark 4
312
+ evaluates geometry optimization and convergence to reference equilibrium
313
+ structures. Benchmark 5 measures dipole moments, providing a proxy for
314
+ the quality of the self-consistent electron density produced by the SCF
315
+ procedure. Finally, benchmark 6 assesses computational cost scaling with
316
+ respect to system size.
 
 
317
 
318
  The metrics for the different benchmark sets are:
319
 
 
324
  of reaction r as calculated by a high-accuracy method from the W4
325
  family (CCSDT(Q)/CBS to CCSDTQ56/CBS), and $\Delta E_r^\theta$ is
326
  the prediction of the reaction energy difference using SCF
327
+ calculations with our functional.
328
  2. Weighted total mean absolute deviations 2 (WTMAD-2) in kcal/mol for
329
  the GMTKN55 benchmark set
330
  $\text{WTMAD-2} = \frac1{\sum^{55}_{i=1} N_i} \sum_{i=1}^{55} N_i \frac{56.84\text{ kcal/mol}}{\overline{|\Delta E|}_i} \text{MAE}_i$
331
  Here $N_i$ is the number of reactions in subset *i*,
332
  $\overline{|\Delta E|}_i$ is the average energy difference in subset
333
+ *i* in kcal/mol, and $\text{MAE}_i$ is the mean absolute error in
334
  kcal/mol for subset *i*.
335
  3. For the geometry benchmark sets that report bond lengths, we measure
336
  the absolute error in bond lengths in Angstrom, averaged over the
 
338
  dataset. For the benchmark that also contains bond angles, we report
339
  the absolute error of the angles, averaged over the number of bonds
340
  and equilibrium structures in the dataset.
341
+ 4. For the dipole benchmark, we follow the metrics defined in
342
+ [Hait et al. 2018][hait2018]. For molecules
343
+ (indexed by *i*) for which only the reference magnitude of the
344
+ dipole moment $\mu_i^{\text{ref}} = |{\vec\mu}_i^{\text{ref}}|$ is
345
+ provided, the error is defined as
346
+ $\text{Error}_i = \frac{\mu_i^\theta - \mu_i^\text{ref}}{\max(\mu_i^\text{ref}, 1D)} \times 100\%$,
347
+ where $\mu_i^{\theta} = |{\vec\mu}_i^{\theta}|$ is the predicted
348
+ magnitude and *D* denotes the unit of Debye. For molecules for which
349
+ the reference dipole vector $\vec{\mu}_i^\text{ref}$ is also
350
+ available, we instead compute
351
+ $\text{Error}_i = \frac{|\vec{\mu}_i^\theta - \vec{\mu}_i^\text{ref}|}{\max(\mu_i^\text{ref}, 1D)} \times 100\%$.
352
+ The RMSE is then
353
+ $\text{RMSE} = \sqrt{\frac{1}{N} \sum_{i=1}^N \text{Error}_i^2}$.
 
 
 
354
  5. We fit a power law of the form
355
  $C(M) = \left(\frac{n(M)}{A}\right)^k$ to the 22 data points of the
356
  test set where *C(M)* and *n(M)* are the computational cost and
 
360
 
361
  ### Evaluation results
362
 
363
+ On W4-17, the Skala-1.1 functional predicts atomization energies at
364
+ chemical accuracy (~1 kcal/mol MAE). On GMTKN55, which covers general
365
+ main-group thermochemistry, kinetics, and noncovalent interactions, it
366
+ achieves a WTMAD-2 of 2.72 kcal/mol, surpassing state-of-the-art
 
 
 
367
  range-separated hybrid functionals while only requiring runtimes typical
368
  of semi-local DFT.
369
 
370
+ On the geometry optimization benchmarks, the functional converges to
371
+ reference equilibrium structures with errors comparable to a
372
+ range-separated hybrid functional. On the dipole prediction benchmark,
373
+ the error in dipole moment predictions is better than that of
 
374
  state-of-the-art range-separated hybrid functionals.
375
 
376
+ Finally, the scaling results show that the Skala-1.1 functional exhibits
377
+ the asymptotic scaling behavior of a meta-GGA functional, with an
378
+ approximate prefactor of 3 relative to r2SCAN.
379
 
380
  ## License
381
 
382
+ :::: dropdown
383
+ MIT License
384
+
385
+ ::: {.literalinclude lines="3-"}
386
+ ../../LICENSE.txt
387
+ :::
388
+ ::::
 
 
 
 
 
 
 
 
 
 
 
 
 
 
389
 
390
  ## Citation
391
 
 
397
  ``` bibtex
398
  @misc{luise2025,
399
  title={Accurate and scalable exchange-correlation with deep learning},
400
+ author={Giulia Luise and Chin-Wei Huang and Thijs Vogels and Derk P. Kooi and Sebastian Ehlert and Stephanie Lanius and Klaas J. H. Giesbertz and Amir Karton and Deniz Gunceler and Stefano Battaglia and Gregor N. C. Simm and P. Bernát Szabó and Megan Stanley and Wessel P. Bruinsma and Lin Huang and Xinran Wei and José Garrido Torres and Abylay Katbashev and Rodrigo Chavez Zavaleta and Bálint Máté and Sékou-Oumar Kaba and Roberto Sordillo and Yingrong Chen and David B. Williams-Young and Christopher M. Bishop and Jan Hermann and Rianne van den Berg and Paola Gori-Giorgi},
401
  year={2025},
402
  eprint={2506.14665},
403
  archivePrefix={arXiv},