awvwgk commited on
Commit
3318422
·
verified ·
1 Parent(s): f91be86

Add model card for Skala-1.1

Browse files
Files changed (1) hide show
  1. README.md +448 -0
README.md ADDED
@@ -0,0 +1,448 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ library_name: skala
4
+ tags:
5
+ - chemistry
6
+ - density-functional-theory
7
+ - exchange-correlation-functional
8
+ - computational-chemistry
9
+ - quantum-chemistry
10
+ ---
11
+
12
+ # Skala 1.1 model
13
+
14
+ ## Model details
15
+
16
+ In pursuit of the universal functional for density functional theory
17
+ (DFT), the OneDFT team from Microsoft Research AI for Science has
18
+ developed the Skala-1.1 exchange-correlation functional, as introduced
19
+ in [Accurate and scalable exchange-correlation with deep learning, Luise
20
+ et al. 2025](https://arxiv.org/abs/2506.14665). This approach departs
21
+ from the traditional route of incorporating increasingly expensive
22
+ hand-designed non-local features from Jacob\'s ladder into functional
23
+ forms to improve their accuracy. Instead, we employ a deep learning
24
+ approach with a scalable neural network that uses only inexpensive input
25
+ features to learn the necessary non-local representations.
26
+
27
+ The functional is based on a neural network architecture that takes as
28
+ input features on a 3D grid describing the electron density and derived
29
+ meta-generalized-gradient (meta-GGA) quantities. The architecture
30
+ performs scalable non-local message-passing on the integration grid via
31
+ a second, coarser grid, combined with shared local layers that enable
32
+ representation learning of both local and non-local features. These
33
+ representations are then used to predict the exchange-correlation energy
34
+ in an end-to-end data-driven manner.
35
+
36
+ To facilitate this learning, the model is trained on a dataset of
37
+ unprecedented size, containing highly accurate energy labels from
38
+ coupled cluster theory. The largest subset focuses on atomization
39
+ energies and was generated in collaboration with the University of New
40
+ England. This subset is released as part of the Microsoft Research
41
+ Accurate Chemistry Collection (MSR-ACC, [Accurate Chemistry Collection:
42
+ Coupled cluster atomization energies for broad chemical space, Ehlert et
43
+ al. 2025](https://arxiv.org/abs/2506.14492v5)). To broaden coverage of
44
+ other types of chemistry, the training dataset is further complemented
45
+ with in-house generated datasets covering conformers, ionization
46
+ potentials, electron affinities, proton affinities, noncovalent
47
+ interactions, distorted equilibrium geometries, and elementary
48
+ reactions, as well as a small amount of publicly available high-accuracy
49
+ data.
50
+
51
+ We demonstrate departure from the historical trade-off between accuracy
52
+ and efficiency is enabled by learning non-local representations of
53
+ electronic structure directly from data, bypassing the need for
54
+ increasingly costly hand-engineered features. The Skala-1.1 functional
55
+ surpasses state-of-the-art hybrid functionals in accuracy across the
56
+ main-group chemistry benchmark set GMTKN55, which covers general
57
+ main-group thermochemistry, kinetics, and noncovalent interactions, with
58
+ an error of 2.72 kcal/mol, while retaining the lower computational cost
59
+ characteristic of semi-local DFT. With this work, we demonstrate the
60
+ viability of our approach toward the universal density functional across
61
+ all of chemistry.
62
+
63
+ Users of this model are expected to have a basic understanding of the
64
+ field of quantum chemistry and density functional theory.
65
+
66
+ Developed by
67
+ : Chin-Wei Huang, Deniz Gunceler, Derk Kooi, Gregor Simm, Klaas
68
+ Giesbertz, Giulia Luise, Jan Hermann, Megan Stanley, Paola Gori
69
+ Giorgi, P. Bernát Szabó, Rianne van den Berg, Sebastian Ehlert,
70
+ Stefano Battaglia, Stephanie Lanius, Thijs Vogels, Wessel Bruinsma
71
+
72
+ Shared by
73
+ : Microsoft Research AI for Science
74
+
75
+ Model type
76
+ : Neural Network Density Functional Theory Exchange Correlation
77
+ Functional
78
+
79
+ License
80
+ : MIT
81
+
82
+ ## Direct intended uses
83
+
84
+ 1. The Skala-1.1 functional is shared with the research community to
85
+ facilitate reproduction of the evaluations presented in our paper.
86
+ 2. Evaluating reaction energy differences by computing the total energy
87
+ of all compounds in a reaction using a self-consistent field (SCF)
88
+ calculation with the Skala-1.1 exchange-correlation functional.
89
+ 3. Evaluating the total energy of a molecule using an SCF calculation
90
+ with the Skala-1.1 exchange-correlation functional. Note that, as
91
+ with all density functionals, energy differences are predicted much
92
+ more reliably than total energies of individual molecules.
93
+ 4. The SCF implementation provided uses PySCF, which runs the
94
+ functional on CPU. We also provide a traced version of the Skala-1.1
95
+ functional so that other, more optimized open-source SCF
96
+ codes—including GPU-enabled ones—can integrate it into their
97
+ pipelines, for instance through GauXC. A compatible fork of GauXC is
98
+ included in this repository.
99
+
100
+ ## Out-of-scope uses
101
+
102
+ 1. Evaluating the functional with a single pass given a fixed density
103
+ as input is not the intended way to evaluate the model. The model\'s
104
+ predictions should always be made by using it as part of an SCF
105
+ procedure.
106
+ 2. We do not include a training pipeline for the Skala-1.1 functional
107
+ in this code base.
108
+
109
+ ## Risks and limitations
110
+
111
+ 1. Interpretation of results requires expertise in quantum chemistry.
112
+ 2. The Skala-1.1 functional is trained on atomization energies,
113
+ conformers, proton affinities, ionization potentials, electron
114
+ affinities, elementary reaction pathways, distorted equilibrium
115
+ geometries, and non-covalent interactions, as well as a small amount
116
+ of total energies of atoms and transition metal atoms and dimer
117
+ properties. We have benchmarked performance on W4-17 for atomization
118
+ energies and on GMTKN55, which covers general main-group
119
+ thermochemistry, kinetics, and noncovalent interactions, to provide
120
+ an indication of generalization beyond the training set. We have
121
+ also evaluated robustness on dipole moment predictions and geometry
122
+ optimization.
123
+ 3. The Skala-1.1 functional has been trained on data containing the
124
+ following elements: H–Xe. It has been tested on data containing
125
+ H–Xe, Pb, and Bi.
126
+ 4. Given points 2 and 3 above, this is not a production model. We
127
+ advise testing the functional further before applying it to your
128
+ research and welcome any feedback.
129
+
130
+ ## Recommendations
131
+
132
+ 1. In our PySCF-based SCF implementation, the largest system tested
133
+ contained 180 atoms using the def2-TZVP basis set
134
+ (~5000 orbitals) on [Eadsv5
135
+ series](https://learn.microsoft.com/en-us/azure/virtual-machines/sizes/memory-optimized/eadsv5-series?tabs=sizebasic)
136
+ virtual machines. Larger systems may run out of memory.
137
+ 2. For implementations optimized for memory, speed, or GPU support, we
138
+ recommend integrating the functional with other open-source SCF
139
+ packages, for instance through GauXC. A compatible fork of GauXC is
140
+ included in this repository.
141
+ 3. Skala-1.1 will also be available through [Azure AI
142
+ Foundry](https://labs.ai.azure.com/projects/skala/), where it is
143
+ coupled with Microsoft\'s GPU-accelerated [Accelerated
144
+ DFT](https://arxiv.org/abs/2406.11185) application.
145
+
146
+ ## Training details
147
+
148
+ ### Training data
149
+
150
+ The following data is included in our training set:
151
+
152
+ **MSR-ACC**
153
+ : 99% of MSR-ACC/TAE25 (~78k reactions) containing atomization
154
+ energies for up to five non-hydrogen atoms. This data was generated
155
+ in collaboration with Prof. Amir Karton, University of New England,
156
+ with the W1-F12 composite protocol based on CCSD(T) and is released
157
+ as part of the [Microsoft Research Accurate Chemistry
158
+ Collection](https://arxiv.org/abs/2506.14492v5) (MSR-ACC).
159
+ Additionally the MSR-ACC subsets for larger TAEs (up to 9
160
+ non-hydrogen atoms), conformers, ionization potentials, electron
161
+ affinities, proton affinities, reaction paths and distorted
162
+ equilibrium structures were included. The labels for this data sets
163
+ are obtained with the W1w method and are part of the currently
164
+ unpublished subsets of the MSR-ACC.
165
+
166
+ **Atomic Data**
167
+ : Total energies, electron affinities and ionization potentials (up to
168
+ triple ionization) for atoms, from H to Ar (excluding Li and Be
169
+ because of basis set constraints).This data was produced in-house
170
+ with CCSD(T) by extrapolating to the complete basis set limit from
171
+ quadruple zeta (QZ) and pentuple zeta (5Z) basis set
172
+ calculations.The basis sets used for H and He were aug-cc-pV(Q+d)Z,
173
+ aug-cc-pV(5+d), while for the remaining elements B-Ar the basis sets
174
+ used were aug-cc-pCVQZ and aug-cc-pCV5Z. All basis sets were obtained
175
+ from the [Basis Set Exchange
176
+ (BSE)](https://www.basissetexchange.org/). Extrapolation of the
177
+ correlation energy was performed by fitting a simple Z\^(-3)
178
+ expression, while extrapolation of the Hartree-Fock energy was
179
+ performed using a two-point extrapolation.
180
+
181
+ **Transition metal properties**
182
+ : Additional data for transition metal atoms and dimers, including
183
+ ionization potentials, spin splittings and dissociation energies.
184
+ The reference energies were obtained from literature.
185
+
186
+ **NCI-Atlas**
187
+
188
+ : Five datasets from the [NCI-Atlas collection of non-covalent
189
+ interactions](http://www.nciatlas.org/):
190
+
191
+ - [D442x10](http://www.nciatlas.org/D442x10.html), dissociation
192
+ curves for dispersion bound van-der-Waals complexes
193
+ - [SH250x10](http://www.nciatlas.org/SH250.html), dissociation
194
+ curves for sigma-hole bound van-der-Waals complexes
195
+ - [R739x5](http://www.nciatlas.org/R739.html), compressed
196
+ van-der-Waals complexes
197
+ - [HB300SPXx10](http://www.nciatlas.org/HB300SPX.html), dissociation
198
+ curves for hydrogen bound van-der-Waals complexes
199
+ - [IHB100x10](http://www.nciatlas.org/IHB100.html), dissociation
200
+ curves for ionic hydrogen bound van-der-Waals complexes
201
+
202
+ **GDB9**
203
+ : The graph data base with up to non-hydrogen atoms computed at
204
+ W1-F12 level of theory from [Karton 2025][karton2025].
205
+
206
+ **BH9**
207
+ : Reactions and barrier heights from [Prasad et. al 2021][prasad2021]
208
+ The data set was filted for systems with up to ten
209
+ non-hydrogen atoms.
210
+
211
+ **NCIBLIND**
212
+ : Data set of non-covalent dissociation curves from [Taylor et. al 2016][taylor2016].
213
+
214
+ **Water2510**
215
+ : Data set of the potential energy surface of the water
216
+ dimer from [Smith et al. 2016][smith2016]. The data set was fully relabeled with W1w.
217
+
218
+ **DES370k**
219
+ : Subset with CCSD(T)/dCBS(aug-cc-pVQZ) non-covalent interaction
220
+ energies from [Donchev et al. 2021][donchev2021].
221
+
222
+ **MB2061**
223
+ : Dataset containing decomposition energies of artificial
224
+ molecules from [Gasevic et al. 2025][gasevic2025].
225
+
226
+ **W4-CC**
227
+ : Containing atomization energies of carbon
228
+ clusters from [Karton et al. 2009][karton2009].
229
+
230
+ For all training data we have created input density and derived meta-GGA
231
+ features using density matrices of converged SCF calculations with the
232
+ B3LYP functional (def2-QZVP and ma-def2-QZVP basis set) using a modified
233
+ version of the PySCF software package.
234
+
235
+ ### Training procedure
236
+
237
+ #### Preprocessing
238
+
239
+ The training datapoints are preprocessed as follows.
240
+
241
+ - For each molecule the density and derived meta-GGA features are
242
+ computed based on the density matrix of converged SCF calculations
243
+ with the B3LYP functional using a def2-QZVP or ma-def2-QZVP basis set
244
+ using a modified version of the PySCF software package.
245
+ - Density fitting was not applied for the SCF calculation.
246
+ - The density features were evaluated on an atom centered integration
247
+ grid of level 1.
248
+ - The radial integral was performed with the Treutler-Ahlrichs,
249
+ Gauss-Chebychev, Delley, or Mura-Knowles based on Bragg atomic radii
250
+ with Treutler based radii adjustment.
251
+ - The space-partitioning was performed with Becke partition and
252
+ Treutler-Ahlrichs radii adjustment, Stratmann-Scuseria-Frisch (SSF)
253
+ partition scheme, and Laqua-Kussmann-Ochsenfeld (LKO) partition
254
+ scheme.
255
+ - The angular grid points were pruned using the NWChem scheme.
256
+ - No density based cutoff was applied and all grid points were retained
257
+ for training.
258
+
259
+ #### Training hyperparameters
260
+
261
+ The training hyperparameter settings are detailed in the supplementary
262
+ of [Accurate and scalable exchange-correlation with deep learning, Luise
263
+ et al. 2025](https://arxiv.org/abs/2506.14492v6). This repository only
264
+ includes the code to evaluate the checkpoints provided, not the training
265
+ code.
266
+
267
+ #### Speeds, sizes, times
268
+
269
+ The training of our functional using the training dataset as detailed in
270
+ the section \"Training data\" took approximately 48h for 1M training
271
+ steps on a [ND A100 v4 series
272
+ VM](https://learn.microsoft.com/en-us/azure/virtual-machines/sizes/gpu-accelerated/ndasra100v4-series?tabs=sizebasic)
273
+ with 8 NVIDIA A100 GPU with 80 GB memory each, 96 CPU cores, 880 GB RAM,
274
+ and a 6 TB disk.
275
+
276
+ The model checkpoints have ~385,217 trainable parameters.
277
+
278
+ ## Evaluation
279
+
280
+ ### Testing data, factors, and metrics
281
+
282
+ We have evaluated our functional on several different benchmark sets:
283
+
284
+ 1. W4-17. A diverse and highly accurate dataset of atomization
285
+ energies from [Karton et. al 2017][karton2017]
286
+ 2. Transition metal data sets including
287
+ MOR41 from [Dohm et. al 2018][dohm2018],
288
+ ROST61 from [Maurer et. al 2021][maurer2021],
289
+ MOBH35 from [Semidalas et. al 2022][semidalas2022],
290
+ 3dTMV from [Neugebauer et. al 2023][neugebauer2023],
291
+ CuAgAu83 from [Chan 2019][chan2019],
292
+ DAPd from [Author et. al 2020][dapd2020],
293
+ 3d4dIPSS, TMB11, and TMD10 from [Liang et. al 2025][liang2025]
294
+ 3. GMTKN55. A diverse and highly accurate dataset of general main group
295
+ thermochemistry, kinetics and noncovalent
296
+ interactions from [Goerigk et. al 2017][goerigk2017]
297
+ 4. Geometry optimization datasets: (a) CCse21, equilibrium structures,
298
+ bond lengths and bond angles from [Piccardo et. al 2015][piccardo2015]
299
+ (b) HMGB11, equilibrium structures, bond
300
+ lengths from [Grimme et. al 2015][grimme2015]
301
+ (c) LMGB35, equilibrium structures, bond lengths,
302
+ and from [Grimme et. al 2015][grimme2015]
303
+ (d) W4-11-GEOM, equilibrium structures, bond
304
+ lengths and bond angles.
305
+ 5. The Dipole benchmark dataset from [Hait et al. 2018][hait2018]
306
+ 6. Conformer search benchmark dataset of 22 molecules spanning
307
+ molecular size from 24 to 176 atoms for cost scaling from
308
+ [Grimme et al. 2019][grimme2019]
309
+
310
+ The evaluation of our model using the 5 different types of benchmarks as
311
+ defined above serve to measure different performance aspects of our
312
+ functional. For example, 1 and 2 focus on the accuracy of predicted
313
+ reaction energies, and 3 focuses on the ability of our functional to
314
+ perform geometry optimization and to converge to the right equilibrium
315
+ molecular structure. Furthermore, 4 measures the dipole moment of the
316
+ molecules in the benchmark set, which provides a measure for the quality
317
+ of the self-consistent electron density that a converged SCF procedure
318
+ produces with our model. Finally, 5 determines the speed of employing
319
+ SCF with our model and compares its scaling behavior with respect to
320
+ system size with the scaling of traditional functionals.
321
+
322
+ The metrics for the different benchmark sets are:
323
+
324
+ 1. Mean Absolute Error (MAE) in kcal/mol for reactions in W4-17
325
+ $MAE = \frac{1}{N} \sum_{r=1}^N |\Delta E_r - \Delta E_r^\theta|$.
326
+ Here *N* is the number of reactions in W4-17, *r* is the index
327
+ denoting reactions in W4-17, $\Delta E_r$ is the energy difference
328
+ of reaction r as calculated by a high-accuracy method from the W4
329
+ family (CCSDT(Q)/CBS to CCSDTQ56/CBS), and $\Delta E_r^\theta$ is
330
+ the prediction of the reaction energy difference using SCF
331
+ calculations with our functional, and
332
+ 2. Weighted total mean absolute deviations 2 (WTMAD-2) in kcal/mol for
333
+ the GMTKN55 benchmark set
334
+ $\text{WTMAD-2} = \frac1{\sum^{55}_{i=1} N_i} \sum_{i=1}^{55} N_i \frac{56.84\text{ kcal/mol}}{\overline{|\Delta E|}_i} \text{MAE}_i$
335
+ Here $N_i$ is the number of reactions in subset *i*,
336
+ $\overline{|\Delta E|}_i$ is the average energy difference in subset
337
+ *i* in kcal/mol and $\text{MAE}_i$ is the mean absolute error in
338
+ kcal/mol for subset *i*.
339
+ 3. For the geometry benchmark sets that report bond lengths, we measure
340
+ the absolute error in bond lengths in Angstrom, averaged over the
341
+ number of bonds and the number of equilibrium structures in the
342
+ dataset. For the benchmark that also contains bond angles, we report
343
+ the absolute error of the angles, averaged over the number of bonds
344
+ and equilibrium structures in the dataset.
345
+ 4. We follow the metrics defined in `hait2018`{.interpreted-text
346
+ role="footcite"}. We measure the Root Mean Squared Error (RMSE) of
347
+ the dipole moment with respect to reference values provided by the
348
+ benchmark dataset. For those molecules (indexed with *i*) for which
349
+ only the reference magnitude of the dipole moment
350
+ $\mu_i^{\text{ref}} = |{\vec\mu}_i^{\text{ref}}|$ is provided, we
351
+ measure the RMSE of the predicted magnitude of the dipole moment
352
+ $\mu_i^{\theta} = |{\vec\mu}_i^{\theta}|$ is available, the error is
353
+ defined as
354
+ $\text{Error}_i = \frac{\mu_i^\theta - \mu_i^\text{ref}}{\max(\mu_i^\text{ref}, 1D)} \times 100\%$.
355
+ Here *D* denotes the unit of Debye. For those molecules for which
356
+ the reference value of the dipole vector $\vec{\mu}_i^\text{ref}$ is
357
+ also available we instead compute
358
+ $\text{Error}_i = \frac{|\vec{\mu}_i^\theta - \vec{\mu}_i^\text{ref}|}{max(\mu_i^\text{ref}, 1D)} \times 100\%$.
359
+ Using these errors we compute the RMSE as follows:
360
+ $\text{RMSE} = \sqrt{\frac{1}{N} \sum_{i=1}^N \text{Error}_i^2}$
361
+ 5. We fit a power law of the form
362
+ $C(M) = \left(\frac{n(M)}{A}\right)^k$ to the 22 data points of the
363
+ test set where *C(M)* and *n(M)* are the computational cost and
364
+ number of atoms of molecule *M*, respectively, and *A* and *k* are
365
+ fitted parameters. We report the scaling power *k* as the main
366
+ metric.
367
+
368
+ ### Evaluation results
369
+
370
+ We demonstrate that the combination of a large-scale high-accuracy
371
+ dataset combined with our deep learning architecture produces the Skala
372
+ functional that predicts atomization energies at chemical accuracy (1
373
+ kcal/mol), as measured on the public benchmark set W4-17. On the public
374
+ benchmark set GMTKN55, which covers general-main group thermochemistry,
375
+ kinetics and noncovalent interactions, our model makes predictions
376
+ around 2.72 kcal/mol. This accuracy is better than state-of-the-art
377
+ range-separated hybrid functionals while only requiring runtimes typical
378
+ of semi-local DFT.
379
+
380
+ On the geometry optimization benchmarks we demonstrate that we can
381
+ converge to the reference equilibrium structure with an error that is
382
+ comparable to a range-separated hybrid functional. On the dipole
383
+ prediction benchmark test we demonstrate that the error of our dipole
384
+ moment prediction with respect to reference values is better than
385
+ state-of-the-art range-separated hybrid functionals.
386
+
387
+ Finally, our scaling results demonstrate that our functional shows the
388
+ asymptotic scaling behavior of a metaGGA functional, with an approximate
389
+ prefactor of 3 compared to the r2SCAN.
390
+
391
+ ## License
392
+
393
+ :::: dropdown
394
+ MIT License
395
+
396
+ ::: {.literalinclude lines="3-"}
397
+ ../../LICENSE.txt
398
+ :::
399
+ ::::
400
+
401
+ ## Citation
402
+
403
+ When using Skala-1.1 in your research, please reference it including the
404
+ version number as follows:
405
+
406
+ > This work uses the Skala-1.1 functional.
407
+
408
+ ``` bibtex
409
+ @misc{luise2025,
410
+ title={Accurate and scalable exchange-correlation with deep learning},
411
+ author={Giulia Luise and Chin-Wei Huang and Thijs Vogels and Derk P. Kooi and Sebastian Ehlert and Stephanie Lanius and Klaas J. H. Giesbertz and Amir Karton and Deniz Gunceler and Megan Stanley and Wessel P. Bruinsma and Lin Huang and Xinran Wei and José Garrido Torres and Abylay Katbashev and Rodrigo Chavez Zavaleta and Bálint Máté and Sékou-Oumar Kaba and Roberto Sordillo and Yingrong Chen and David B. Williams-Young and Christopher M. Bishop and Jan Hermann and Rianne van den Berg and Paola Gori-Giorgi},
412
+ year={2025},
413
+ eprint={2506.14665},
414
+ archivePrefix={arXiv},
415
+ primaryClass={physics.chem-ph},
416
+ url={https://arxiv.org/abs/2506.14665},
417
+ }
418
+ ```
419
+
420
+ ## Model card contact
421
+
422
+ - Rianne van den Berg, <rvandenberg@microsoft.com>
423
+ - Paola Gori-Giorgi, <pgorigiorgi@microsoft.com>
424
+ - Jan Hermann, <jan.hermann@microsoft.com>
425
+ - Sebastian Ehlert, <sehlert@microsoft.com>
426
+
427
+ [karton2006]: https://doi.org/10.1007/s00214-005-0028-6
428
+ [karton2009]: https://doi.org/10.1080/00268970802708959
429
+ [karton2011]: https://doi.org/10.1016/j.cplett.2011.05.007
430
+ [karton2017]: https://doi.org/10.1002/jcc.24854
431
+ [goerigk2017]: https://doi.org/10.1039/C7CP04913G
432
+ [piccardo2015]: https://doi.org/10.1021/jp511432m
433
+ [grimme2015]: https://doi.org/10.1063/1.4927476
434
+ [hait2018]: https://doi.org/10.1021/acs.jctc.7b01252
435
+ [grimme2019]: https://doi.org/10.1021/acs.jctc.9b00143
436
+ [dohm2018]: https://doi.org/10.1021/acs.jctc.7b01183
437
+ [maurer2021]: https://doi.org/10.1021/acs.jctc.1c00659
438
+ [semidalas2022]: https://doi.org/10.1021/acs.jctc.1c01126
439
+ [neugebauer2023]: https://doi.org/10.1021/acs.jctc.3c00617
440
+ [chan2019]: https://doi.org/10.1021/acs.jpca.9b03976
441
+ [chan2023]: https://doi.org/10.1021/acs.jctc.3c01060
442
+ [liang2025]: https://doi.org/10.1021/acs.jctc.5c01380
443
+ [smith2016]: https://doi.org/10.1021/acs.jpclett.6b00780
444
+ [taylor2016]: https://doi.org/10.1063/1.4961095
445
+ [donchev2021]: https://doi.org/10.1038/s41597-021-00833-x
446
+ [prasad2021]: https://doi.org/10.1021/acs.jctc.1c00694
447
+ [gasevic2025]: https://doi.org/10.1021/acs.jcim.5c01364
448
+ [karton2025]: https://doi.org/10.1016/j.cplett.2025.142030