Abstract

Advances in computational chemistry create an ongoing need for larger and higher-quality datasets that characterize noncovalent molecular interactions. We present three benchmark collections of quantum mechanical data, covering approximately 3,700 distinct types of interacting molecule pairs. The first collection, which we refer to as DES370K, contains interaction energies for more than 370,000 dimer geometries. These were computed using the coupled-cluster method with single, double, and perturbative triple excitations [CCSD(T)], which is widely regarded as the gold-standard method in electronic structure theory. Our second benchmark collection, a core representative subset of DES370K called DES15K, is intended for more computationally demanding applications of the data. Finally, DES5M, our third collection, comprises interaction energies for nearly 5,000,000 dimer geometries; these were calculated using SNS-MP2, a machine learning approach that provides results with accuracy comparable to that of our coupled-cluster training data. These datasets may prove useful in the development of density functionals, empirically corrected wavefunction-based approaches, semi-empirical methods, force fields, and models trained using machine learning methods.
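The abstract does not say how each interaction energy is defined. As a minimal sketch, assuming the common supermolecular definition (dimer energy minus the energies of the two isolated monomers), the quantity tabulated for each dimer geometry could be computed as below. The function name and the numerical energies are hypothetical and are not taken from the datasets.

```python
# Sketch of a supermolecular interaction energy, assuming the definition
# E_int = E(AB) - E(A) - E(B). This is an assumption, not a statement of
# how the DES370K, DES15K, or DES5M values were produced.

HARTREE_TO_KCAL_PER_MOL = 627.509474  # standard unit conversion factor


def interaction_energy(e_dimer, e_monomer_a, e_monomer_b):
    """Return the interaction energy in kcal/mol from energies in hartree."""
    return (e_dimer - e_monomer_a - e_monomer_b) * HARTREE_TO_KCAL_PER_MOL


# Hypothetical total energies in hartree, made up for illustration only.
example = interaction_energy(-152.700, -76.345, -76.347)
print(f"{example:.2f} kcal/mol")
```

Under this sign convention a negative value indicates a net attractive interaction between the two molecules at that geometry.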

Measurement(s)

Molecular Interaction Process • interaction energy • energy

Technology Type(s)

ab initio quantum chemistry computational method

Factor Type(s)

molecular entity

Machine-accessible metadata file describing the reported data: https://doi.org/10.6084/m9.figshare.13521638

Details

Title
Quantum chemical benchmark databases of gold-standard dimer interaction energies
Author
Donchev, Alexander G 1 ; Taube, Andrew G 1 ; Decolvenaere, Elizabeth 1 ; Hargus, Cory 1 ; McGibbon, Robert T 1 ; Law, Ka-Hei 1 ; Gregersen, Brent A 1 ; Li, Je-Luen 1 ; Palmo, Kim 1 ; Siva, Karthik 1 ; Bergdorf, Michael 1 ; Klepeis, John L 1 ; Shaw, David E 2

1 D. E. Shaw Research, New York, USA (GRID:grid.417724.3) (ISNI:0000 0004 0640 9990)
2 D. E. Shaw Research, New York, USA (GRID:grid.417724.3) (ISNI:0000 0004 0640 9990); Columbia University, Department of Biochemistry and Molecular Biophysics, New York, USA (GRID:grid.21729.3f) (ISNI:0000 0004 1936 8729)
Publication year
2021
Publication date
2021
Publisher
Nature Publishing Group
e-ISSN
2052-4463
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2488028917
Copyright
© The Author(s) 2021. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.