Abstract

Machine learning interatomic potentials (MLIPs) enable accurate simulations of materials at scales beyond that accessible by ab initio methods and play an increasingly important role in the study and design of materials. However, MLIPs are only as accurate and robust as the data on which they are trained. Here, we present DImensionality-Reduced Encoded Clusters with sTratified (DIRECT) sampling as an approach to select a robust training set of structures from a large and complex configuration space. By applying DIRECT sampling on the Materials Project relaxation trajectories dataset with over one million structures and 89 elements, we develop an improved materials 3-body graph network (M3GNet) universal potential that extrapolates more reliably to unseen structures. We further show that molecular dynamics (MD) simulations with the M3GNet universal potential can be used instead of expensive ab initio MD to rapidly create a large configuration space for target systems. We combined this scheme with DIRECT sampling to develop a reliable moment tensor potential for titanium hydrides without the need for iterative augmentation of training structures. This work paves the way for robust high-throughput development of MLIPs across any compositional complexity.

Details

Title
Robust training of machine learning interatomic potentials with dimensionality reduction and stratified sampling
Author
Qi, Ji 1   VIAFID ORCID Logo  ; Ko, Tsz Wai 2   VIAFID ORCID Logo  ; Wood, Brandon C. 3 ; Pham, Tuan Anh 3 ; Ong, Shyue Ping 4   VIAFID ORCID Logo 

 University of California San Diego, Materials Science and Engineering Program, La Jolla, USA (GRID:grid.266100.3) (ISNI:0000 0001 2107 4242); Lawrence Livermore National Laboratory, Quantum Simulations Group, Livermore, USA (GRID:grid.250008.f) (ISNI:0000 0001 2160 9702) 
 University of California San Diego, Department of NanoEngineering, La Jolla, USA (GRID:grid.266100.3) (ISNI:0000 0001 2107 4242) 
 Lawrence Livermore National Laboratory, Quantum Simulations Group, Livermore, USA (GRID:grid.250008.f) (ISNI:0000 0001 2160 9702); Lawrence Livermore National Laboratory, Laboratory for Energy Applications for the Future (LEAF), Livermore, USA (GRID:grid.250008.f) (ISNI:0000 0001 2160 9702) 
 University of California San Diego, Materials Science and Engineering Program, La Jolla, USA (GRID:grid.266100.3) (ISNI:0000 0001 2107 4242); University of California San Diego, Department of NanoEngineering, La Jolla, USA (GRID:grid.266100.3) (ISNI:0000 0001 2107 4242) 
Pages
43
Publication year
2024
Publication date
2024
Publisher
Nature Publishing Group
e-ISSN
20573960
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2931865404
Copyright
© The Author(s) 2024. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.