Abstract

Characterization of material structure with X-ray or neutron scattering using e.g. Pair Distribution Function (PDF) analysis most often rely on refining a structure model against an experimental dataset. However, identifying a suitable model is often a bottleneck. Recently, automated approaches have made it possible to test thousands of models for each dataset, but these methods are computationally expensive and analysing the output, i.e. extracting structural information from the resulting fits in a meaningful way, is challenging. Our Machine Learning based Motif Extractor (ML-MotEx) trains an ML algorithm on thousands of fits, and uses SHAP (SHapley Additive exPlanation) values to identify which model features are important for the fit quality. We use the method for 4 different chemical systems, including disordered nanomaterials and clusters. ML-MotEx opens for a type of modelling where each feature in a model is assigned an importance value for the fit quality based on explainable ML.

Details

Title
Extracting structural motifs from pair distribution function data of nanostructures using explainable machine learning
Author
Anker, Andy S. 1   VIAFID ORCID Logo  ; Kjær, Emil T. S. 1   VIAFID ORCID Logo  ; Juelsholt, Mikkel 2 ; Christiansen, Troels Lindahl 1 ; Skjærvø, Susanne Linn 1 ; Jørgensen, Mads Ry Vogel 3   VIAFID ORCID Logo  ; Kantor, Innokenty 4 ; Sørensen, Daniel Risskov 3   VIAFID ORCID Logo  ; Billinge, Simon J. L. 5   VIAFID ORCID Logo  ; Selvan, Raghavendra 6 ; Jensen, Kirsten M. Ø. 1 

 University of Copenhagen, Department of Chemistry and Nano-Science Center, Copenhagen, Denmark (GRID:grid.5254.6) (ISNI:0000 0001 0674 042X) 
 University of Oxford, Department of Materials, Oxford, UK (GRID:grid.4991.5) (ISNI:0000 0004 1936 8948) 
 Aarhus University, Department of Chemistry & iNANO, Aarhus, Denmark (GRID:grid.7048.b) (ISNI:0000 0001 1956 2722); Lund University, MAX IV Laboratory, Lund, Sweden (GRID:grid.4514.4) (ISNI:0000 0001 0930 2361) 
 Lund University, MAX IV Laboratory, Lund, Sweden (GRID:grid.4514.4) (ISNI:0000 0001 0930 2361); Technical University of Denmark, Department of Physics, Lyngby, Denmark (GRID:grid.5170.3) (ISNI:0000 0001 2181 8870) 
 Columbia University, Department of Applied Physics and Applied Mathematics, New York, USA (GRID:grid.21729.3f) (ISNI:0000000419368729); Brookhaven National Laboratory, Condensed Matter Physics and Materials Science Department, Upton, USA (GRID:grid.202665.5) (ISNI:0000 0001 2188 4229) 
 University of Copenhagen, Department of Computer Science, Copenhagen, Denmark (GRID:grid.5254.6) (ISNI:0000 0001 0674 042X); University of Copenhagen, Department of Neuroscience, Copenhagen, Denmark (GRID:grid.5254.6) (ISNI:0000 0001 0674 042X) 
Publication year
2022
Publication date
2022
Publisher
Nature Publishing Group
e-ISSN
20573960
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2719938825
Copyright
© The Author(s) 2022. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.