Abstract

Adverse pregnancy outcomes, such as low birth weight (LBW) and preterm birth (PTB), can have serious consequences for both the mother and infant. Early prediction of such outcomes is important for their prevention. Previous studies using traditional machine learning (ML) models for predicting PTB and LBW have encountered two important limitations: extreme class imbalance in medical datasets and the inability to account for complex relational structures between entities. To address these limitations, we propose a node embedding-based graph outlier detection algorithm to predict adverse pregnancy outcomes. We developed a knowledge graph using a well-curated representative dataset of the Emirati population and two node embedding algorithms. The graph autoencoder (GAE) was trained by applying a combination of original risk factors and node embedding features. Samples that were difficult to reconstruct at the output of GAE were identified as outliers considered representing PTB and LBW samples. Our experiments using LBW, PTB, and very PTB datasets demonstrated that incorporating node embedding considerably improved performance, achieving a 12% higher AUC-ROC compared to traditional GAE. Our study demonstrates the effectiveness of node embedding and graph outlier detection in improving the prediction performance of adverse pregnancy outcomes in well-curated population datasets.

Details

Title
Node embedding-based graph autoencoder outlier detection for adverse pregnancy outcomes
Author
Khan, Wasif 1 ; Zaki, Nazar 2   VIAFID ORCID Logo  ; Ahmad, Amir 3 ; Masud, Mohammad M. 3 ; Govender, Romana 4 ; Rojas-Perilla, Natalia 5 ; Ali, Luqman 1 ; Ghenimi, Nadirah 4 ; Ahmed, Luai A. 6 

 United Arab Emirates University, Department of Computer Science and Software Engineering, College of Information Technology, Al Ain, United Arab Emirates (GRID:grid.43519.3a) (ISNI:0000 0001 2193 6666) 
 United Arab Emirates University, Department of Computer Science and Software Engineering, College of Information Technology, Al Ain, United Arab Emirates (GRID:grid.43519.3a) (ISNI:0000 0001 2193 6666); ASPIRE Precision Medicine Research Institute Abu Dhabi (ASPIREPMRIAD), Al Ain, United Arab Emirates (GRID:grid.43519.3a) 
 United Arab Emirates University, Department of Information Systems and Security, College of Information Technology, Al Ain, United Arab Emirates (GRID:grid.43519.3a) (ISNI:0000 0001 2193 6666) 
 United Arab Emirates University, Department of Family Medicine, College of Medicine and Health Sciences, Al Ain, United Arab Emirates (GRID:grid.43519.3a) (ISNI:0000 0001 2193 6666) 
 United Arab Emirates University, Department of Analytics in the Digital Era, Al Ain, United Arab Emirates (GRID:grid.43519.3a) (ISNI:0000 0001 2193 6666) 
 United Arab Emirates University, Institute of Public Health, College of Medicine and Health Sciences, Al Ain, United Arab Emirates (GRID:grid.43519.3a) (ISNI:0000 0001 2193 6666); United Arab Emirates University, Zayed Centre for Health Sciences, Al Ain, United Arab Emirates (GRID:grid.43519.3a) (ISNI:0000 0001 2193 6666) 
Pages
19817
Publication year
2023
Publication date
2023
Publisher
Nature Publishing Group
e-ISSN
20452322
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2889800994
Copyright
© The Author(s) 2023. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.