Introduction
Maternal health issues for women arise during pregnancy, childbirth, and the postnatal period [1]. At the time of pregnancy, women are at a higher risk of health complications which may lead to miscarriage and death in many cases. Every pregnancy stage must have a positive experience to ensure the good health of babies and women. Women who maintain their health during pregnancy and after childbirth, have a high probability of maintaining their health later in life. Such women have good birth outcomes, impacting childhood, adolescence, and adulthood of the children [2]. As a result, women’s health is important for both the mother and children. To reduce maternal and perinatal death and morbidity, actions across different stages are required. Before pregnancy, the overall lifestyle choices and health of women can affect maternal health risks. During pregnancy, high-quality care is necessary to ensure pregnancy health for mother and baby [3]. It is crucial to monitor both mother and baby’s health throughout the postpartum period [4], as both are at a higher risk of mortality during the first week of postpartum. The symptoms can be detected and managed early which can lower the risk of death.
The maternal mortality ratio refers to the number of women who die from pregnancy-associated factors per 100000 live births [5]. The world health organization (WHO) reports that approximately 300000 women died from pregnancy-associated causes in 2017 [6], which is 808 women every day. By analyzing the global maternal deaths per region, two-thirds (200000) of maternal deaths occur in sub-Saharan Africa and 19% (57000) of maternal deaths occur in South Asia alone [7]. In 2017, the five nations with the highest number of maternal fatalities are Tanzania (11000), Ethiopia (14000), the Democratic Republic of Congo (16000), India (35000), and Nigeria (67000) [6]. Pregnant women’s health factors such as age, and blood disorders including high or low blood pressure, blood glucose levels, body temperature, and heart rate can directly cause higher levels of complications during pregnancy resulting in loss of women’s pregnancy and death of both women and child. These health factors need to be controlled by special medications from the medical specialist whereby an early prediction of such health risks can potentially help the medical experts take appropriate actions to alleviate the probability of death.
In recent years, deep learning-based neural networks have been used in different domains to solve diverse problems [8]. Deep neural networks utilize layered architecture to learn the complex correlations among the variables. In deep learning, each successive layer utilizes the output from the previous layers. The deep learning techniques provide more accurate results, especially with the large amounts of input data [9]. Each successive layer is activated by receiving input from its neighboring neurons. Deep learning models are used for complex, precise, and accurate data analysis.
Deep learning-based models have been utilized for a rich variety of medical tasks like disease prediction using images, health records, and time-series data. Such models discover hidden patterns from medical data and help researchers and medical professionals to diagnose rapidly and accurately [10]. Deep learning provides the analysis of different infections accurately and helps doctors with better treatment. As a result, better medical decisions are made. Deep learning is also used for patient care and supporting clinicians [11]. Deep learning models analyze unstructured and structured medical records including diagnosis with better accuracy [12]. Medical applications [13] of deep neural networks include medical imaging, healthcare data analytics, maternal health care, drug discovery [14], genomics analysis [15], and many more. Taking due account of recent applications, suitability, and efficiency of deep learning models, this study adopts deep neural models to predict health risks associated with pregnancy and makes the following contributions
* A novel deep neural network architecture is proposed which considers decision trees, Bidirectional long short term memory (BiLSTM), and temporal convolutional network (TCN) to make DT-BiLTCN for feature extraction later to be used to train machine learning models. In this regard, support vector machine (SVM), extra tree classifier (ETC), logistic regression (LR), and decision tree classifier (DTC) are used to predict maternal health risk during women’s pregnancy. In addition, an ensemble model, BiLTCN comprising BiLSTM and TCN is also used.
* The maternal health exploratory data analysis (MHEDA) is performed to study the health conditions which serve as the strongest indicators to predict different maternal health risks during pregnancy.
* The performance of the DT-BiLTCN is further enhanced by two factors in essence including data resampling for data balancing and fine-tuning of hyperparameters of the proposed model. The synthetic minority oversampling technique (SMOTE) is utilized to balance the health dataset in this study while hyperparameters are tuned to select the set of best-fit hyperparameters for better performance.
* A comparative analysis of several existing models is carried out with the proposed approach to analyze its efficiency. The receiver operating characteristic (ROC) accuracy curve analysis is conducted to analyze the performance of the proposed BiLTCN model at different risk thresholds.
The rest of this study is organized as follows. Related work is discussed in Section 2. Section 3 describes the architecture and methodological details of the BiLTCN model. Results and discussions are given in Section 4. In the end, Section 5 provides the conclusion.
Related work
The importance of maternal health led many researchers to devise models and approaches for the timely prediction of health risks during pregnancy using both traditional and machine learning techniques. Some studies focus on observing pregnant women’s conditions for analyzing and recording health risk factors while other aims at the prediction. For example, a pregnancy risk detection system (PRDS) is constructed to detect the pregnancy risk level based on the pregnant woman’s experienced symptoms [16]. The research observations are conducted continuously over time at Panimbang health center. Several health risk factors are observed like age over 35 or under 20 years old, having given birth before, history of disorder during pregnancy, and miscarriage. The pregnancy risk level is divided into three categories of high risk, low risk, and moderate risk.
Pregnant women’s health risk levels are predicted and monitored by using cloud-based machine learning techniques in Bangladesh [17]. The technique identified the maternal health risk intensity by analyzing the pregnant women’s health factors. A total of 1014 samples have been collected from numerous sources by utilizing wearable sensing devices. Python and WEKA toolkit platforms are used for model building. The maternal health risk levels are divided into low risk, mid risk, and high risk with the help of medical experts. The health factors like age, blood pressure, and blood sugar are identified as the key factors for high risk. Experimental results show that DT achieved a 97% accuracy score using hyperparameter tuning [18]. The pregnancy data is analyzed and utilized to build a machine learning classifier or health risk prediction during the pregnancy of women in [19]. The pregnancy data of 600 samples are collected from three medical centers in Bangalore. The employed DT model achieved a 71% accuracy score. The relative absolute error value is 99% and root related squared error value is 99% for the DT classifier. [20] predict health complications by using two machine learning-based classifiers. The study aims to reduce the fetal and maternal mortality rate by analyzing the pregnancy-related dataset. The classifiers are applied to identify the maternal health risk of pregnant women. Similarly, Naive Bayes (NB) and decision tree (DT) are employed in [21]. DT achieved an accuracy score of 66% while the relative absolute error value is 74% and root related squared error value is 110%.
Maternal mortality rate health risk factors are identified and observed using different data repositories in [22]. The study uses the root mean square error (RMSE) based on k-fold cross-validation to evaluate the proposed model performance. The world indicator 2015 (WDI-2015) dataset is utilized for the model building which contains 1350 samples from 1960 to 2015. The linear regression showed an RMSE of 0.709. The study asserts the need of reducing the rural population growth and ensuring necessities for better child care to reduce the pregnancy-related mortality rate. An efficient classification and regression tree (CART) binary DT approach is proposed to predict the high pregnancy risk using fetal health status in [23]. The cardiotocography dataset from UCI consisting of 2126 fetal cardiotocograms is utilized for risk prediction. Using a 5-fold cross-validation, it achieves an 88% accuracy.
Besides using the machine learning models on the collected data, some approaches devise telemedicine or electronic online monitoring systems for data collection. For example, a telemedicine framework is proposed to help health doctors in predicting pregnancy health risks in the Philippines [24]. The cloud-based dataset [25] of 97 samples was utilized to train and test four machine learning techniques such as DT, random forest (RF), k nearest neighbor (KNN), and support vector machine (SVM). RF achieved a 90% accuracy score by employing hyperparameter tuning. The telemedicine framework on imputing pregnant women data displayed the negative or positive cases concerning the possibility of pregnancy high-risk factors. Similarly, an Internet of things (IoT) based integrated system is developed in [26] to monitor the fetal and maternal signals for high-risk pregnancy predictions. IoT devices are used for fetal and maternal monitoring. A 1D convolutional neural network (CNN) [27] is utilized to predict the health risks during pregnancy. The maternal clinical factors and the heart rate of the fetal are monitored. The utilized maternal factors are diastolic/systolic blood pressure, oxygen saturation, temperature, heart rate, and uterine tonus activity. The proposed system achieved an accuracy score of 92% for fetal and maternal emergencies. Similarly, CNN is reported to have superior performance for distracted driver detection in [28].
Ensemble models show better performance than individual models, so they have been adopted for pregnancy health risk prediction as well. An ensemble classifiers-based [29] approach is utilized to predict the birth mode in this study [30]. The study provides the proper identification of health risk levels accomplice with pregnant woman delivery and helps reduce the mortality rate in Bangladesh. The research dataset consisting of 4493 samples is collected from the Demographic and Health Survey (BDHS-2014) in Bangladesh. The ensemble classifier achieved an 86% of accuracy score for birth mode classification. The maternal health risk prediction based on electronic health registries using an RF is proposed in [31]. The relevant features are selected by the recursive feature elimination (RFE) for the decision-making task. The systematic process of data analysis, preparation, and modeling are developed. The study uses several machine learning-based models with a grid search strategy using k-fold cross-validation and hyperparameters tuning. RF achieves an accuracy score of 93% in terms of maternal health risk prediction.
The discussion of the above-cited studies indicates that several aspects need further research efforts. First, the prediction accuracy is comparatively low for the majority of the works, except for a few single and ensemble models. Second, predominantly, studies focus on binary classification and do not consider the risk levels. Third, deep learning models are seldom studied. This study employs two deep learning models and predicts the health risks related to pregnant women into low, medium, and higher levels.
Materials and methods
This section elaborates on steps related to employed techniques and methods to predict maternal risk. The flow diagram of the adopted methodology is visualized in Fig 1. It contains several steps which are followed sequentially.
* Step 1: The maternal health-related dataset is acquired in this step. The dataset is obtained from the Kaggle repository and is publicly available. Originally, Marzia et al. created the dataset using the IoT-based risk monitoring system [32]. The doctors structured the dataset and made it available for research purposes. The collected data contain various prominent health factors related to maternal health risk prediction.
* Step 2: MHEDA is applied to examine the pregnancy risk factors and get useful insights from the collected data. MHEDA contains numerous analysis graphs. The statistical data analysis, violin graphs, pie charts, feature relation analysis, and many more graphs are analyzed to get useful health factors and health conditions during pregnancy.
* Step 3: This step contains the dataset resampling technique implementation. The collected health dataset is imbalanced which hinders the full potential of the proposed approach. Training of the proposed model may become overfit due to class imbalanced data distributions. To balance the dataset, SOMTE is utilized.
* Step 4: The encoding of the category of health dataset is utilized in this step. The low risk, medium risk, and high-risk labels are converted to numeric forms 0, 1, and 2, respectively. The label encoding is applied to convert the target label into machine-readable labels. Now the dataset is in a structured format for the next research steps.
* Step 5: The dataset splitting is carried out in this step. The splitting depends upon the split of two portions of data one for proposed model training and the other portion of data for model testing and evaluations. The 80% of data is used for the train portion and 20% data for model evaluations.
* Step 6: The DT-BiLTCN deep learning-based model is built in this step. The DT-BiLTCN is a hybrid of DT, Bidirectional LTSM, and TCN models. It is tested and evaluated with 20% of the dataset. The DT-BiLTCN model is used for feature extraction. The extracted feature set is later used to train all the machine learning models and the evaluation is carried out using accuracy, classification report, F1 score, ROC curve accuracy, precision, and recall score.
* Step 7: Based on the model performance evaluation result, hyperparameter tuning is employed in this step. The model hyperparameter tunning is applied to get more accurate results from the proposed model. The number of epochs, model layers, vocabulary size, loss function, accuracy matrix, loss optimizer, and the total trainable parameters involved are the hyperparameters of the proposed model.
[Figure omitted. See PDF.]
Maternal health dataset
The maternal health data has been collected from different hospitals, community clinics, maternal health cares through the IoT-based risk monitoring system [33]. Marzia et al. from the Institution of Daffodil International University, Dhaka, Bangladesh created the dataset and it is publicly available [32]. The benchmark dataset is publicly available on Kaggle [34]. The dataset contains 7 features including age, SystolicBP, DiastolicBP, BS, BodyTemp, HeartRate, and RiskLevel as target classes. Table 1 describes the maternal health dataset using features-related information. The total maternal health data sample size is 1218.
[Figure omitted. See PDF.]
Maternal health exploratory data analysis
MHEDA refers to the process of discovering dataset patterns, hypothesis tests, and assumption checks by utilizing the graphical data representations and statistics summary of the dataset. MHEDA helps to summarize the main dataset characteristics and features relation analysis. The feature relation analysis and data visualization methods help in the proposed model’s prediction process. Results of statistical dataset features analysis are given in Table 2. The feature statistic is based on the count, mean, standard deviation (std. dev.), minimum (min), 25%, 50%, 75%, and maximum (max) values. The analysis demonstrates that the dataset contains 1218 rows for each feature. The minimum values show the lowest limit, and the maximum values show the highest limit for all features. The minimum age of involved patients is 10, and the maximum age is 70 years. The analysis demonstrates that with the maximum age of 70 there is an indication of low risk during pregnancy. The systolic BP has a minimum value of 70, and the highest value is 160. Diastolic BP has a minimum value of 49, and the highest value is 100. Patients’ minimum and maximum heart rate values are 70 and 90. All the dataset features are analyzed statistically to check the behavior of each patient’s values in predicting the maternal health risk.
[Figure omitted. See PDF.]
Fig 2 shows the analysis of different features of the collected dataset. It shows that each feature has a different correlation between high, medium, and low health risk levels indicating the importance of features to predict each class of risks. The bar chart distribution analysis between the age of pregnant women and the risk level is analyzed in Fig 2a. The analysis shows that from the age of 10 to 13 there is a high level of risk during pregnancy. In the age range of 18 to 30, the risk level is normal in all aspects. Between the age of 40 to 60 pregnant women face a high-risk level. The distribution analysis between the systolic blood pressure of pregnant women and the risk level is analyzed in Fig 2b. The analysis describes that systolic blood pressure of 140 causes a high-risk level. The 80 to 100 has a low-level risk of systolic blood pressure. The distribution analysis between the diastolic blood pressure of pregnant women and the risk level is analyzed in Fig 2c. The analysis shows that diastolic blood pressure at 100 causes high pregnancy risk levels.
[Figure omitted. See PDF.]
The correlation analysis is a statistical technique utilized to find out one feature’s change concerning the other feature of the dataset. We found the degree of the meaningful relationship and association among different dataset feature variables. The correlation analysis gives an association measure value between different dataset variables. The analysis demonstrates that the body temperature features are highly negatively correlated with all dataset features. The features systolic BP and diastolic BP have a low negative correlation with only the heart rate feature. Other dataset features have a positive correlation. A high positive correlation is between the systolic BP and diastolic BP features in context to the classification problem. The maternal health dataset feature correlation analysis is shown in Fig 3.
[Figure omitted. See PDF.]
The 3-dimensional (3D) relation analysis of prominent dataset features with maternal pregnancy risk level and the age of the women are examined. The 3D scatter plots are used to plot data points on three axes to show the relationship between three variables. The conducted analysis is visualized in Fig 4.
[Figure omitted. See PDF.]
The blood glucose (BS)levels in terms of the molar concentration, age, and risk level are analyzed in Fig 4a. We take the age, BS, and risk levels on the x, y, and z-axis, respectively to show the maternal health risk. It is observed that the risk level is low when the BS is between 6 to 8 and the age varies from 10 to 70. Risk level changes to medium when the BS is between 8 to 18 and age is between 10 to 50. The risk level is high when the BS is between 10 to 18 and the age is between 10 to 50.
In Fig 4b age, heart rate, and risk level is taken on the x, y, and z-axis, respectively. The graph shows that the risk level is low when the heart rate is above 60 and the age is from 10 to 70. There is a low-risk level when the age is 10 to 35. The risk level is medium when the heart rate is above 80 and the age is between 10 to 50. The risk level is high when the heart rate is above 80 and the age is between 10 to 70.
In Fig 4c age, diastolic blood pressure (BP), and risk level are taken on the x, y, and z-axis, respectively to show the maternal health risk. The risk level is low under the age of 70 and DBP is above 50. There are higher chances to remain at risk level low when the age is between 10 to 30 and DBP is above 60. When the age is under 50 and DBP is above 50 the risk level is medium. The risk level is high when the DBP is above 70 and the age is under 60.
In Fig 4d age, body temperature, and risk level are taken on the x, y, and z-axis, respectively. When the body temp is 97 and the age is between 10 to 70 the risk level is low. The graph also shows that when the age is under 30 the body temperature varies from 96 to 103 however the risk level remains low. When the body temperature is 97 and the age is under 40 then the risk level is medium and the graph also shows that the body temperatures vary from 97 to 103 under the age of 40 the risk level remains medium. The risk level is high when the body temperature is above 100 and under the age of 30.
The pair plot is utilized to find the best set of data features that elaborate a relationship between two features and is used to form the most separated clusters. The pair plot analysis among all the dataset features with the risk levels are visualized in Fig 5. This analysis demonstrated the distributions of every single feature with all other dataset features. The feature plots are in the matrix form in which the column represents the y-axis and the row represents the x-axis. For each feature, the main diagonal is the univariate distribution. In this analysis, the hue parameter is set to the value risk level feature. The feature scatters subplots are utilized to find the pairwise relationships among the distributions
[Figure omitted. See PDF.]
Maternal health data resampling
The maternal data resampling is conducted to achieve the best results from the proposed BiLTCN approach. Data resampling is applied to generate samples of equal weights and quantify the uncertainty of risk levels. SMOTE [35] is utilized to balance the maternal health dataset. From Fig 6, it can be observed that the number of samples for each class is balanced. The actual dataset contains 406 instances for low risk, 336 for mid risk, and 272 for high risk. By applying the data balancing technique, all classes have an equal 406 instances.
[Figure omitted. See PDF.]
Maternal risk label encoding
The label encoding is performed to convert the target category (label) into a machine (model) readable form [36]. The maternal risk Level categories low risk, mid risk, and high risk are encoded as 0,1,2 respectively. These encoded labels are used for maternal risk prediction by the BiLTCN model.
Proposed DT-BiLTCN feature engineering
The proposed approach combines DT and BiLTCN for feature engineering on two rationales. First, DT and BiLTCN individually perform well for the task at hand. Owing to their performance, these models are combined in the proposed approach. In addition, we experimented with other combinations, however, the performance of those combinations was inferior. Secondly, it is empirically found that the use of prediction probabilities as features for model training yields better results as compared to using raw features. Therefore, this study proposes the use of prediction probabilities as the features for maternal health risk prediction.
Machine learning and deep learning-based feature engineering approaches are deployed in this study to improve the performance of models. Feature engineering is performed using a combination of DT and BiLTCN. BiLTCN is the hybrid of Bidirectional LSTM [37] and TCN deep learning techniques. DT-BiLTCN approach is utilized as a feature engineering technique in this study. The feature set extraction and formation mechanism from the dataset using the DT-BiLTCN is shown in Fig 7.
[Figure omitted. See PDF.]
The use of the DT-BiLTCN model for feature extraction, in the perspective of maternal health risk prediction, is illustrated in Fig 8. DT-BiLTCN approach is based on the hybrid of DT and BiLTCN models, so a brief description of each of these models is provided here.
[Figure omitted. See PDF.]
Decision tree.
DT is a supervised machine learning model and is commonly used for classification tasks. Classification or prediction is based on decision rules inferred from training data. DT has a tree-structured model, where the features of data are represented on internal nodes; the decision rules are represented by branches, and leaf nodes represent the outcomes. DT compares the values of the root attribute of the tree with the record attribute to predict the target class. The comparison of values continues until it reaches the leaf node of the tree. DT model is selected based on the reported performance in existing literature [38, 39]. Tree split criteria are important which determine the level of impurity and vary regarding classification and regression tasks. For example, Gini and Entropy are the commonly used criteria for classification while for regression tasks, mean squared error and half Poison deviance are used. This study uses Entropy with the following equation(1)
BiLTCN model.
A deep learning-based novel BiLTCN model is also proposed in this study for performance comparison with the DT-BiLTCN model. The BiLTCN is the hybrid of Bidirectional LSTM and TCN deep learning networks. LSTM is an improved form of recurrent neural network (RNN) [40, 41] that allows information patterns to persist for a long time. There are three parts of LSTM which are referred to as gates. Forget gate is the first part, the input gate is the second part, and the output gate is the last part of LSTM they are defined as(2) (3) (4)where Xt shows the input for the current timestamp, Uf is the weight attached with the input, (t − 1) is the hidden state for the previous timestamp, Wf is the weight matrix attached with the hidden state, Ui is the input weight matrix, H(t − 1) is the hidden state of the previous timestamp, and Wi is the weight matrix of input attached with a hidden state.
A BiLSTM [42] model contains two LSTM models where one LSTM takes the input information sequence in the forward direction (past to future) and the other LSTM in the backward direction (future to past). The BiLSTM input flow in both directions is used to preserve the future and past sequence information. The input X information sequence is computed in the forward and backward direction shown in Eqs 6, and 7. The final output cell Y at time t is constituted in both directions as in Eq 8.(5) (6) (7) (8)
The TCN [43] model is a variation of the CNN and is commonly used for sequence modeling tasks. The TCN working is to encode information from an input sequence. The TCN model has longer memory as compared to RNN with equal capacity. TCN utilizes causal convolutions, where at time t an output is convolved with only elements from the time t and the prior from the previous layer [44]. TCN utilizes a fully convolutional network (FCN) architecture [45]. The input layer has the length of each hidden layer. In the causal convolutional layer zero padding is added to the length (kernel size—1) to keep consequent layers the same length as the previous layer.(9) (10)
The dataset samples are fed into the DT and BiLTCN models for training which provide the predicted class probabilities as the output. However, instead of using class probabilities for prediction, they are used to train machine learning models. The feature set obtained from DT and BiLTCN are combined to make the hybrid feature set and is represented by Vhf (11)where DTf and BiLTCNf are the feature sets provided by the DT and BiLTCN, respectively, and are calculated as(12) (13)where P(c1)DT and P(c1)Bi show the probability of class 1 from the DT and BiLTCN models, respectively when the input data X is fed into these models. The X is the input dataset features (independent variables) that contain the whole records.
The proposed BiLTCN model build configuration parameters are analyzed in Table 3. The table contains an analysis of the number of layers, the neuron units, layer activation function, output shape of layer, and the total parameters used. The first layer of the model is the embedding layer [46] which contains 50000 neuron units, (None, 6, 64) output shape, and 3200000 total parameters used. The second layer is the BiLSTM layer having 128 neuron units, (None, 6, 256) output shape, and 197632 total parameters are used. The next layer is the TCN layer having 64 neuron units, (None, 64) output shape, and 201536 total parameters are used. The next layers are the family of dense layer networks. The 3 dense layers have 32, 16, 8 neuron units, (None, 32), (None, 16), (None, 8) output shape, ReLU is activation function, and 2080, 528, 136 total parameters are used, respectively. The last layer is the output layer having 3 neurons output units with Softmax as activation function, (None, 3) output shape, and 27 are the total parameters.
[Figure omitted. See PDF.]
The layered architectural analysis [47] of our proposed approach is illustrated in Fig 9. The first layer of the architecture is the input layer having the (None, 6) input and output shape. The second layer is the Embedding layer having the (None, 6) input shape and (None, 6, 64) output shape. Then in the layer stack, the Bidirectional LSTM layers are involved. The Bidirectional LSTM layer has an input shape of (None, 6, 64) and an output shape is (None, 6, 256). The next layer in the stack is the Temporal Convolutional Network layers that have an input shape of (None, 6, 256) and output shape of (None, 64). Then a family of 4 dense layers is involved. The last layer is the output layer of the architecture. The output layer has the input shape of (None, 8) and the output shape of (None, 3). This analysis demonstrated the layered stack of the proposed BiLTCN approach.
[Figure omitted. See PDF.]
The hyperparameter tuning is applied to find out a set of best-fit hyperparameters [48] for the proposed BiLTCN model. The involved hyperparameter is the number of epochs, number of layers, model vocabulary size, the loss function, accuracy matrix, loss optimizer, and the total trainable parameters. The final best-fit hyperparameters for the BiLTCN model are given in Table 4. The proposed model is trained for 50 epochs. The model contains seven layers and the vocabulary size is 50000. The categorical cross-entropy is used as the loss function. The loss optimizer utilized is Adam with a learning rate of 0.001. The total number of model parameters is 3601939.
[Figure omitted. See PDF.]
Results and discussions
This section contains the experimental results of machine learning models with respect to different perspectives. The performance of models is evaluated using imbalanced and balanced datasets. BiLTCN and DT are both evaluated as feature extractors, as well as classifiers. The performance of the proposed DT-BiLTCN is compared with BiLTCN as well regarding the feature extractor.
Experiment setup
All experiments are performed using HP EliteBook 8440p with CPU 2.53GHz 2 cores, Intel core i5, having a RAM of 4.00 GB. The Python TensorFlow tool [49] is utilized for proposed model building and performance evaluations. Data splitting helps to optimize hyperparameters of models, estimate the model generalization performance, and avoid model overfitting [50]. We have split the maternal health data into 0.8 for training purposes and 0.20 for testing purposes. The random state instance value is 42 for data spiting. The utilized evaluation metrics are recall, precision, accuracy, F1 score, and ROC curve accuracy score values.
Performance metrics
The accuracy shows the total number of accurate predictions from the predictive model out of all the predictions. It is calculated as(14)where TPR is the true positive rate and shows correctly identified by our predictive model, FPR is the false positive rate and incorrectly identified positive samples, TNR is the true negative rate showing the correctly predicted negative samples and FNR is the false negative rate and indicates the incorrectly identified negative samples.
Precision is the proportion of accurately predicted samples by the proposed model to all expected positive data samples. It is calculated using(15)
Recall shows the preciseness of the model and is calculated using(16)
The F1 sore performance metric is based on the combination of recall and precision and is calculated by taking the harmonic mean of recall and precision.(17)
The ROC curve accuracy performance metric is the probability curve that shows the TPR in comparison to FPR at different threshold values. The higher values of the ROC, the superior the proposed learning technique classification performance.(18)
Results of machine learning models using original features
Initial experiments are carried out using the imbalanced data with different train and test split ratios to analyze the performance of deployed models. SVM, ETC, LR, and DTC are employed for these experiments, and results are provided in Table 5. Among the employed models, the DTC achieves the highest accuracy of 80 each for 80:20 and 85:15 train-test split ratios. Due to an imbalanced dataset, predominantly the models perform poorly for maternal health risk prediction.
[Figure omitted. See PDF.]
Further experiments are performed using the SMOTE balanced dataset for making equal samples for three classes of the dataset and results are displayed in Table 6. Indicating an accuracy of 84%, BiLTCN outperforms other models when used with the balanced dataset. It is a high individual accuracy as compared to models’ performance with the imbalanced dataset, so for further experiments, only the balanced dataset is used.
[Figure omitted. See PDF.]
Performance evaluation of BiLTCN.
Experiments are performed to analyze the accuracy and other performance metrics for the proposed BiLTCN model. Epoch-wise results regarding the training accuracy and loss are presented in Table 7 indicating an average accuracy of 0.88 for the BiLTCN.
[Figure omitted. See PDF.]
Experimental results in Table 8 demonstrate the classification report for each maternal health risk category. The low-risk level category (0) has an 80% precision score, 82% recall score, and 81% F1 score. The medium risk level category (1) has an 80% precision score, 76% recall score, and 78% F1 score. The high-risk level category (2) has a 90% precision score, 89% recall score, and 90% F1 score. Averaged scores for precision, recall, and F1 are also provided. The precision score for micro, macro, and weighted average is 83% each while the samples-average score is 82%. Averaged scores for recall and F1 also correspond to the precision scores and show better results using the balanced dataset.
[Figure omitted. See PDF.]
The BiLTCN approach evaluation results with hyperparameter tuning are demonstrated in Table 9. The 50 epochs are used for model training. The training accuracy achieved by the model is 88% while the testing accuracy, precision, recall, and F1 scores are 84% each. BiLTCN obtains a ROC accuracy (micro) score of 97% for maternal health risk prediction.
[Figure omitted. See PDF.]
The ROC accuracy curve analysis of the BiLTCN approach is examined in Fig 10. It shows that the low-risk class achieves a 95% ROC accuracy, the mid-risk class achieves a 94% ROC accuracy, and the ROC accuracy for the high-risk class is 99%. These results show that the BiLTCN approach is better than other employed machine learning models for predicting maternal health risks.
[Figure omitted. See PDF.]
The results of the BiLTCN are summarized in Fig 11 regarding the average recall, precision, F1 score, and ROC. The higher the ROC accuracy score, the better the classification model. The micro-ROC accuracy score of the employed model is 97% which demonstrates the best performance score.
[Figure omitted. See PDF.]
Performance analysis using proposed DT-BiLTCN feature engineering
The performance analysis with the proposed DT-BiLTCN technique is carried out in this section. Experiments are performed using both the imbalanced and balanced datasets. Table 10 shows the results with the imbalanced dataset. Results indicate that SVM shows superb performance as compared to other models even with the imbalanced dataset using the proposed approach and obtains a 95% accuracy.
[Figure omitted. See PDF.]
The feature set is extracted by using DT-BiLTCN which is input to machine learning models for maternal risk prediction. Table 11 shows the comparative analysis of the accuracy score among the applied learning techniques. Results show that using the DT-BiLTCN-based features, the performance of the models is substantially improved. The highest accuracy is obtained by the SVM which is 98%. Similarly, its precision, recall, and F1 scores are also 98% each which shows superior performance as compared to other models.
[Figure omitted. See PDF.]
The category-wise classification report of the SVM is given in Table 12. Results show that SVM achieves a 100% precision for the low-risk class, while 98% and 97% for medium-risk and high-risk categories of maternal health risks. Recall and F1 scores also correspond to high performance for all categories of maternal health risk which indicates the superiority of the proposed DT-BiLTCN feature engineering approach.
[Figure omitted. See PDF.]
In addition to using the proposed DT-BiLTCN approach, DT and BiLTCN are also used as feature engineering approaches to carry out the performance comparison and the results are provided in Table 13. Results suggest that models perform superbly better when trained on the features extracted using the DT model. The class probabilities used by the DT are used to train the model, which improves their learning process and elevates their maternal health risk prediction performance. Results using DT features are far better than using the BiLTCN features.
[Figure omitted. See PDF.]
Despite the better performance of machine learning models with DT features, the performance of DT-BiLTCN features is still better, as shown in Fig 12. Results show that the highest accuracy of 98% is achieved when DT-BiLTCN features are used. Since the proposed approach combines features from better-performing DT and BiLTCN models, it contains an appropriate feature set that improves the training of the models and elevates their performance.
[Figure omitted. See PDF.]
Validation of proposed DT-BiLTCN
The proposed approach is validated using 10-fold cross-validation using the superior performing SVM model which obtained a 98% accuracy score. In addition, the results of all employed models are also provided in Table 14. As per the given results, the superior performance of SVM is validated where it obtains a 94% accuracy with a standard deviation of only ±0.02.
[Figure omitted. See PDF.]
Comparison of probability-based and original features
The original maternal health dataset features are small in size and are not linearly separable to a great extent. This non-linearly separable behavior results in poor performance of machine learning models. The proposed feature engineering technique is utilized to design the probability-based feature set by using DT and BiLTCN. The generated feature set is more linearly separable and distinguishes the target classes with a higher margin.
The experimental results demonstrate that by using the DTBiLTCN technique, the performance of the machine learning models is enhanced greatly. A visual representation of feature distribution is analyzed to verify the enhanced performance, as given in Fig 13. Fig 13a demonstrates that the dataset with the original feature is not linearly separable. However, by using the proposed feature engineering technique the feature space becomes more linearly separable as visualized in Fig 13b. This analysis validates the performance enhancement by using the proposed feature engineering technique.
[Figure omitted. See PDF.]
We also determined the computational cost for the applied machine learning model with the original and probability-based feature set. The analysis demonstrates that the proposed approach is significant in terms of accuracy as well as efficiency. The computational cost for the proposed technique is significantly better than other applied techniques. The highest accuracy is achieved with the lowest computational time, as analyzed in Table 15.
[Figure omitted. See PDF.]
Performance comparison with existing approaches
The comparative analysis of the existing state-of-the-art approaches in the context of maternal health risk prediction is demonstrated in Table 16. For this purpose, recent research works are compared with the proposed approach. The selected works include both machine learning and deep learning models. Performance analysis corroborates the superior performance of machine learning models when used with the proposed DT-BiLTCN feature extraction approach.
[Figure omitted. See PDF.]
Discussions
This study proposes a novel deep neural network architecture that leverages DT, Bi-LSTM, and TCN networks to make an ensemble for feature engineering. Based on the individual superior performance of these models, they are combined for feature engineering to predict maternal health risk during pregnancy. Empirically, the performance of prediction probability is found superior to that of original features. So, the feature set comprises the prediction probabilities from these models. For class imbalance, SMOTE is utilized. Experiments are performed using imbalanced and balanced data, with original features from the data and the features extracted using the proposed approach. Extensive experiments and analyses show that the proposed approach is superior regarding maternal health risk prediction. SMOTE-balanced data tend to show better results for a few models while the performance of the proposed approach is substantially elevated. Experimental results using DTC, LR, ETC, KNN, RFC, and SVM with the proposed DT-BiLTCN feature engineering approach show that accuracy, precision, recall, and F1 scores of 0.98 can be obtained when SVM is used with the proposed approach. This performance is significantly better than using the original features from the dataset. In addition, k-fold cross-validation results validate this performance. Performance comparison with existing state-of-the-art approaches further corroborates these results.
Conclusions and future work
Maternal health analysis and risk detection during pregnancy are very important to reduce the probability of health complications during pregnancy, childbirth, and postpartum periods. This study presents a novel approach for the automatic prediction of maternal health risks associated with pregnancy. A novel approach DT-BiLTCN is proposed for feature extraction which provides better features for machine learning models’ effective training. Of the employed models, SVM shows an excellent accuracy of 98% for low, medium, and high-risk categories of maternal health risks. When compared with the performance of DT and BiLTCN features, the proposed approach outperforms them regarding ROC and other metrics. Regarding, maternal health exploratory data analysis, the diastolic and systolic blood pressure, heart rate, and age of pregnant women are found to be the key health factors that can cause high-risk levels during pregnancy. We believe that the findings of this study are helpful for doctors, maternal health professionals, and the government to predict the risk factors associated with maternal mortality and can be used to alleviate such outcomes.
Supporting information
S1 Dataset.
https://doi.org/10.1371/journal.pone.0276525.s001
(ZIP)
Citation: Raza A, Siddiqui HUR, Munir K, Almutairi M, Rustam F, Ashraf I (2022) Ensemble learning-based feature engineering to analyze maternal health during pregnancy and health risk prediction. PLoS ONE 17(11): e0276525. https://doi.org/10.1371/journal.pone.0276525
About the Authors:
Ali Raza
Roles: Conceptualization, Formal analysis, Writing – original draft
Affiliation: Department of Computer Science, Khwaja Fareed University of Engineering and Information Technology, Rahim Yar Khan, Pakistan
Hafeez Ur Rehman Siddiqui
Roles: Conceptualization, Investigation, Methodology
Affiliation: Department of Computer Science, Khwaja Fareed University of Engineering and Information Technology, Rahim Yar Khan, Pakistan
https://orcid.org/0000-0003-0671-2060
Kashif Munir
Roles: Data curation, Formal analysis, Project administration
Affiliation: Department of Information Technology, Khwaja Fareed University of Engineering and Information Technology, Rahim Yar Khan, Pakistan
https://orcid.org/0000-0001-5114-4213
Mubarak Almutairi
Roles: Funding acquisition, Resources, Visualization
E-mail: [email protected] (MA); [email protected] (IA)
Affiliation: College of Computer Science and Engineering, University of Hafr Albatin, Albatin, Saudi Arabia
Furqan Rustam
Roles: Software, Visualization, Writing – original draft
Affiliation: Department of Software Engineering, School of Systems and Technology, University of Management and Technology Lahore, Lahore, Pakistan
Imran Ashraf
Roles: Supervision, Validation, Writing – review & editing
E-mail: [email protected] (MA); [email protected] (IA)
Affiliation: Information and Communication Engineering, Yeungnam University, Gyeongsan, Korea
https://orcid.org/0000-0002-8271-6496
1. Finlayson K, Crossland N, Bonet M, Downe S. What matters to women in the postnatal period: a meta-synthesis of qualitative studies. PloS one. 2020;15(4):e0231415. pmid:32320424
2. Turban JL, King D, Kobe J, Reisner SL, Keuroghlian AS. Access to gender-affirming hormones during adolescence and mental health outcomes among transgender adults. PLos one. 2022;17(1):e0261039. pmid:35020719
3. Bogren M, Denovan A, Kent F, Berg M, Linden K. Impact of the Helping Mothers Survive Bleeding After Birth learning programme on care provider skills and maternal health outcomes in low-income countries—An integrative review. Women and Birth. 2021;34(5):425–434. pmid:33041235
4. Mercan Y, Tari Selcuk K. Association between postpartum depression level, social support level and breastfeeding attitude and breastfeeding self-efficacy in early postpartum women. PloS one. 2021;16(4):e0249538. pmid:33798229
5. WHO U. Maternal mortality ratio (modeled estimate, per 100,000 live births) |Data; 2019. Available from: https://data.worldbank.org/indicator/SH.STA.MMRT?end=2017&start=2000&view=chart.
6. Max Roser and Hannah Ritchie. Maternal Mortality—Our World in Data;. Available from: https://ourworldindata.org/maternal-mortality.
7. Mehboob R, Gilani SA, Khalid S, Hassan A, Alwazzan A. Maternal Mortality Ratio in Low Income Developing Countries. In: Global Women’s Health. IntechOpen; 2021.
8. Kaur S, Singla J, Nkenyereye L, Jha S, Prashar D, Joshi GP, et al. Medical Diagnostic Systems Using Artificial Intelligence (AI) Algorithms: Principles and Perspectives. IEEE Access. 2020;8:228049–228069.
9. Manifold B, Men S, Hu R, Fu D. A versatile deep learning architecture for classification and label-free prediction of hyperspectral images. Nature machine intelligence. 2021;3(4):306–315. pmid:34676358
10. Zeng M, Li M, Fei Z, Wu FX, Li Y, Pan Y, et al. A Deep Learning Framework for Identifying Essential Proteins by Integrating Multiple Types of Biological Information. IEEE/ACM Transactions on Computational Biology and Bioinformatics. 2021;18(1):296–305. pmid:30736002
11. Berrar D, Dubitzky W. Deep learning in bioinformatics and biomedicine. Briefings in Bioinformatics. 2021;22(2):1513–1514. pmid:33693457
12. Theis J, Galanter WL, Boyd AD, Darabi H. Improving the In-Hospital Mortality Prediction of Diabetes ICU Patients Using a Process Mining/Deep Learning Architecture. IEEE Journal of Biomedical and Health Informatics. 2022;26(1):388–399. pmid:34181560
13. Wang D, Mo J, Zhou G, Xu L, Liu Y. An efficient mixture of deep and machine learning models for COVID-19 diagnosis in chest X-ray images. PloS one. 2020;15(11):e0242535. pmid:33201919
14. Neves BJ, Braga RC, Alves VM, Lima MN, Cassiano GC, Muratov EN, et al. Deep Learning-driven research for drug discovery: Tackling Malaria. PLoS computational biology. 2020;16(2):e1007025. pmid:32069285
15. Randhawa GS, Soltysiak MP, El Roz H, de Souza CP, Hill KA, Kari L. Machine learning using intrinsic genomic signatures for rapid classification of novel pathogens: COVID-19 case study. Plos one. 2020;15(4):e0232391. pmid:32330208
16. Krisnanik E, Tambunan K, Irmanda HN. Analysis of Pregnancy Risk Factors for Pregnant Women Using Analysis Data Based on Expert System. Proceedings—1st International Conference on Informatics, Multimedia, Cyber and Information System, ICIMCIS 2019. 2019; p. 151–156.
17. Ahmed M, Kashem MA. IoT Based Risk Level Prediction Model for Maternal Health Care in the Context of Bangladesh. 2020 2nd International Conference on Sustainable Technologies for Industry 40, STI 2020. 2020;0:19–20.
18. Pirjatullah, Kartini D, Nugrahadi DT, Muliadi, Farmadi A. Hyperparameter Tuning using GridsearchCV on The Comparison of The Activation Function of The ELM Method to The Classification of Pneumonia in Toddlers. In: 2021 4th International Conference of Computer and Informatics Engineering (IC2IE); 2021. p. 390–395.
19. Lakshmi BN, Indumathi TS, Ravi N. A novel health monitoring approach for pregnant women. 2015 International Conference on Emerging Research in Electronics, Computer Science and Technology, ICERECT 2015. 2016;48:324–328.
20. N LB, S DIT, Ravi DN. A Comparative Study of Classification Algorithms for Risk Prediction in Pregnancy. TENCON 2015—2015 IEEE Region 10 Conference. 2015;.
21. Indumathi N, Ramalakshmi R, Ajith V. Analysis of risk factors in the Firework Industries: Using Decision Tree Classifier. In: 2021 International Conference on Advance Computing and Innovative Technologies in Engineering (ICACITE); 2021. p. 811–814.
22. Jurzwk H, Wkh RI, Dv F, Dv Z. Building Prediction Models for Maternal Mortality; p. 3–8.
23. Ramla M, Sangeetha S, Nickolas S. Fetal Health State Monitoring Using Decision Tree Classifier from Cardiotocography Measurements. Proceedings of the 2nd International Conference on Intelligent Computing and Control Systems, ICICCS 2018. 2019;(Iciccs):1799–1803.
24. Bautista JM, Quiwa QAI, Reyes RSJ. Machine learning analysis for remote prenatal care. IEEE Region 10 Annual International Conference, Proceedings/TENCON. 2020;2020-Novem:397–402.
25. Masood A, Yang P, Sheng B, Li H, Li P, Qin J, et al. Cloud-Based Automated Clinical Decision Support System for Detection and Diagnosis of Lung Cancer in Chest CT. IEEE Journal of Translational Engineering in Health and Medicine. 2020;8:1–13. pmid:31929952
26. Marques JAL, Han T, Wu W, Do Vale Madeiro JP, Neto AVL, Gravina R, et al. IoT-Based Smart Health System for Ambulatory Maternal and Fetal Monitoring. IEEE Internet of Things Journal. 2021;8(23):16814–16824.
27. Alzubi JA, Kumar A, Alzubi O, Manikandan R, et al. Efficient Approaches for Prediction of Brain Tumor using Machine Learning Techniques. Indian Journal of Public Health Research & Development. 2019;10(2).
28. Alzubi JA, Jain R, Alzubi O, Thareja A, Upadhyay Y. Distracted driver detection using compressed energy efficient convolutional neural network. Journal of Intelligent & Fuzzy Systems. 2022;42(2):1253–1265.
29. Zhu Z, Wang Z, Li D, Zhu Y, Du W. Geometric Structural Ensemble Learning for Imbalanced Problems. IEEE Transactions on Cybernetics. 2020;50(4):1617–1629. pmid:30418931
30. Bin Alam MS, Patwary MJA, Hassan M. Birth Mode Prediction Using Bagging Ensemble Classifier: A Case Study of Bangladesh. 2021 International Conference on Information and Communication Technology for Sustainable Development, ICICT4SD 2021—Proceedings. 2021; p. 95–99.
31. Pereira SSL, Valter Costa Filho R, Ramos R, Oliveira M, Moreira MWL, Rodrigues JJPC, et al. Improving Maternal Risk Analysis in Public Health Systems. 2020 5th International Conference on Smart and Sustainable Technologies, SpliTech 2020. 2020.
32. UCI Machine Learning Repository: Maternal Health Risk Data Set Data Set — archive.ics.uci.edu;. https://archive.ics.uci.edu/ml/datasets/Maternal+Health+Risk+Data+Set.
33. Afreen H, Bajwa IS. An IoT-Based Real-Time Intelligent Monitoring and Notification System of Cold Storage. IEEE Access. 2021;9:38236–38253.
34. CSAFRIT. Maternal Health Risk Data | Kaggle; 2022. Available from: https://www.kaggle.com/datasets/csafrit2/maternal-health-risk-data.
35. Yi H, Jiang Q, Yan X, Wang B. Imbalanced Classification Based on Minority Clustering Synthetic Minority Oversampling Technique With Wind Turbine Fault Detection Application. IEEE Transactions on Industrial Informatics. 2021;17(9):5867–5875.
36. Zeng X, Huang J, Ding C. Soft-Ranking Label Encoding for Robust Facial Age Estimation. IEEE Access. 2020;8:134209–134218.
37. Bhaskar N, Suchetha M, Philip NY. Time Series Classification-Based Correlational Neural Network With Bidirectional LSTM for Automated Detection of Kidney Disease. IEEE Sensors Journal. 2021;21(4):4811–4818.
38. Jalal N, Mehmood A, Choi GS, Ashraf I. A novel improved random forest for text classification using feature ranking and optimal number of trees. Journal of King Saud University-Computer and Information Sciences. 2022;.
39. Shafi I, Din S, Hussain Z, Ashraf I, Choi GS. Adaptable reduced-complexity approach based on state vector machine for identification of criminal activists on social media. IEEE Access. 2021;9:95456–95468.
40. Paiva E, Paim A, Ebecken N. Convolutional Neural Networks and Long Short-Term Memory Networks for Textual Classification of Information Access Requests. IEEE Latin America Transactions. 2021;19(5):826–833.
41. Oguntala GA, Hu YF, Alabdullah AAS, Abd-Alhameed RA, Ali M, Luong DK. Passive RFID Module With LSTM Recurrent Neural Network Activity Classification Algorithm for Ambient-Assisted Living. IEEE Internet of Things Journal. 2021;8(13):10953–10962.
42. Ahmadzadeh E, Kim H, Jeong O, Kim N, Moon I. A Deep Bidirectional LSTM-GRU Network Model for Automated Ciphertext Classification. IEEE Access. 2022;10:3228–3237.
43. Tang P, Du P, Xia J, Zhang P, Zhang W. Channel Attention-Based Temporal Convolutional Network for Satellite Image Time Series Classification. IEEE Geoscience and Remote Sensing Letters. 2022;19:1–5.
44. MartÃn-Morató I, Cobos M, Ferri FJ. Adaptive Distance-Based Pooling in Convolutional Neural Networks for Audio Event Classification. IEEE/ACM Transactions on Audio, Speech, and Language Processing. 2020;28:1925–1935.
45. Zhao F, Tian M, Xie W, Liu H. A New Parallel Dual-Channel Fully Convolutional Network Via Semi-Supervised FCM for PolSAR Image Classification. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing. 2020;13:4493–4505.
46. Zhang F, Dvornek N, Yang J, Chapiro J, Duncan J. Layer Embedding Analysis in Convolutional Neural Networks for Improved Probability Calibration and Classification. IEEE Transactions on Medical Imaging. 2020;39(11):3331–3342. pmid:32356739
47. Zhang Z, Leng J, Ma L, Miao Y, Li C, Guo M. Architectural Implications of Graph Neural Networks. IEEE Computer Architecture Letters. 2020;19(1):59–62.
48. Cho H, Kim Y, Lee E, Choi D, Lee Y, Rhee W. Basic Enhancement Strategies When Using Bayesian Optimization for Hyperparameter Tuning of Deep Neural Networks. IEEE Access. 2020;8:52588–52608.
49. Sheng TJ, Islam MS, Misran N, Baharuddin MH, Arshad H, Islam MR, et al. An Internet of Things Based Smart Waste Management System Using LoRa and Tensorflow Deep Learning Model. IEEE Access. 2020;8:148793–148811.
50. Park Y, Ho JC. Tackling Overfitting in Boosting for Noisy Healthcare Data. IEEE Transactions on Knowledge and Data Engineering. 2021;33(7):2995–3006.
51. Ahmed H, Traore I, Saad S. Detection of online fake news using n-gram analysis and machine learning techniques. In: International conference on intelligent, secure, and dependable systems in distributed and cloud environments. Springer; 2017. p. 127–138.
52. Ahmed M, Kashem MA, Rahman M, Khatun S. Review and Analysis of Risk Factor of Maternal Health in Remote Area Using the Internet of Things (IoT). In: InECCE2019. Springer; 2020. p. 357–365.
53. Subramani P, BD P, et al. Prediction of muscular paralysis disease based on hybrid feature extraction with machine learning technique for COVID-19 and post-COVID-19 patients. Personal and ubiquitous computing. 2021; p. 1–14. pmid:33679282
54. Marmolejos L, AlOmar EA, Mkaouer MW, Newman C, Ouni A. On the use of textual feature extraction techniques to support the automated detection of refactoring documentation. Innovations in Systems and Software Engineering. 2022;18(2):233–249.
55. Rustam F, Khalid M, Aslam W, Rupapara V, Mehmood A, Choi GS. A performance comparison of supervised machine learning models for Covid-19 tweets sentiment analysis. Plos one. 2021;16(2):e0245909. pmid:33630869
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2022 Raza et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
Maternal health is an important aspect of women’s health during pregnancy, childbirth, and the postpartum period. Specifically, during pregnancy, different health factors like age, blood disorders, heart rate, etc. can lead to pregnancy complications. Detecting such health factors can alleviate the risk of pregnancy-related complications. This study aims to develop an artificial neural network-based system for predicting maternal health risks using health data records. A novel deep neural network architecture, DT-BiLTCN is proposed that uses decision trees, a bidirectional long short-term memory network, and a temporal convolutional network. Experiments involve using a dataset of 1218 samples collected from maternal health care, hospitals, and community clinics using the IoT-based risk monitoring system. Class imbalance is resolved using the synthetic minority oversampling technique. DT-BiLTCN provides a feature set to obtain high accuracy results which in this case are provided by the support vector machine with a 98% accuracy. Maternal health exploratory data analysis reveals that the health conditions which are the strongest indications of health risk during pregnancy are diastolic and systolic blood pressure, heart rate, and age of pregnant women. Using the proposed model, timely prediction of health risks associated with pregnant women can be made thus mitigating the risk of health complications which helps to save lives.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer