Introduction
Birthweight is a key index of maternal–fetal wellness and reflects the health profile of the newborn. Several factors influence fetal growth, including maternal dietary intake, circulatory system function, uteroplacental function, thyroid function, genetic conditions, and various risk factors such as diabetes, hypertension, and medical history1. Sustainable Development Goal 3.2 (SDG) aims to eliminate preventable neonatal deaths and requires all countries to reduce their neonatal mortality rate by 20302. Globally, about one in seven neonates is born as a low-birthweight baby (LBW)3,4.
The World Health Organization (WHO) classifies a birthweight of less than 2500 g as LBW, presenting a significant global challenge with both short and long-term complications4,5. LBW is further categorized into very low birthweight (VLBW), weighing less than 1500 g, and extremely low birthweight, which is less than 1000 g6. An estimated 19.8 million live newborns are classified as LBW, with majority of this population residing in Southern Asia and Sub-Saharan Africa7. Data from the National Family Health Survey indicates that the prevalence of LBW in India decreased from 22% in 2005–2006 to 17.5% in 2015–20168,9. Figure 1 illustrates the etiology, complications, prevention, treatment, and management of LBW.
[See PDF for image]
Fig. 1
Illustration of etiology, complications, prevention, treatment, and management of LBW.
LBW can result from various factors, including preterm birth, being small for gestational age (SGA), and intrauterine growth restriction (IUGR), or a combination of these issues. The causes of LBW are multifactorial and may include maternal age, low maternal weight and height, anemia, birth order, smoking history, socioeconomic status, short birth intervals, multiple pregnancies, poor nutrition, and inadequate preconception care, especially in developing and low-income countries. LBW newborns are at higher risk of short-term complications like respiratory distress, hypothermia, sepsis, feeding difficulties, intraventricular haemorrhage and necrotising enterocolitis. Even if VLBW neonates survive, they face an increased risk of long-term complications, such as neurodevelopmental delays, cognitive deficits, and disorders affecting the respiratory, cardiovascular, and immune systems10, 11, 12, 13–14. VLBW babies have more risk of perinatal morbidity and mortality as compared to LBW neonates too. A meta-analysis has shown that neonates weighing less than 2.5 kg have a 45% higher risk of developing type 2 diabetes compared to those weighing more than 2.5 kg15. Currently, fetal biometry measurements acquired during ultrasound aid in the estimation of fetal weight. Identifying fetuses at risk of LBW helps in treatment planning and early intervention, both of which can improve postnatal outcomes16.
Researchers are exploring multidisciplinary approaches that integrate artificial intelligence (AI) tools with medical data. These predictive models aid in improving patient care and diagnostic accuracy17. Table 1 briefly describes the previous studies on birthweight prediction. Supervised and unsupervised machine learning (ML) algorithms are being deployed to develop these models. These algorithms learn from a training dataset, identify relevant features, and make predictions based on the patterns in the new data18. A key consideration when implementing ML algorithms in clinical data is evaluating their effectiveness compared with the insights and judgments provided by medical professionals19.
Table 1. Description of previous studies on LBW prediction.
Reference | Objective | Algorithms used | Outcome |
---|---|---|---|
Reza, T.B. et al.20 | LBW feature selection and prediction using ML | Boruta algorithm and Wrapper method Several ML classifiers | The wrapper method is the most effective way to select features. RF classification worked best for classification |
Ranjbar A et al.21 | LBW prediction with ML | 8 different learning models | Classification with XGBoost outperformed all others |
de Morais FL.et al.22 | LBW prediction with tree-based ML model | 5 ML classifiers | Attribute selection and eliminating duplicate data improves the model |
Naimi, A.I et al.23 | Fetal growth prediction with ML | Regression-based and data mining techniques | Smoking while pregnant raised the chances of SGA |
Tao, J., et al.24 | Fetal birthweight prediction by a temporal ML method | Convolutional Neuron Networks (CNN), RF, Linear-Regression, Support Vector Regression (SVR), Back Propagation Neural Network (BPNN), and hybrid-LSTM | Hybrid-LSTM has the highest accuracy of 93.3 |
Rubaiya et al.25 | Unravelling birthweight determinants: Integrating ML, spatial analysis, and district-level mapping | Regression tree | Creates maps at the district level for regions that are at high risk of LBW |
To address the limitations of traditional AI models, explainable artificial intelligence (XAI) is gaining traction in predictive modelling. XAI aims to enhance model transparency, effectiveness and makes them accessible to nonexperts in various settings. The popularity of XAI has been increasing in response to the AI “black box” dilemma, which seeks to improve transparency. XAI methods facilitate communication between humans and AI systems. Furthermore, AI-based healthcare systems hold great promise for meeting the increasing demand for high-quality medical services26. Figure 2 illustrates our interdisciplinary framework for predicting birthweight.
[See PDF for image]
Fig. 2
Shows the interdisciplinary approach used to predict birthweight.
In our study, a total of 19 clinically important features, as detailed in Table 2, were systematically selected and collected prospectively during early and mid-pregnancy from 237 pregnant women to develop a model for predicting birthweight. A stacked ensemble model, along with XAI techniques, was developed to classify birthweight into three categories: normal birthweight (NBW), low birthweight (LBW), and very low birthweight (VLBW). This three-class classification was chosen because the risk of neonatal complications increases as birthweight decreases27. Therefore, a multiclass classification approach was considered clinically appropriate for the analysis.
Table 2. Description of maternal and fetal features with target variable chosen in the study.
S. no. | Antenatal markers | Marker description | Clinical importance |
---|---|---|---|
Maternal anthropometric parameters | |||
1 | Age at conception | Continuous variable expressed in number of years | Research shows that a mother’s age can significantly impact fetal development due to pregnancy related issues. This is primarily because age related changes in DNA expression can occur during critical phases of embryonic and fetal development. Mitochondria, the energy-producing organelles, are inherited solely from the mother. Unlike nuclear DNA, mitochondrial DNA lacks robust repair mechanisms, making it more susceptible to mutations as mothers age28,29 |
2 | Height | Continuous variable expressed in cm | Numerical data expressed in centimetres. Numerous studies suggest a strong relationship between maternal height and birthweight30, 31–32 |
3 | Maternal weight in the first trimester | Continuous variable expressed in kg | Numerical data expressed in kilograms. Studies suggest direct associations between maternal weight, BMI, gestational weight gain, with birthweight and infant adiposity33,34 |
4 | Body mass index (BMI) | Continuous variable expressed in kg/m2 | Maternal BMI is a strong indicator of pregnancy outcomes and serves as a marker for a healthy weight34 |
5 | Parity | Variables are categorised as primigravida, multigravida and encoded | Maternal parity may influence birthweight, with studies showing that newborns of primigravida mothers typically have LBW than those of multigravida mothers35,36 |
6 | Type of conception | Variables are categorised as natural conception, invitro fertilisation pregnancy, ovulation induction conception, and encoded | The type of conception influences birthweight. Studies also suggest differences in developmental outcomes with assisted reproductive technology37,38 |
First-trimester markers-maternal parameters | |||
7 | Haemoglobin (Hb) | Continuous variable expressed in g/dl | Early pregnancy hemoglobin levels were measured during the first trimester, and a meta-analysis indicates a strong correlation between these levels and pregnancy outcomes39 |
8 | Glycated hemoglobin (HbA1c) | Continuous variable expressed in % | HbA1c acts as a representative of glycemic control. Poor glycemic control may lead to ketoacidosis, infection, macrosomia, dystocia, spontaneous miscarriage, and congenital anomalies40,41 |
9 | Thyroid-stimulating hormone (TSH) | Continuous variable expressed in μIU/mL | Numerous studies have explored the effect of TSH on pregnancy, though the findings to date have shown variability and continue to be the subject of ongoing discussion42,43 |
First trimester anomaly scan-fetal parameters | |||
10 | Nuchal translucency(NT) | Continuous variable expressed in mm | It is the normal accumulation of fluid present behind the neck of the fetus, usually measured between 11–13 weeks 6 days of pregnancy during FTAS44 |
11 | Crown-rump length (CRL) | Continuous variable expressed in mm | It is the length from fetal head to the rump for estimating the fetal age during FTAS44 |
12 | Pregnancy-Associated Plasma Protein A (PAPP-A) | Continuous variable expressed in MoM | It is a growth factor in normal fetal development and predicts the development of preeclampsia45,46. Fetal β-hCG and PAPP-A assessment is done in the first trimester to screen for chromosomal abnormalities such as trisomies 21 (Down’s syndrome), 18 (Edwards’ syndrome)and 13 (Patau’s syndrome)44 |
Mid trimester anomaly scan—fetal parameters | |||
13 | Bi-parietal-diameter (BPD) | Continuous variable expressed in mm | Transverse view of the fetal head at the level of the thalami47 |
14 | Head circumference (HC) | Continuous variable expressed in mm | |
15 | Abdominal circumference (AC) | Continuous variable expressed in mm | Transverse section of the fetal abdomen47 |
16 | Femur length (FL) | Continuous variable expressed in mm | The longest axis of the ossified diaphysis of the femur bone is measured47 |
17 | Estimated fetal weight (EFW) | Continuous variable expressed in grams | EFW helps in monitoring the fetal growth and development. It is calculated by the composite measurement of BPD, HC, AC, and FL47 |
Risk factors | |||
18 | Gestational diabetes mellitus (GDM) | Variables are categorised and encoded as normoglycemic and hyperglycemic | Diabetes mellitus is significantly linked to increased birthweight and a higher likelihood of delivering infants classified as large for gestational age (LGA) or with macrosomia48,49 |
19 | Hypertension (HTN) | Variables are categorised and encoded as normotensive and hypertensive | Maternal HTN has been established as a significant risk factor associated with LBW50 |
Target variable | |||
20 | Birthweight | Expressed in grams | Classified as follows51 NBW : More than 2500 g LBW : Less than 2500 g VLBW : Less than 1500 g |
The additional contributions of this study are as follows:
Nineteen clinical features from early and mid-pregnancy were considered.
Data analysis was performed using Jamovi (version: 2.6.26) and JASP team (2024) JASP (version:0.18.3) to investigate variations and patterns.
Multiple ML models, such as Random Forest (RF), Logistic Regression (LR), Decision Tree (DT), K-Nearest Neighbors (KNN), CatBoost, LightGBM, AdaBoost, and Extreme Gradient Boosting (XGBoost) were employed to predict birthweight. These algorithms were integrated using a novel stacking architecture.
XAI techniques, namely SHAP (Shapley Additive Explanations), LIME (Local Interpretable Model-agnostic Explanations), and Anchor, were used to enhance model interpretability.
Materials and methods
Dataset description
This observational prospective study was conducted between August 2022 and August 2024 at Dr. TMA Pai Hospital in Udupi and Kasturba Hospital in Manipal, under the Department of Obstetrics and Gynecology. Ethical clearance was obtained from the Institutional Ethics Committee of Kasturba Medical College and Kasturba Hospital (clearance number: IEC1:122/2022). The clinical trial was registered with the Clinical Trials Registry-India (CTRI) under the registration number CTRI/2022/08/044770. All methods were performed in accordance with relevant guidelines and regulations. Ultrasound examinations followed the protocols set by the International Society of Ultrasound in Obstetrics and Gynecology (ISUOG)44,47.
Only singleton pregnant women visiting for their first-trimester anomaly scan (FTAS) were recruited for the study. The sample size was determined using a single proportional formula: n = 4PQ/D2, based on a literature review. We identified the prevalence value as 0.667 and the margin of error (d) as 5%, with a 95% confidence interval. Consequently, the required sample size was calculated to be 356. Only 237 out of the 356 pregnant women who delivered at our institute hospitals were included for the analysis. Informed consent was obtained from all participants during recruitment. The dataset includes 19 clinical markers of these 237 pregnant women, along with the birthweights of their neonates. Maternal anthropometric parameters, such as age, height, weight, and BMI at the time of the FTAS, were collected. We also gathered information on parity, Hb level, HbA1C, thyroid-stimulating hormone (TSH), and pregnancy-associated plasma protein A (PAPP-A) from dual marker tests performed as a part of standard of care.
Ultrasound parameters, such as the crown-rump length (CRL) of the fetus, was measured to determine fetal age, and nuchal translucency (NT) thickness was assessed during the FTAS. Fetal biometry, which includes biparietal diameter (BPD), head circumference (HC), abdominal circumference (AC), femur length (FL), and estimated fetal weight (EFW), was evaluated during a routine mid-trimester anomaly scan (MTAS) performed between 18 and 24 weeks of pregnancy. Common risk factors, such as gestational diabetes mellitus (GDM) and hypertension (HTN), significantly influence pregnancy outcomes; therefore, this was documented52.
Our study aims to identify both modifiable and non-modifiable risk factors during early and mid-pregnancy in order to help prevent LBW. We systematically selected 19 clinically relevant features for the birthweight prediction model, which are detailed in Table 2. An experienced obstetrician reviewed all selected variables to ensure their medical significance. This review confirmed that only features essential to patient care were included. Importantly, all selected parameters are part of routine clinical practice and do not require any additional testing for this study. Figure 3 represents the flow of the study.
[See PDF for image]
Fig. 3
The flowchart illustrates the participant selection process for the study and its outcomes.
Statistical analysis
Statistical analysis was conducted using Jamovi (version: 2.6.26) and JASP team (2024) JASP (version:0.18.3). The data were assessed for normality using the Shapiro–Wilk test. Normally distributed data are presented as mean ± SD, if not as medians with quartiles [M (P25, P75)]. Inferential statistical analyses were performed using Welch’s ANOVA test given the unequal sample sizes across the three groups (VLBW, LBW and NBW)53. Multinomial logistic regression (MLR) was done to find the association between birthweight and antenatal parameters, and Pearson’s correlation was used to identify the relationship between continuous variables. The detailed results are given in the following figures and tables. P < 0.05 was considered as statistically significant.
Table 3 provides the descriptive statistics for continuous variables and comparison of birthweight with antenatal markers via Welch’s one-way ANOVA. Statistically significant p values are represented by an asterisk.
Table 3. Descriptive statistics of maternal and fetal attributes and comparison of birthweight with antenatal markers via Welch’s one-way ANOVA.
Maternal features | Welch’s ANOVA test | ||
---|---|---|---|
Attributes | Mean ± SD | F | p-value |
Maternal height (cm) | 155 ± 6.48 | 10.71 | 0.003* |
Maternal BMI (kg/m2) | 23.1 ± 3.97 | 1.54 | 0.258 |
Median IQR | |||
Age (years) | 29 [27, 32] | 1.07 | 0.379 |
Weight (kg) | 55 [47, 62] | 10.72 | 0.003* |
Hb (g/dl) | 11.4 [12.3, 12.9] | 0.60 | 0.565 |
HbA1C (%) | 5.2 [5, 5.4] | 0.83 | 0.458 |
TSH (μIU/mL ) | 1.73 [1.22, 2.75] | 2.19 | 0.146 |
PAPP-A (MoM) | 1.1 [0.81, 1.64] | 4.61 | 0.036* |
Ultrasound fetal features | |||
FTAS | Median IQR | ||
CRL (mm) | 61[55.6, 65.1] | 1.35 | 0.302 |
NT (mm) | 1.3[1.1, 1.5] | 4.01 | 0.036* |
MTAS | Median IQR | ||
BPD (mm) | 46 [48, 48] | 0.09 | 0.916 |
HC (mm) | 172 [166,178] | 0.10 | 0.902 |
FL (mm) | 31 [30, 33] | 0.57 | 0.587 |
EFW (g) | 314 [288, 353] | 0.02 | 0.980 |
Mean ± SD | |||
Fetal AC (mm) | 147 ± 11.5 | 0.19 | 0.832 |
Histograms and pie charts are used to explore different patterns in the dataset, as shown in Fig. 4. The pie diagram represents the categorical data, such as GDM, HTN and birthweight.
[See PDF for image]
Fig. 4
Representation of the characteristics of important attributes. The distributions of continuous variables are represented in the histogram, and categorical variables such as hypertension, gestational diabetes mellitus, and birthweight are represented in the pie diagram.
:
MLR analysis was done to predict the outcome variable and to find the association between birthweight and antenatal parameters. From the results, it is observed that the models have moderate fit (R2McF = 0.324, p = 0 0.027). These findings infer the model’s effectiveness between predictors and outcome variables. The estimates, odds ratios, p-values, and 95% confidence intervals for key predictor factors associated with antenatal features and birthweight outcomes are presented in the Table 4. The results showed that the birthweight endpoints as VLBW and LBW is associated with height, BMI, NT and FL. When NBW and VLBW were considered as references, only CRL had an association of OR 1.396, with p values of 0.036*. An odds ratio of 9.030 with a p-value (0.017) indicates a strong association between FL and birthweight. There is slightly high clinical association identified between maternal parity and reference variable OR (7.59, p = 0.153) but it is not statistically significant. The OR values of maternal weight (2.330), TSH (2.013) and Hb (1.847) had an influence on the reference variable. The OR values of CRL (1.365), PAPP-A (2.071), BPD (1.213), HC (1.017), and AC (1.104) also influenced the prediction of VLBW. Lower maternal BMI (0.108, p = 0.015) can contribute to lower birthweights. Therefore, the results suggest that maternal anthropometry, parity, first trimester parameters such as CRL and PAPP-A, and second trimester parameters such as BPD, HC, AC, FL and HTN as risk factors can affect the weight of the baby. Though the association is moderate in prediction, identifying any deviations in the above parameters may help to recognise the babies at risk of LBW. The required measures can then be taken in early and mid-pregnancy to minimise the development of VLBW and LBW as pregnancy advances.
Table 4. Multinomial logistic regression.
Reference | Predictor | Estimate | Odds ratio | 95% confidence interval | p | |
---|---|---|---|---|---|---|
Lower | Upper | |||||
LBW-VLBW | Age | − 0.096 | 0.909 | 0.600 | 1.377 | 0.652 |
Height | − 0.565 | 0.569 | 0.325 | 0.994 | 0.048* | |
Weight | 0.846 | 2.330 | 0.946 | 5.742 | 0.066 | |
BMI | − 2.226 | 0.108 | 0.018 | 0.644 | 0.015* | |
Parity: Multigravida-Primigravida | 2.027 | 7.591 | 0.472 | 122.045 | 0.153 | |
Hb | 0.614 | 1.847 | 0.459 | 7.439 | 0.388 | |
HbA1C | − 1.633 | 0.195 | 0.014 | 2.730 | 0.225 | |
TSH | 0.699 | 2.013 | 0.641 | 6.319 | 0.231 | |
CRL | 0.311 | 1.365 | 0.995 | 1.872 | 0.054 | |
NT | − 2.831 | 0.059 | 0.006 | 0.584 | 0.016* | |
PAPP-A | 0.728 | 2.071 | 0.127 | 33.799 | 0.609 | |
BPD | 0.193 | 1.213 | 0.434 | 3.390 | 0.713 | |
HC | 0.017 | 1.017 | 0.686 | 1.509 | 0.933 | |
AC | 0.099 | 1.104 | 0.695 | 1.756 | 0.675 | |
FL | 2.201 | 9.030 | 1.490 | 54.721 | 0.017* | |
EFW | − 0.121 | 0.886 | 0.758 | 1.036 | 0.129 | |
GDM: No–Yes | − 0.398 | 0.672 | 0.032 | 14.309 | 0.799 | |
HTN: No–Yes | − 1.330 | 0.265 | 0.030 | 2.313 | 0.229 | |
NBW-VLBW | Age | 0.004 | 1.005 | 0.681 | 1.482 | 0.982 |
Height | − 0.185 | 0.831 | 0.480 | 1.438 | 0.508 | |
Weight | 0.600 | 1.823 | 0.744 | 4.468 | 0.189 | |
BMI | − 1.546 | 0.213 | 0.036 | 1.257 | 0.088 | |
Parity: Multigravida-Primigravida | 1.324 | 3.759 | 0.262 | 54.004 | 0.330 | |
Hb | 0.526 | 1.692 | 0.441 | 6.488 | 0.443 | |
HbA1C | − 1.229 | 0.293 | 0.022 | 3.963 | 0.355 | |
TSH | 0.498 | 1.645 | 0.538 | 5.033 | 0.383 | |
CRL | 0.333 | 1.396 | 1.023 | 1.904 | 0.036* | |
NT | − 2.189 | 0.112 | 0.012 | 1.030 | 0.053 | |
PAPP-A | 1.018 | 2.769 | 0.180 | 42.494 | 0.465 | |
BPD | 0.012 | 1.012 | 0.375 | 2.733 | 0.981 | |
HC | 0.115 | 1.122 | 0.764 | 1.647 | 0.559 | |
AC | 0.092 | 1.097 | 0.697 | 1.725 | 0.690 | |
FL | 1.631 | 5.109 | 0.878 | 29.725 | 0.069 | |
EFW | − 0.110 | 0.896 | 0.769 | 1.043 | 0.156 | |
GDM: No–Yes | − 0.707 | 0.493 | 0.026 | 9.237 | 0.636 | |
HTN: No–Yes | 1.733 | 5.660 | 0.624 | 51.308 | 0.123 |
Figure 5 represents Pearson’s correlation matrix, which explains the correlation between birthweight and other variables of interest in the study. Maternal height has a positive correlation of 0.33, and first-trimester parameters such as CRL, NT, and PAPP-A had an influence on birthweight. HTN had a correlation of 0.12.
[See PDF for image]
Fig. 5
Pearson’s matrix represents the correlation between birthweight and different attributes. Maternal height, CRL, NT, PAPP-A, and HTN are correlated with birthweight (BW).
Data preprocessing
Preprocessing the dataset is a crucial step in ML. Missing values were replaced with the respective median values to maintain consistency. Continuous features were scaled using standardization to ensure they have a mean of zero and a standard deviation of one. Categorical attributes were encoded into numerical values, as most classifiers cannot handle text inputs. Additionally, data balancing techniques were applied to address any class imbalances and improve model performance54,55.
Accuracy tends to decrease when there is a substantial disparity among data points. Regardless of the units used, classifiers often give more importance to features with larger values. In ML, two common methods for feature scaling are normalization and standardization. Since standardization is more robust to outliers, it was chosen for this study. After preprocessing, the dataset was split into training and testing sets using an 80:20 ratio. Notably, the dataset exhibited a significant class imbalance, with considerably fewer VLBW and LBW cases compared to NBW.
The models tend to favour majority classes, which can lead to biased conclusions when working with imbalanced data. To address this issue, we employed an oversampling method called the Synthetic Minority Over-sampling Technique (SMOTE)54. This technique utilizes the K-nearest neighbors algorithm to generate new synthetic samples, proving particularly effective for borderline instances. In this study, we avoided under-sampling to ensure that we did not miss any significant trends or patterns in the data. We also chose not to balance the test data to maintain data integrity.
Feature selection is crucial for reducing redundant data and minimizing noise. However, it is important to note that SMOTE can introduce noise while creating synthetic data and handling missing values, which may affect the model’s accuracy. To tackle these challenges, we implemented several ML models as well as stacking ensemble models56,57. Additionally, we used XAI techniques such as SHAP, LIME, and anchor to enhance the model’s transparency and performance.
The next step was to address the curse of dimensionality, reduce overfitting by enhancing generalization, and shorten the training period. A mutual information curve was used to identify the features that significantly contributed to the model. The highest importance was observed at the beginning of the curve58. Figure 6 illustrates a bar diagram based on this mutual representation. The analysis showed that the most significant contributors to the model were NT, HC, HbA1C, age, AC, height, and BPD. In contrast, the type of conception, TSH, PAPP-A, GDM, and HTN ranked lower in terms of priority. This may be because the pregnant women received treatment during pregnancy, and these factors are modifiable; thus, the birth outcomes improved with treatment.
[See PDF for image]
Fig. 6
Representation of mutual information. It represents the importance of different variables according to the importance in descending order. NT, HC, HbA1c, age, and AC are considered important features and occupy the initial portion from left to right.
Machine learning optimisation
ML employs algorithms that systematically analyze and model complex relationships within data. By leveraging previous data as input, ML enables algorithms to predict outcomes with high accuracy. In this study, several ML classifiers including RF, LR, DT, KNN, AdaBoost, CatBoost, LGBM, and XGBoost were utilized. These classifiers were selected for their proven effectiveness in creating robust predictive models, especially when dealing with real-world medical datasets. They are capable of handling categorical and numerical data, managing missing values, and are easy to implement, making them well-suited for medical applications59, 60, 61, 62, 63–64. Furthermore, a stacked ensemble model was utilized, combining eight different models to enhance the accuracy of each individual classifier. This stacking strategy aggregates predictions from various baseline models, leading to improved overall predictive performance. Figure 7 shows the ML methodology used in this study.
[See PDF for image]
Fig. 7
Eight different types of ML classifiers applied in the study and a stacked model is developed.
To train our predictive model, we used a combination of preprocessing and validation techniques to enhance performance and ensure robustness. This included data normalization, hyperparameter tuning via grid search, cross-validation, and class balancing using SMOTE. Normalisation of data helps reduce the influence of outliers and ensures that the model treats all features equally. As a result, the model can more effectively detect meaningful patterns in the data, which contributes to improved prediction accuracy and overall model performance65. The effectiveness of a classifier is influenced by the chosen hyperparameters and their tuning. The model performs well on a validation set if the hyperparameters are fine-tuned before the learning process begins. Hyperparameter values to identify the combination of features that creates the best model performance. To prevent overfitting and to assess the generalization ability of predictive models cross-validation data resampling method was done66,67. Grid search and tenfold cross validation was done as hyperparametric optimisation. By utilizing the grid search method, the trained model is ensured to capture various patterns within the dataset. This approach essentially creates a grid that includes every possible combination of the specified hyperparameter values. Each model’s performance is evaluated based on a scoring system, and ultimately, the model that yields the best results is selected68. Cross-validation was utilized in the early stages of model evaluation to assess performance and reduce the risk of overfitting. We focused particularly on precision and recall, as these metrics are crucial for minimizing false positives and false negatives. The ML framework involved dividing the dataset into multiple subsets, or "folds." The model was trained on all but one-fold and validated on the remaining fold. This process was repeated, allowing each fold to serve as the validation set once. The results were then averaged to provide a more robust estimate of model performance. We implemented a tenfold cross-validation approach, which is widely used in ML evaluation. In this method, the dataset is divided into ten equal parts. Each part is used as a validation set once, while the other nine parts are employed for training. This iterative strategy ensures that the model’s predictive performance is tested on unseen data, enabling an evaluation of its generalization capability. By analyzing performance metrics across the folds, we assessed whether the model was overfitting, underfitting, or generalizing appropriately69,70.
To reduce the bias in imbalanced data of VLBW, LBW and NBW, we applied SMOTE, which generates new synthetic examples for the minority class by interpolating between existing minority. This approach helps expand the decision boundary of the minority class, reducing overfitting and improving generalizability54. In this study, we selected various classification and loss metrics to evaluate the models. The metrics included precision, recall, accuracy, the F1 score, Matthew’s correlation coefficient, Jaccard score, and Hamming loss.
XAI techniques, namely SHAP, LIME, and Anchors, were utilized in the study71,72. The results of the study were explained through three XAI (SHAP, LIME, and Anchor) approaches after training and testing the ML models. An elaborate ML method called SHAP is a sophisticated ML method that explains how each feature contributes to a prediction by assigning a relevance value to it. This model-agnostic tool is highly versatile, as it can be applied to any ML model. SHAP provides both local and global explanations, offering detailed insights into individual predictions as well as the overall behavior of the model. By accurately representing variations in the model’s predictions through SHAP values, consistency and reliability are ensured. Additionally, SHAP efficiently sets the corresponding values of missing features to zero. This helps identify the most significant features in a model and clarifies how each feature influences the prediction outcome. LIME works particularly well with smaller datasets and delivers insights into individual predictions that are easier to comprehend. It is versatile across diverse forms of data because of its model-agnostic nature, which enables its use in different models. Anchors explain significant features through a set of ‘rules’ and ‘conditions’. Two parameters are used to measure each anchor (condition): precision and coverage. Precision determines the accuracy of explanations. Coverage defines the number of instances that are predicted using the same conditions, which aids in understanding the model’s prediction. This adaptability improves trust in AI-driven diagnostic procedures by helping patients and clinicians acquire confidence in the diagnostic outcomes generated by AI73. These XAI algorithms offer insights in the form of graphs and tables, making the results simple for ML users. Figure 8 represents the flow of research from ethical clearance to the development of the prediction model.
[See PDF for image]
Fig. 8
The flow chart represents the study procedure from ethical clearance to the study outcome in the development of birthweight prediction model.
Results
This study provides a detailed analysis of 237 pregnant women, utilizing early and mid-pregnancy antenatal parameters to predict birthweight. We applied conventional statistical analysis, various ML classifiers, and XAI techniques to develop reliable and optimal clinical models.
In this section, we discuss the performance of the classifiers used. Table 5 summarizes the results from various models. The AdaBoost model achieved the highest performance, with a maximum accuracy of 77%, the highest precision of 73%, a recall value of 77%, and an F1 score of 72%. The stacked model reached an accuracy of 75%. The Jaccard score indicated a high degree of overlap in the outputs of the two sets, with the AdaBoost classifier model yielding the highest score of 0.61. Additionally, the maximum Matthews correlation coefficient (MCC) obtained through the AdaBoost classifier model was 0.33, indicating a significant correlation between the predicted and actual outputs.
Table 5. Results obtained by the various models.
Classifiers | Accuracy | Precision | Recall | F1 score | Hamming loss | Jaccord score | Mathews correlation coefficient |
---|---|---|---|---|---|---|---|
1. RF | 0.71 | 0.53 | 0.71 | 0.60 | 0.29 | 0.51 | − 0.08 |
2. LR | 0.58 | 0.65 | 0.58 | 0.60 | 0.41 | 0.44 | 0.12 |
3. DT | 0.58 | 0.62 | 0.58 | 0.60 | 0.41 | 0.42 | 0.05 |
4. KNN | 0.65 | 0.61 | 0.65 | 0.62 | 0.35 | 0.49 | 0.01 |
5. AdaBoost | 0.77 | 0.73 | 0.77 | 0.72 | 0.22 | 0.61 | 0.33 |
6. CatBoost | 0.73 | 0.66 | 0.73 | 0.65 | 0.27 | 0.54 | 0.11 |
7. LGBM | 0.73 | 0.66 | 0.73 | 0.65 | 0.27 | 0.54 | 0.11 |
8. XG Boost | 0.71 | 0.62 | 0.71 | 0.63 | 0.29 | 0.52 | 0.04 |
9. Stacked model | 0.75 | 0.72 | 0.75 | 0.69 | 0.25 | 0.57 | 0.23 |
Results of the performance measures of birthweight prediction of the models. RF, random forest; LR, logistic regression; DT, decision tree; LGBM, light gradient boosting machine; XG, boost extreme gradient boosting. AdaBoost showed high accuracy and precision, recall the least hamming loss, and a high Jaccord score. Mathew’s correlation is also given with each model.
Figure 9 shows the confusion matrix generated for the stacked model applied to the dataset. The performance of the classification algorithm is summed and visualised using a confusion matrix. The confusion matrix describes the performance of the classification method. Generally, the outputs are classified as true positives, true negatives, false positives, or false negatives. It can be used to calculate a variety of model performance metrics. The model showed good overall prediction accuracy for NBW. Notably, cases of NBW were identified much more frequently than those of LBW. However, the prediction performance for VLBW instances was significantly lower, most likely due to a lack of training data for this category.
[See PDF for image]
Fig. 9
Confusion matrix of multi class obtained for the stacked model.
Figure 10 illustrates the ranked importance of the features determined using the SHAP model. The key features, in descending order of significance, include height, parity, HTN, NT, and HbA1C. In contrast, features such as conception type, BPD, FL, GDM, age, EFW, Hb were deemed less important by the model.
[See PDF for image]
Fig. 10
SHAP plot indicating different attributes arranged in descending order. Important attributes such as height, parity, HTN, NT, HbA1C occupy the initial levels.
Figure 11 shows the LIME model for the prediction of birthweight. The LIME model identified parity and height as strong predictors. Additionally, the CRL and TSH are recognised as contributing factors in determining birthweight. Sixty-one per cent of the NBW and 39% of the LBW babies were differentiated. The prediction of VLBW was negligible because the dataset contained minimal VLBW babies.
[See PDF for image]
Fig. 11
Model explainability using LIME. Parity and height influence the outcome.
The anchors for different birthweights are listed in Table 6. Similarly, parity, HC, BMI, height, CRL, and HbA1c are considered significant anchor conditions for determining birthweight.
Table 6. Model interpretability with anchor.
Instance | Patient prediction | Anchor condition | Precision | Coverage |
---|---|---|---|---|
1 | Very low weight | Parity < = 0.11 AND BMI < = 24.20 | 0.65 | 0.30 |
2 | Very low weight | Parity < = 0.00 AND BMI < = 24.20 | 0.63 | 0.27 |
3 | Low weight | Parity < = 0.11 AND Height > 153.22 | 0.73 | 0.26 |
4 | Low weight | HC < = 172.00 AND Height < = 148.56 | 0.77 | 0.16 |
5 | Low weight | Parity < = 0.00 AND CRL > 64.00 | 0.74 | 0.12 |
6 | Low weight | Parity < = 0.00 AND HC > 175.00 | 0.76 | 0.11 |
7 | Low weight | Height > 157.09 AND HbA1C < = 5.21 | 0.68 | 0.12 |
8 | Normal | Height > 153.22 AND HbA1C < = 5.10 | 0.69 | 0.14 |
9 | Normal | Parity < = 0.00 AND Height > 153.22 | 0.74 | 0.24 |
10 | Normal | Height > 157.09 AND CRL > 64.00 | 0.80 | 0.06 |
Discussion
AI algorithms show promising potential in advancing medical research because of their ability to process and analyze large-scale datasets and facilitate the development of predictive tools74. In recent years, researchers have adopted a multidisciplinary strategy to integrate computer science, engineering, and maternal–fetal medicine into obstetrical care. ML has demonstrated encouraging results in improving predictive models for various obstetric outcomes, such as GDM, preeclampsia, preterm birth, fetal ultrasound image analysis, gestational weight gain, early diagnosis of fetal alcohol spectrum disorder, fetal hypoxia detection, and birthweight75,76. Among these applications, predicting birthweight is particularly important for public health, especially in low-resource settings were estimating gestational age accurately can be difficult. LBW is a crucial indicator of neonatal health, which further emphasizes the need for reliable prediction methods77. Our study employed conventional statistical methods and multiple ML algorithms to predict birthweight, supporting more effective treatment planning beginning in early pregnancy to help avert progression to LBW newborns. To enhance model transparency, we incorporated three XAI approaches. These explainers allow doctors and other medical professionals to quickly identify variations in key markers.
Table 7 illustrates the significance of the various features assessed via conventional statistical methods and ML techniques. Maternal height, NT thickness, parity, CRL, HbA1C, HTN and PAPP-A are considered important attributes of birthweight according to our model developed via statistical tools, mutual information graphs and XAI techniques. The differences in feature importance arise primarily because statistical methods and ML algorithms operate on different principles. Statistical models rely on predefined assumptions to infer relationships between variables, while ML algorithms focus on identifying complex patterns in data to enhance predictive accuracy. Despite the variation in feature rankings across models, it is noteworthy that the features consistently identified as important possess established clinical relevance. All variables included in this study were carefully selected based on evidence-backed international guidelines and the expert consensus of experienced clinicians, ensuring both statistical rigor and clinical applicability.
Table 7. Important features for predicting birthweight with different statistical and ML algorithms.
Welch’s ANOVA | Multinominal logistic regression | Pearson’s correlation matrix | Mutual information | SHAP | LIME | Anchor |
---|---|---|---|---|---|---|
NT PAPP-A Height | CRL PAPP-A BPD HC AC FL HTN | Height CRL NT PAPP-A HTN | NT HC HbA1c Age AC | Height Parity HTN NT HbA1C | Parity Height CRL | Parity BMI Height CRL HbA1c |
According to our model, important predictors of birthweight include maternal height, NT thickness, parity, age, BMI, CRL, HbA1C, HTN, and PAPP-A.
Maternal height
Our findings indicate that maternal heights of less than or equal to 148.56 cm, combined with fetal head circumferences less than 172 mm measured during MTAS, are associated with high instances of LBW. This aligns with existing studies that suggest maternal height significantly influences birthweight. Conversely, maternal heights exceeding 153.22 cm, along with HbA1c levels of less than or equal to 5.10, are linked to a higher incidence of NBW. Research indicates that taller mothers tend to have heavier infants, with this trend varying based on ethnicity and genetic factors. Specifically, studies involving Malaysian ethnic groups show that newborn birthweight increases with maternal height, with an estimated increase of 7.08 g in birthweight for every 1 cm increase in maternal height. Data from European countries suggest that this increase can range from 15 to 17 g per centimetre of maternal height. A study conducted on 8541 mothers and their children from various ethnic backgrounds in Rotterdam, the Netherlands, examined the relationship between maternal body measurements and fetal weight throughout pregnancy, as well as birth outcomes. The findings revealed that maternal BMI starts to impact fetal growth from the middle of pregnancy onwards. Additionally, factors such as maternal height, pre-pregnancy BMI, and weight gain during pregnancy were associated with the likelihood of having babies who are either smaller or larger than expected for their gestational age. However, the exact biological mechanisms underlying these associations are not yet fully understood. Various studies around the globe also support the influence of maternal height on birthweight78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90–91.
BMI is considered one of the factors associated with birthweight according to anchor. According to the anchor, a maternal BMI < = 24.20 kg/m2 is associated with LBW, as BMI is a composite value of height and weight. The mother’s weight in the current study was measured during the first trimester. Retnakaran et al. found that weight gain before 18 weeks of gestation, particularly from pre-pregnancy to 14 weeks and from 14 to 18 weeks, was significantly associated with higher birthweight. In contrast, weight gain after 18 weeks did not show a notable impact, emphasizing that early pregnancy is a critical period for interventions aimed at optimizing birth outcomes. These findings demonstrate that maternal weight status in early pregnancy is a significant predictor of birthweight92. A study conducted in India by Mohapatra et al. supports these findings, showing a strong relationship between early pregnancy BMI and LBW. A low BMI during the early stages of pregnancy has been linked to LBW and preterm birth. Furthermore, the socioeconomic status of pregnant women may influence their weight gain by affecting access to healthy food, nutrition, antenatal care, and health supplements93,94.
NT: NT thickness has been identified as a significant factor affecting birthweight. This finding is consistent with research by Hackmon et al., who established a strong correlation between birthweight ratios and fetal measurements, including CRL, BPD, and NT. These results suggest that early fetal biometric measurements may reflect physiological variations in birthweight95. NT thickness greater than 3 mm however requires further evaluation at specialized centers due to the increased risk of chromosomal and structural anomalies96. Moreover, Kalem et al. found a link between higher maternal age and elevated NT multiples of the median (MoM) with an increased risk of neonatal intensive care unit admission97.
CRL
Menstrual history, precisely the length and regularity of the menstrual cycle, plays a role in dating pregnancy in the first trimester. The growth of the fetus is linear, with less standard deviation between 9 and 13 weeks of pregnancy, which is determined by the CRL. If a discrepancy arises between the observed value with fetal biometry through USG and the menstrual cycle, the dating is revised as the corrected estimated delivery date (C-EDD). These findings lay the foundation for tracking fetal growth as pregnancy advances. With this type of calculation of gestational age, over-estimation or underestimation is reduced. The misclassification of SGA infants, large babies, early inductions, or post-term deliveries is minimized98, 99, 100–101. The XAI technique called anchor suggested that a CRL greater than 64 mm and a maternal height greater than 157.09 cm resulted in NBW.
Parity: Parity of the mother was determined to be one of the contributing factors for birthweight in our study results. All the XAI techniques, SHAP, LIME, and anchor, consider parity to be one of the factors influencing birthweight. This finding emphasizes the importance of considering maternal parity in clinical decisions regarding birthweight. A meta-analysis has demonstrated that nulligravida mothers tend to have infants that weigh 280 g less than those of multigravida mothers. Physiological and anatomical changes that occur during the first pregnancy may impact subsequent pregnancies. Increases in uterine size, uteroplacental circulation and nutritional status are potential factors that benefit mothers with multiple pregnancies. Consequently, these elements help reduce the likelihood of LBW36,102, 103, 104, 105, 106–107. However, as age advances, the risk of anaemia, premature birth, and macrosomia increases108,109.
PAPP-A: PAPP-A is considered a contributing factor in MLR, Welch’s ANOVA, and Pearson’s correlation. However, XAI did not consider it an important variable. PAPP-A is estimated during aneuploidy screening in the first trimester. It also acts as a predictor of preeclampsia and has an influential role in birthweight. Low levels of PAPP-A are also indicators of placental dysfunction110,111. Consequently, any change in placental blood flow can lead to a decreased nutrient supply to the developing fetus, resulting in LBW.
HbA1c: HbA1c is a critical feature of mutual information, SHAP and anchor. Zhao et al., who reported that mothers with elevated blood sugar levels during pregnancy had a greater risk of macrosomic infants112.
HTN: HTN was identified as an important factor by MLR, Pearson’s correlation and SHAP. Gestational HTN and preeclampsia cause endothelial dysfunction, which subsequently decreases uteroplacental blood perfusion. This underlies the pathophysiological basis of LBW and preterm birth113,114.
Maternal age: Maternal age is identified as one of the factors measured by mutual information; however, age and birthweight are negatively correlated, with a Pearson correlation coefficient of − 0.094. Research indicates that there are increased risks of pregnancy complications, such as stillbirth, preterm birth, and intrauterine growth restriction, associated with maternal age of 19 years or younger, as well as with advanced maternal age of 35 years or older. Additionally, these risks are influenced by various factors, including socioeconomic status, educational attainment, parity, and the tendency for well-qualified women to delay pregnancy in order to pursue their careers115, 116, 117, 118, 119, 120, 121–122.
Impact of treatment on model predictions
Pregnant women were recruited for the study between 11 and 13 weeks + 6 days of gestation. The lower gestational age limit of 11 weeks was chosen to ensure the inclusion of viable pregnancies, as most pregnancy losses between 6 and 10 weeks occur due to fetal chromosomal abnormalities, including trisomies, monosomies, and polyploidy123,124. Pregnancies complicated by lethal anomalies or intrauterine fetal demise were excluded during follow-up. It is well established that conditions such as GDM and HTN are linked to adverse pregnancy outcomes. Therefore, pregnant women identified as high risk, those with a history of LBW, preterm birth, HTN, and GDM received appropriate clinical intervention. Low-dose aspirin for HTN, dietary counselling, metformin, or insulin for GDM were prescribed when indicated. In our prospective study, we found that early diagnosis and timely treatment of these conditions likely contributed to a reduction in complications such as VLBW, LBW even in high-risk pregnancies.
The stacked model developed in this study achieved an accuracy of 75%, while the Adaboost model reached an accuracy of 77%. It is important to interpret these performance metrics within the context of real-world clinical data. In practice, when pregnant women are identified with risk factors, they receive prompt treatment, which often helps mitigate adverse outcomes such as LBW or preterm delivery. Consequently, the model’s predictive power may be underestimated because timely interventions can reduce the occurrence of complications that the model aims to predict.
We suggest that the observed accuracy reflects both the model’s predictive capabilities and the effectiveness of clinical interventions. Untreated populations might exhibit higher model accuracy due to a greater frequency of adverse outcomes; however, such a situation would be unethical. In clinical settings where treatment is standard, achieving very high accuracy is inherently challenging. While this success in prevention is clinically desirable, it can also lead to an underestimation of the model’s sensitivity or accuracy. Nevertheless, our model employs standard point-of-care variables, which enhances its real-time application with early and mid-pregnancy markers. Additionally, by excluding rare or niche variables, we improve the model’s generalizability and applicability across broader populations. This may reflect clinical practice patterns rather than represent a true limitation.
From a healthcare perspective, the goal is not only prediction but also prevention. Therefore, the reported accuracy of 75% signifies not just the model’s predictive power but also the positive impact of real-time medical interventions, reinforcing the model’s practical value in supporting clinical decision-making. Table 7 indicates the features such as maternal height, NT thickness, parity, age, early pregnancy BMI, CRL, HbA1C, PAPP-A, HTN as the strong predictors of LBW. In obstetrics, the inverted pyramid theory suggests that early detection and prioritization of high-risk fetuses can significantly reduce unfavourable postnatal outcomes125. This approach emphasizes the importance of timely referrals, cost-effectiveness, and efficient resource allocation. Our proposed model aligns with this principle by assisting clinicians in the early identification of LBW pregnancies using early pregnancy markers, enabling proactive interventions to improve neonatal outcomes.
Numerous machine learning studies have aimed to predict fetal birthweight, predominantly relying on sociodemographic variables. However, there remains a significant gap in the use of clinical data from early and mid-pregnancy within these models. Table 8 presents various ML techniques that have been previously employed in this area. To our knowledge, no published studies have utilized three distinct XAI methods to predict birthweight using real clinical data from early and mid-pregnancy. This makes our approach innovative and underscores the contribution of this study to the existing body of literature.
Table 8. Summary of different models for birthweight prediction.
Reference | Dataset | Learning methods used | Accuracy obtained | Critical predictors | XAI method |
---|---|---|---|---|---|
Bekele, W.T126 | Sociodemographic | LR, DT,NB, KNN, RF, SVM (Support Vector Machine), GB, and XGB | RF Accuracy:91.60% Sensitivity 91.60% | Gender Marriage to birth interval Mother’s occupation Mother’s age | None |
Arayeshgari M et al.127 | Maternal and neonatal demographic characteristics | DT, RF, ANN, SVM and LR | LR Sensitivity 74% Specificity 89% Accuracy 88% | Gestational age Number of abortions Parity Consanguinity Maternal age at delivery Neonatal sex | None |
Islam Pollob et al.128 | Anthropometric, Socio demographic and pregnancy history | LR and DT | LR Sensitivity 99.5% Specificity 17.7% AUC 0.59 | Region Education Wealth Height Twin pregnancy | None |
Khan W et al.129 | Arab pregnant women | Multiple ML models | RF algorithm with an MAE value of 294.53 | Diabetes Gestational age HTN | None |
Hussain et al.130 | Indian pregnant women- certain features from conception to birth | Gaussian NB, RF | RF outperforms Gaussian Naïve Bayes | Physical and mental health of the mother influences the fetus | None |
Alabbad D.A et al.131 | KFUH and IEEE datasets | SVM, DT, RF, ET (extra trees), GaussianNB, XGBoost, AdaBoost, and LGBM | ET accuracy 98% (KFUH dataset) RF accuracy 96% (IEEE dataset) | Age Weight Height Gender | SHAP |
Dola S et al.132 | Anthropometric, socioeconomic, maternal, and paternal factors | LR, RF, XGBoost, conditional inference tree, and attention mechanism | XGBoost showed the highest performance | Maternal height Prepregnancy Weight Weight gain during pregnancy Parental ethnicity | Partial dependence plots (PDP), SHAP |
Liu Q et al133 | ML to predict fetal macrosomia | RF, KNN, AdaBoost, SVM, naïve Bayes, and LR | LR and ensemble model demonstrated good performance | Maternal and fetal parameters | SHAP |
Current study | Birthweight prediction using early and mid-pregnancy antenatal markers | RF, LR, DT, KNN, AdaBoost, CatBoost, LGBM, XGBoost and ensembled model | AdaBoost | Maternal height NT Parity CRL HbA1C PAPP-A | SHAP, LIME, Anchor |
Strength and limitation
A major strength of our study is the use of real-world clinical data that is routinely collected during antenatal care. This enhances the model’s applicability and relevance to everyday clinical practice. Our AI-driven model successfully identified key predictors of LBW, including maternal height, NT thickness, parity, age, CRL, HbA1c, and BMI from early and mid-pregnancy. These variables are standard components of antenatal screening and can be accessed even in low-resource settings, increasing the model’s utility and scalability. The model not only aids in birthweight predictions but also supports timely clinical decision-making and early interventions. Importantly, the integration of XAI methods enhances the model’s transparency and interpretability, fostering trust among clinicians and promoting its adoption in clinical workflows. The model provides insights into high-risk pregnancies that may result in LBW or VLBW, allowing healthcare teams to proactively plan medical management strategies. This can significantly reduce the financial, psychological, and emotional burden on families and alleviate stress among healthcare professionals. Our work lays a foundation for further research not only in prediction but also in prevention strategies for LBW. Early identification of modifiable risk factors may ultimately enhance maternal and neonatal outcomes, contributing to improved quality of life for mothers, infants, families, and society at large. We also wish to highlight that, to our knowledge, multiclass classification of birthweight outcomes remains relatively underexplored in the literature, despite its greater clinical relevance compared to binary classification. Specifically identifying VLBW cases enables more targeted treatment planning and efficient resource allocation, which can help prevent severe neonatal complications such as sepsis or respiratory distress. Most existing research in this domain relies on binary classification, typically combining LBW and VLBW infants into a single category (< 2500 g). However, given the substantial differences in clinical management between LBW and VLBW infants, we aimed to develop a model capable of distinguishing VLBW cases (< 1500 g). Although our model’s current predictive accuracy for VLBW remains limited, we view this work as a meaningful step toward addressing a critical research gap and supporting future advancements in this area.
Our study has several limitations. First, the information was gathered exclusively from individuals of Indian ethnicity, making the findings applicable only to pregnant Indian women and limiting the generalizability of the model. To enhance the classifier’s reliability, datasets from various ethnicities should be included in future research. This study employed supervised learning methods, excluding unsupervised and reinforcement learning algorithms. We acknowledge that our sample size was limited, consisting of only 237 pregnant women with a data imbalance, particularly in VLBW infants. Although we initially aimed for a sample of 356, only 237 women delivered at our institution, primarily due to the socio-cultural practice of giving birth at their maternal home. While transfer learning could potentially address issues related to sample size, we opted not to rely on it extensively in this study due to computational limitations and our focus on XAI for better interpretability. Furthermore, external validation of the model was not conducted for ethical reasons, as well as due to the nature of the data used in this study. This lack of validation limits the generalizability of the model across diverse populations and clinical settings. However, we did perform cross-validation by separating the dataset into training and test sets. Therefore, a larger sample size that includes diverse medical conditions and ethnicities would strengthen the robustness of the model.
Conclusion
The current study used multiple supervised learning algorithms and XAI techniques to predict birthweight using early and mid-pregnancy clinical markers. The developed model may help medical practitioners make more accurate birthweight predictions that are readily available using routine antenatal features. Furthermore, the integration of a cloud database incorporating sociodemographic, genetic, clinical, antenatal, and postnatal data can support advancements in fetal precision medicine.
Acknowledgements
We sincerely thank all the study participants for their consent and involvement in this research. The figures were created using BioRender. We gratefully acknowledge the financial support provided by the Manipal Academy of Higher Education through the Dr. TMA Pai Doctoral Fellowship.
Author contributions
P.M. collected the data, curated, formally analysed, and wrote the original draft. C.K. curated the data, formal analysis, software, visualization, writing and reviewing. J.V. curated the data, statistical analysis. V.A. performed the methodology, reviewed the manuscript. B.K.R. reviewed and edited the manuscript. P.S. edited and reviewed the manuscript. S.K.B. conceptualized, methodology, project administration, resources, supervision, visualization, writing and review of main manuscript.
Funding
Open access funding provided by Manipal Academy of Higher Education, Manipal
Data availability
The datasets generated and analysed during the current study are not publicly available due to the sensitive nature of the data but are available from the corresponding author on reasonable request.
Competing interests
The authors declare no competing interests.
Ethics declarations
Ethical clearance was obtained from the Kasturba Medical College and Kasturba Hospital Institutional Ethics Committee, and the ethical clearance number is IEC1:122/2022. The clinical trial registry number of the study is CTRI/2022/08/044770.
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
1. Lees, CC et al. ISUOG Practice Guidelines: diagnosis and management of small-for-gestational-age fetus and fetal growth restriction. Ultrasound Obstet. Gynecol.; 2020; 56, pp. 298-312.1:STN:280:DC%2BB38jpslOnsQ%3D%3D [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/32738107]
2. Martin. Health—United Nations Sustainable Development. United Nations Sustainable Development https://www.un.org/sustainabledevelopment/health/#tab-3f22056b0e91266e8b2 (2023).
3. Krasevec, J et al. Study protocol for UNICEF and WHO estimates of global, regional, and national low birthweight prevalence for 2000 to 2020. Gates Open Res.; 2022; 6, 80. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/37265999][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10229761]
4. Blencowe, H et al. National, regional, and worldwide estimates of low birthweight in 2015, with trends from 2000: a systematic analysis. Lancet Glob. Health; 2019; 7, pp. e849-e860. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/31103470][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6560046]
5. Jia, CH; Feng, ZS; Lin, XJ et al. Short term outcomes of extremely low birth weight infants from a multicenter cohort study in Guangdong of China. Sci. Rep.; 2022; 12, 11119.2022NatSR.1211119J1:CAS:528:DC%2BB38XhslegurjI [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/35778441][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9249781]
6. Low birth weight. https://www.who.int/data/nutrition/nlis/info/low-birth-weight.
7. Okwaraji, YB et al. National, regional, and global estimates of low birthweight in 2020, with trends from 2000: a systematic analysis. Lancet; 2024; 403, pp. 1071-1080. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/38430921]
8. International Institute for Population Sciences (IIPS) and Macro International. 2007. National Family Health Survey (NFHS-3), 2005–06: India: Volume I. Mumbai: IIPS.
9. Singh, D et al. Prevalence and correlates of low birth weight in India: findings from National Family Health Survey 5. BMC Pregnancy Childbirth; 2023; 23, pp. 1-10.
10. Sabbaghchi, M; Jalali, R; Mohammadi, M. A systematic review and meta-analysis on the prevalence of low birth weight infants in Iran. J. Pregnancy; 2020; 2020, pp. 1-7.
11. Institute of Medicine (US) Committee on Improving Birth Outcomes. In Improving Birth Outcomes: Meeting the Challenge in the Developing World (eds. Bale, J. R., Stoll, B. J., Lucas, A. O.) (National Academies Press, 2003). https://www.ncbi.nlm.nih.gov/books/NBK222095/.
12. Derakhshi, B; Esmailnasab, N; Ghaderi, E; Hemmatpour, S. Risk factor of preterm labor in the west of Iran: a case-control study. DOAJ; 2014; 43, pp. 499-506.
13. Moradi, G., Zokaeii, M., Goodarzi, E. & Khazaei, Z. Maternal risk factors for low birth weight infants: a nested case-control study of rural areas in Kurdistan (western of Iran). J. Prev. Med. Hyg. (2021).
14. Katz, J et al. Mortality risk in preterm and small-for-gestational-age infants in low-income and middle-income countries: a pooled country analysis. Lancet; 2013; 382, pp. 417-425. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/23746775][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3796350]
15. Knop, MR et al. Birth weight and risk of Type 2 diabetes mellitus, cardiovascular disease, and hypertension in adults: a meta-analysis of 7,646,267 participants from 135 studies. J. Am. Heart Assoc.; 2018; 7, [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/30486715][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6405546]e008870.
16. Wu, M; Shao, G; Zhang, F; Ruan, Z; Xu, P; Ding, H. Estimation of fetal weight by ultrasonic examination. Int. J. Clin. Exp. Med.; 2015; 8, pp. 540-545. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/25785028][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4358483]
17. Dwivedi, YK et al. Artificial intelligence (AI): multidisciplinary perspectives on emerging challenges, opportunities, and agenda for research, practice and policy. Int. J. Inf. Manag.; 2021; 57, 101994.
18. Alanazi, A. Using machine learning for healthcare challenges and opportunities. Inform. Med. Unlocked; 2022; 30, 100924.
19. Reddy, S. Explainability and artificial intelligence in medicine. Lancet Digit. Health; 2022; 4, pp. e214-e215.1:CAS:528:DC%2BB3sXis1SksL4%3D [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/35337639]
20. Reza, TB; Salma, N. Prediction and feature selection of low birth weight using machine learning algorithms. J. Health Popul. Nutr.; 2024; 43, pp. 1-10.
21. Ranjbar, A et al. Machine learning-based approach for predicting low birth weight. BMC Pregnancy Childbirth; 2023; 23, pp. 1-9.
22. de Morais, FL et al. Utilization of tree-based machine learning models for predicting low birth weight cases. BMC Pregnancy Childbirth; 2025; 25, 207. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/40011804][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11863662]
23. Naimi, AI; Platt, RW; Larkin, JC. Machine learning for fetal growth prediction. Epidemiology; 2018; 29, 290. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/29199998][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5792310]
24. Tao, J; Yuan, Z; Sun, L; Yu, K; Zhang, Z. Fetal birthweight prediction with measured data by a temporal machine learning method. BMC Med. Inform. Decis. Mak.; 2021; 21, pp. 1-10.
25. Rubaiya, N; Mansur, M; Alam, MM; Rayhan, MI. Unraveling birth weight determinants: integrating machine learning, spatial analysis, and district-level mapping. Heliyon; 2024; 10, 1:STN:280:DC%2BB1cngslSgsg%3D%3D [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/38562507][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10982972]e27341.
26. Sadeghi, Z et al. A review of explainable artificial intelligence in healthcare. Comput. Electr. Eng.; 2024; 118, 109370.
27. Gill, SV; May-Benson, TA; Teasdale, A; Munsell, EG. Birth and developmental correlates of birth weight in a sample of children with potential sensory processing disorder. BMC Pediatr.; 2013; 13, pp. 1-9.
28. Tarín, JJ; García-Pérez, MA; Cano, A. Potential risks to offspring of intrauterine exposure to maternal age-related obstetric complications. Reprod. Fertil. Dev.; 2016; 29, pp. 1468-1475.
29. Wang, S et al. Changing trends of birth weight with maternal age: a cross-sectional study in Xi’an city of Northwestern China. BMC Pregnancy Childbirth; 2020; 20, pp. 1-7.
30. Softa, SM; Aldardeir, N; Aloufi, FS; Alshihabi, SS; Khouj, M; Radwan, E. The association of maternal height with mode of delivery and fetal birth weight at King Abdulaziz University Hosc, Jeddah,. Saudi Arabia. Cureus; 2022; 14, [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/36060402]e27493.
31. Rochow, N et al. Maternal body height is a stronger predictor of birth weight than ethnicity: analysis of birth weight percentile charts. J. Perinat. Med.; 2018; 47, pp. 22-29. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/29870393]
32. Spada, E; Chiossi, G; Coscia, A; Monari, F; Facchinetti, F. Effect of maternal age, height, BMI and ethnicity on birth weight: an Italian multicenter study. J. Perinat. Med.; 2017; 46, pp. 1016-1021.
33. Ludwig, DS; Currie, J. The association between pregnancy weight gain and birthweight: a within-family comparison. Lancet; 2010; 376, pp. 984-990. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/20691469][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2974327]
34. Hussey, MR et al. Associations of placental lncRNA expression with maternal pre-pregnancy BMI and infant birthweight in two birth cohorts. J. Dev. Orig. Health Dis.; 2025; 16, pp. 1-10.
35. Garces, A et al. Association of parity with birthweight and neonatal death in five sites: The Global Network’s Maternal Newborn Health Registry study. Reprod. Health; 2020; 17, pp. 1-8.4037644
36. Shah, PS. Parity and low birth weight and preterm birth: a systematic review and meta-analyses. Acta Obstet. Gynecol. Scand.; 2010; 89, pp. 862-875. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/20583931]
37. Wang, YA et al. Preterm birth and low birth weight after assisted reproductive technology-related pregnancy in Australia between 1996 and 2000. Fertil. Steril.; 2005; 83, pp. 1650-1658. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/15950632]
38. Zhong, X et al. Effect of parental physiological conditions and assisted reproductive technologies on the pregnancy and birth outcomes in infertile patients. Oncotarget; 2016; 8, pp. 18409-18416. [PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5392338]
39. Rahmati, S; Azami, M; Badfar, G; Parizad, N; Sayehmiri, K. The relationship between maternal anemia during pregnancy with preterm birth: a systematic review and meta-analysis. J. Matern. Fetal Neonatal Med.; 2018; 33, pp. 2679-2689.
40. Sherwani, SI; Khan, HA; Ekhzaimy, A; Masood, A; Sakharkar, MK. Significance of HbA1C test in diagnosis and prognosis of diabetic patients. Biomark. Insights; 2016; 11, BMI.S38440.
41. Gabbe, S. Management of diabetes mellitus complicating pregnancy. Obstet. Gynecol.; 2003; 102, pp. 857-868. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/14551019]
42. Li, J et al. Maternal TSH levels at first trimester and subsequent spontaneous miscarriage: a nested case–control study. Endocr. Connect.; 2019; 8, pp. 1288-1293.1:CAS:528:DC%2BB3cXisVCgtr4%3D [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/31525729][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6765319]
43. Nishioka, E et al. Relationship between maternal thyroid-stimulating hormone (TSH) elevation during pregnancy and low birth weight: A longitudinal study of apparently healthy urban Japanese women at very low risk. Early Hum. Dev.; 2015; 91, pp. 181-185.1:CAS:528:DC%2BC2MXhtFKmurg%3D [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/25676185]
44. Bilardo, CM et al. ISUOG Practice Guidelines (updated): performance of 11–14-week ultrasound scan. Ultrasound Obstet. Gynecol.; 2023; 61, pp. 127-143.1:STN:280:DC%2BB28nksVWrsA%3D%3D [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/36594739]
45. Kalousová, M; Muravská, A; Zima, T. Pregnancy-associated plasma protein A (PAPP-A) and preeclampsia. Adv. Clin. Chem.; 2014; 63, pp. 169-209. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/24783354]
46. Shiefa, S; Amargandhi, M; Bhupendra, J; Moulali, S; Kristine, T. First trimester maternal serum screening using biochemical markers PAPP-A and free Β-HCG for Down syndrome, Patau syndrome and Edward syndrome. Indian J. Clin. Biochem.; 2012; 28, pp. 3-12. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/24381414][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3547446]
47. Salomon, LJ et al. ISUOG Practice Guidelines (updated): performance of the routine mid-trimester fetal ultrasound scan. Ultrasound Obstet. Gynecol.; 2022; 59, pp. 840-856.1:STN:280:DC%2BB2MnktlOrtw%3D%3D [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/35592929]
48. Ogonowski, J; Miazgowski, T. Intergenerational transmission of macrosomia in women with gestational diabetes and normal glucose tolerance. Eur. J. Obstet. Gynecol. Reprod. Biol.; 2015; 195, pp. 113-116.1:CAS:528:DC%2BC2MXhs12qu7fM [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/26512436]
49. Yang, Y et al. The association of gestational diabetes mellitus with fetal birth weight. J. Diabetes Complications; 2018; 32, pp. 635-642. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/29907325]
50. Ardissino, M et al. Maternal hypertension increases risk of preeclampsia and low fetal birthweight: Genetic evidence from a Mendelian randomization study. Hypertension; 2022; 79, pp. 588-598.1:CAS:528:DC%2BB38XislOlt7k%3D [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/35138876]
51. Cutland, CL et al. Low birth weight: case definition & guidelines for data collection, analysis, and presentation of maternal immunization safety data. Vaccine; 2017; 35, pp. 6492-6500. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/29150054][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5710991]
52. Aziz, F; Khan, MF; Moiz, A. Gestational diabetes mellitus, hypertension, and dyslipidemia as the risk factors of preeclampsia. Sci. Rep.; 2024; 14, pp. 1-11.
53. Celik, N. Welch’s ANOVA: Heteroskedastic skew-t error terms. Commun. Stat. Theory Methods; 2022; 51, pp. 3065-3076.4399884
54. Mujahid, M et al. Data oversampling and imbalanced datasets: an investigation of performance for machine learning and feature engineering. J Big Data; 2024; 11, 87.
55. Cerda, P; Varoquaux, G; Kégl, B. Similarity encoding for learning with dirty categorical variables. Mach. Learn.; 2018; 107, pp. 1477-1494.3835275
56. Alkhawaldeh, IM; Albalkhi, I; Naswhan, AJ. Challenges and limitations of synthetic minority oversampling techniques in machine learning. World J. Methodol.; 2023; 13, pp. 373-378. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/38229946][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10789107]
57. Abedin, MZ; Guotai, C; Hajek, P; Zhang, T. Combining weighted SMOTE with ensemble learning for the class-imbalanced prediction of small business credit risk. Complex Intell. Syst.; 2022; 9, pp. 3559-3579.
58. Kraskov, A; Stögbauer, H; Grassberger, P. Estimating mutual information. Phys. Rev. E; 2004; 69, 2004PhRvE.69f6138K2096503 066138.
59. Shipe, ME; Deppen, SA; Farjah, F; Grogan, EL. Developing prediction models for clinical use using logistic regression: an overview. J. Thorac. Dis.; 2019; 11, pp. S574-S584. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/31032076][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6465431]
60. Montomoli, J et al. Machine learning using the extreme gradient boosting (XGBoost) algorithm predicts 5-day delta of SOFA score at ICU admission in COVID-19 patients. J. Intensive Med.; 2021; 1, pp. 110-116. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/36785563][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8531027]
61. Jabbar, M. A., Deekshatulu, B. L. & Chandra, P. Classification of heart disease using K-nearest neighbor and genetic algorithm. arXiv preprintarXiv:1508.02061 (2015).
62. Zheng, J et al. Clinical data based XGBoost algorithm for infection risk prediction of patients with decompensated cirrhosis: a 10-year (2012–2021) multicenter retrospective case-control study. BMC Gastroenterol.; 2023; 23, pp. 1-13.2023AJ..166..1Z [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/36593456][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9809024]
63. Loef, B et al. Using random forest to identify longitudinal predictors of health in a 30-year cohort study. Sci. Rep.; 2022; 12, pp. 1-12.
64. Wallace, ML et al. Use and misuse of random forest variable importance metrics in medicine: demonstrations through incident stroke prediction. BMC Med. Res. Methodol.; 2023; 23, pp. 1-12.
65. Singh, D; Singh, B. Investigating the impact of data normalization on classification performance. Appl. Soft Comput.; 2019; 97, 105524.
66. Kaliappan, J. et al. Impact of cross-validation on machine learning models for early detection of intrauterine fetal demise. Diagnostics (Basel)13, 1692 (2023).
67. Elgeldawi, E; Sayed, A; Galal, AR; Zaki, AM. Hyperparameter tuning for machine learning algorithms used for Arabic sentiment analysis. Informatics; 2021; 8, 79.
68. Shekar, B. H. & Dagnew, G. Grid search-based hyperparameter tuning and classification of microarray cancer data. In Proc. Int. Conf. Adv. Comput. Commun. Paradigms (ICACCP) (2019). https://doi.org/10.1109/ICACCP.2019.8882943
69. Chadaga, K et al. Explainable artificial intelligence approaches for COVID-19 prognosis prediction using clinical markers. Sci. Rep.; 2024; 14, pp. 1-14.
70. Tougui, I; Jilbab, A; Mhamdi, JE. Impact of the choice of cross-validation techniques on the results of machine learning-based diagnostic applications. Healthc. Inform. Res.; 2021; 27, pp. 189-199. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/34384201][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8369053]
71. Rodríguez-Pérez, R; Bajorath, J. Interpretation of machine learning models using shapley values: application to compound potency and multi-target activity predictions. J. Comput. Aided Mol. Des.; 2020; 34, pp. 1013-1026.2020JCAMD.34.1013R [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/32361862][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7449951]
72. Lundberg, SM et al. From local explanations to global understanding with explainable AI for trees. Nat. Mach. Intell.; 2020; 2, pp. 56-67. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/32607472][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7326367]
73. Abdullakutty, F et al. Histopathology in focus: a review on explainable multi-modal approaches for breast cancer diagnosis. Front. Med.; 2024; 11, 1450103.
74. Javaid, M; Haleem, A; Pratap Singh, R; Suman, R; Rab, S. Significance of machine learning in healthcare: Features, pillars and applications. Int. J. Intell. Netw.; 2022; 3, pp. 58-73.
75. Arain, Z; Iliodromiti, S; Slabaugh, G; David, AL; Chowdhury, TT. Machine learning and disease prediction in obstetrics. Curr. Res. Physiol.; 2023; 6, 1:CAS:528:DC%2BB3sXht1Wnt7jJ [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/37324652][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10265477]100099.
76. Alzubaidi, M et al. Ensemble transfer learning for fetal head analysis: From segmentation to gestational age and weight prediction. Diagnostics; 2022; 12, 2229. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/36140628][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9497941]
77. Hughes, MM; Black, RE; Katz, J. 2500-g low birth weight cutoff: history and implications for future research and policy. Matern. Child Health J.; 2016; 21, 283. [PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5290050]
78. Zeegers, B et al. Birthweight charts customised for maternal height optimises the classification of small and large-for-gestational age newborns. Acta Paediatr.; 2024; [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/39412950][DOI: https://dx.doi.org/10.1111/APA.17332]
79. Teoh, Z.H., Mariapun, J., Ko, V.S.Y., Dominic, N.A., Jeganathan, R., Karalasingam, S.D. & Thirunavuk Arasoo, V.J. Maternal height, and ethnicity and birth weight: A retrospective cohort study of uncomplicated term vaginal deliveries in Malaysia. Birth.51, 620–628 (2024).
80. Sutan, R; Mohtar, M; Mahat, AN; Tamil, AM. Determinant of low birth weight infants: a matched case control study. Open J. Prev. Med.; 2014; 4, pp. 91-99.
81. To, WWK; Cheung, W; Kwok, JSY. Paternal height and weight as determinants of birth weight in a Chinese population. Am. J. Perinatol.; 1998; 15, pp. 545-548.1:STN:280:DyaK1M7gsFamuw%3D%3D [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/9890253]
82. Pölzlberger, E; Hartmann, B; Hafner, E; Stümpflein, I; Kirchengast, S. Maternal height and pre-pregnancy weight status are associated with fetal growth patterns and newborn size. J. Biosoc. Sci.; 2017; 49, pp. 392-407. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/27692008]
83. Ay, L et al. Maternal anthropometrics are associated with fetal size in different periods of pregnancy and at birth: the Generation R Study. BJOG; 2009; 116, pp. 953-963.1:STN:280:DC%2BD1Mvhsl2qtQ%3D%3D [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/19522798]
84. Mathai, M et al. Ethnicity and fetal growth in Fiji. Aust. N. Z. J. Obstet. Gynaecol.; 2004; 44, pp. 318-321. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/15282003]
85. Figueras, F et al. Customized birthweight standards for a Spanish population. Eur. J. Obstet. Gynecol. Reprod. Biol.; 2008; 136, pp. 20-24.1:STN:280:DC%2BD1c%2FhtF2gsg%3D%3D [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/17287065]
86. Britto, RPDA et al. Influence of maternal height and weight on low birth weight: a cross-sectional study in poor communities of northeastern Brazil. PLoS ONE; 2013; 8, e000000.
87. Mongelli, M; Figueras, F; Francis, A; Gardosi, J. A customised birthweight centile calculator developed for an Australian population. Aust. N. Z. J. Obstet. Gynaecol.; 2007; 47, pp. 128-131. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/17355302]
88. McCowan, L; Stewart, AW; Francis, A; Gardosi, J. A customised birthweight centile calculator developed for a New Zealand population. Aust. N. Z. J. Obstet. Gynaecol.; 2004; 44, pp. 428-431. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/15387864]
89. Zhang, G et al. Assessing the causal relationship of maternal height on birth size and gestational age at birth: a Mendelian randomization analysis. PLoS Med.; 2015; 12, e100000.
90. Kaur, S et al. Risk factors for low birth weight among rural and urban Malaysian women. BMC Public Health; 2019; 19, 539. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/31196034][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6565527]
91. Inoue, S et al. Association between short maternal height and low birth weight: a hospital-based study in Japan. J. Korean Med. Sci.; 2016; 31, pp. 353-359.1:CAS:528:DC%2BC2sXhtVyisr8%3D [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/26955234][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4779858]
92. Retnakaran, R et al. Association of timing of weight gain in pregnancy with infant birth weight. JAMA Pediatr.; 2018; 172, pp. 136-142. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/29279903]
93. Mohapatra, I; Harshini, N; Samantaray, SR; Naik, G. Association between early pregnancy body mass index and gestational weight gain in relation to neonatal birth weight. Cureus; 2022; 14, e23000.
94. Arora, P; Tamber Aeri, B. Gestational weight gain among healthy pregnant women from Asia in comparison with Institute of Medicine (IOM) Guidelines-2009: a systematic review. J. Pregnancy; 2019; 2019, 3849596. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/30941218][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6421048]
95. Hackmon, R et al. Do early fetal measurements and nuchal translucency correlate with term birth weight?. J. Obstet. Gynaecol. Can.; 2017; 39, pp. 750-756. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/28733063]
96. Jackson, M; Rose, NC. Diagnosis and management of fetal nuchal translucency. Semin. Roentgenol.; 1998; 33, pp. 333-338.1:STN:280:DyaK1M%2Fhs1Khtw%3D%3D [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/9800243]
97. Kalem, Z; Kaya, AE; Bakırarar, B; Kalem, MN. Fetal nuchal translucency: is there an association with birthweight and neonatal wellbeing?. Turk. J. Obstet. Gynecol.; 2019; 16, pp. 35-40. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/31019838][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6463433]
98. Napolitano, R et al. Pregnancy dating by fetal crown–rump length: a systematic review of charts. BJOG; 2014; 121, pp. 556-565.1:STN:280:DC%2BC2czhslaktg%3D%3D [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/24387345]
99. Fries, N et al. The impact of optimal dating on the assessment of fetal growth. BMC Pregnancy Childbirth; 2021; 21, pp. 1-8.
100. Salomon, LJ. Early fetal growth: concepts and pitfalls. Ultrasound Obstet. Gynecol.; 2010; 35, pp. 385-389.1:STN:280:DC%2BC3c3ktl2jtA%3D%3D [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/20373482]
101. Papageorghiou, AT et al. International standards for early fetal size and pregnancy dating based on ultrasound measurement of crown–rump length in the first trimester of pregnancy. Ultrasound Obstet. Gynecol.; 2014; 44, 641.1:STN:280:DC%2BC2cbktFGktg%3D%3D [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/25044000][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4286014]
102. Kozuki, N et al. The associations of parity and maternal age with small-for-gestational-age, preterm, and neonatal and infant mortality: A meta-analysis. BMC Public Health; 2013; 13,
103. Hinkle, SN et al. The association between parity and birthweight in a longitudinal consecutive pregnancy cohort. Paediatr. Perinat. Epidemiol.; 2013; 28, 106. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/24320682][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3922415]
104. Krulewitch, CJ; Herman, AA; Yu, KF; Johnson, YR. Does changing paternity contribute to the risk of intrauterine growth retardation?. Paediatr. Perinat. Epidemiol.; 1997; 11, pp. 41-47. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/9018714]
105. Luo, ZC et al. The effects and mechanisms of primiparity on the risk of pre-eclampsia: a systematic review. Paediatr. Perinat. Epidemiol.; 2007; 21, pp. 36-45. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/17593196]
106. Sørnes, T; Bakke, T. Uterine size, parity and umbilical cord length. Acta Obstet. Gynecol. Scand.; 1989; 68, pp. 439-441. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/2520789]
107. Clapp, JF; Capeless, E. Cardiovascular function before, during, and after the first and subsequent pregnancies. Am. J. Cardiol.; 1997; 80, pp. 1469-1473. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/9399724]
108. Dasa, TT; Okunlola, MA; Dessie, Y. Effect of grand multiparity on the adverse birth outcome: a hospital-based prospective cohort study in Sidama Region, Ethiopia. Int. J. Womens Health; 2022; 14, 363. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/35300284][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8923638]
109. Lin, L; Lu, C; Chen, W; Li, C; Guo, VY. Parity and the risks of adverse birth outcomes: a retrospective study among Chinese. BMC Pregnancy Childbirth; 2021; 21, pp. 1-11.1:CAS:528:DC%2BB3MXltlaksrs%3D
110. Baer, RJ; Lyell, DJ; Norton, ME; Currier, RJ; Jelliffe-Pawlowski, LL. First trimester pregnancy-associated plasma protein-A and birth weight. Eur. J. Obstet. Gynecol. Reprod. Biol.; 2016; 198, pp. 1-6.1:CAS:528:DC%2BC28XitFemtA%3D%3D [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/26773241]
111. Turrado Sánchez, EM; De Miguel Sánchez, V; Macía Cortiñas, M. Correlation between PAPP-A levels determined during the first trimester and birth weight at full-term. Reprod. Sci.; 2023; 30, 3235. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/37237249]
112. Zhao, D et al. Association between maternal blood glucose levels during pregnancy and birth outcomes: a birth cohort study. Int. J. Environ. Res. Public Health; 2023; 20, 2102.1:CAS:528:DC%2BB3sXjsVSgur4%3D [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/36767469][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9915873]
113. Everett, TR; Lees, CC. Beyond the placental bed: placental and systemic determinants of the uterine artery Doppler waveform. Placenta; 2012; 33, pp. 893-901.1:STN:280:DC%2BC38fovFansA%3D%3D [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/22902007]
114. Liu, Y et al. Impact of gestational hypertension and preeclampsia on low birthweight and small-for-gestational-age infants in China: a large prospective cohort study. J. Clin. Hypertens.; 2021; 23, 835.1:CAS:528:DC%2BB3MXotFWgsb8%3D
115. Gibbs, CM; Wendt, A; Peters, S; Hogue, CJ. The impact of early age at first childbirth on maternal and infant health. Paediatr. Perinat. Epidemiol.; 2012; 26, pp. 259-284. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/22742615][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4562289]
116. Markovitz, BP; Cook, R; Flick, LH; Leet, TL. Socioeconomic factors and adolescent pregnancy outcomes: distinctions between neonatal and post-neonatal deaths?. BMC Public Health; 2005; 5, 279.
117. Sharma, V et al. Young maternal age and the risk of neonatal mortality in rural Nepal. Arch. Pediatr. Adolesc. Med.; 2008; 162, pp. 828-835. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/18762599][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2535853]
118. Alam, N. Teenage motherhood and infant mortality in Bangladesh: maternal age-dependent effect of parity one. J. Biosoc. Sci.; 2000; 32, pp. 229-236.1:STN:280:DC%2BD3c3ivVChuw%3D%3D [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/10765612]
119. Restrepo-Méndez, MC et al. Childbearing during adolescence and offspring mortality: findings from three population-based cohorts in southern Brazil. BMC Public Health; 2011; 11, 781. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/21985467][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3207956]
120. Paranjothy, S; Broughton, H; Adappa, R; Fone, D. Teenage pregnancy: who suffers?. Arch. Dis. Child.; 2009; 94, pp. 239-245.1:STN:280:DC%2BD1M7ktlKrtg%3D%3D [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/19019886]
121. Lawlor, DA; Mortensen, L; Andersen, AMN. Mechanisms underlying the associations of maternal age with adverse perinatal outcomes: a sibling study of 264,695 Danish women and their firstborn offspring. Int. J. Epidemiol.; 2011; 40, pp. 1205-1214. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/21752786]
122. Fall, CHD et al. Association between maternal age at childbirth and child and adult outcomes in the offspring: a prospective study in five low-income and middle-income countries (COHORTS collaboration). Lancet Glob. Health; 2015; 3, pp. e366-e377. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/25999096]
123. Alves, C., Jenkins, S. M. & Rapp, A. Early pregnancy loss (spontaneous abortion) (StatPearls, 2023).
124. Jackson, T; Watkins, E. Early pregnancy loss. J. Am. Acad. Physician Assist.; 2021; 34, pp. 22-27.
125. Nicolaides, KH. A model for a new pyramid of prenatal care based on the 11 to 13 weeks’ assessment. Prenat. Diagn.; 2011; 31, pp. 3-6. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/21210474]
126. Bekele, WT. Machine learning algorithms for predicting low birth weight in Ethiopia. BMC Med. Inform. Decis. Mak.; 2022; 22, pp. 1-16.
127. Arayeshgari, M; Najafi-Ghobadi, S; Tarhsaz, H; Parami, S; Tapak, L. Machine learning-based classifiers for the prediction of low birth weight. Healthc. Inform. Res.; 2023; 29, 54. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/36792101][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9932310]
128. Pollob, SMAI; Abedin, MM; Islam, MT; Islam, MM; Maniruzzaman, M. Predicting risks of low birth weight in Bangladesh with machine learning. PLoS ONE; 2022; 17, e000000.
129. Khan, W et al. Infant birth weight estimation and low birth weight classification in United Arab Emirates using machine learning algorithms. Sci. Rep.; 2022; 12, pp. 1-12.2022NatSR.12..1K
130. Hussain, Z; Borah, MD. Birth weight prediction of newborn baby with application of machine learning techniques on features of mother. J. Stat. Manag. Syst.; 2020; 23, pp. 1079-1091.
131. Alabbad, DA et al. Birthweight range prediction and classification: a machine learning-based sustainable approach. Mach. Learn. Knowl. Extr.; 2024; 6, pp. 770-788.
132. Dola, SS; Valderrama, CE. Exploring parental factors influencing low birth weight on the 2022 CDC natality dataset. BMC Med. Inform. Decis. Mak.; 2024; 24, 367. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/39616348][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11608488]
133. Liu, Q et al. Machine learning approaches for predicting fetal macrosomia at different stages of pregnancy: a retrospective study in China. BMC Pregnancy Childbirth; 2025; 25, pp. 1-10.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© The Author(s) 2025. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
Low birthweight (LBW) is a significant health challenge worldwide, as these neonates experience both short- and long-term disabilities. Factors affecting maternal and fetal health during early to mid-pregnancy can greatly influence fetal development. Prediction of birthweight using machine learning (ML) models with antenatal data may help in better clinical management. However, the lack of explainability in these models has raised concerns within the medical community. To address this issue, our study aims to develop a more practical ML model by incorporating explainable artificial intelligence (XAI). We prospectively collected real-world clinical data of 19 maternal and fetal clinical features from 237 singleton pregnancies. Statistical analyses were conducted using Jamovi (version: 2.6.26) and JASP team (2024) JASP (version: 0.18.3). Multiple ML classifiers were employed. We developed a stacked ensemble model that integrated various algorithms, including a custom-stacked ensemble approach and three XAI methodologies: Shapley Additive Explanations (SHAP), Local Interpretable Model-agnostic Explanations (LIME), and Anchor. These methods provided meaningful explanations to help construct reliable and optimal clinical predictive models. Among the ML classifiers evaluated, the AdaBoost model achieved the highest performance, with a maximum accuracy of 77%, a precision of 73%, a recall of 77%, and an F1 score of 72%. The stacked model demonstrated an accuracy of 75%, indicating its possibility in clinical application. However, the accuracy of these models might be affected by the limited dataset, which included pregnant women undergoing treatment for thyroid abnormalities, diabetes, and hypertension. Our developed model identified several key attributes that influence birthweight, such as maternal height, nuchal translucency thickness, parity, crown-rump length, glycated hemoglobin, hypertensive disorders of pregnancy, and pregnancy-associated plasma protein A. This model can assist medical professionals in making more precise birthweight predictions using routinely collected antenatal parameters, enabling timely medical decisions and treatments.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details
1 Manipal Academy of Higher Education, Department of Obstetrics and Gynecology,Dr TMA Pai Hospital (Udupi), Melaka Manipal Medical College, Manipal, India (GRID:grid.411639.8) (ISNI:0000 0001 0571 5193)
2 Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal, India (GRID:grid.411639.8) (ISNI:0000 0001 0571 5193)
3 Statistics, Manipal College of Health Professions, Manipal Academy of Higher Education, Manipal, India (GRID:grid.411639.8) (ISNI:0000 0001 0571 5193)
4 Manipal Academy of Higher Education, Division of Fetal Medicine Department of Obstetrics and Gynaecology, Kasturba Medical College, Manipal, India (GRID:grid.411639.8) (ISNI:0000 0001 0571 5193)
5 Department of Physiotherapy, Manipal College of Health Professions, Manipal Academy of Higher Education, Manipal, India (GRID:grid.411639.8) (ISNI:0000 0001 0571 5193)