Introduction
Preterm birth (PTB) is defined as “birth before 37 weeks of gestation” [1] (abbreviations listed in S1 Table). It consists of PTB with premature rupture of membranes (PROM), preterm labor and birth without PROM and other PTB. Its frequency varies from 5% to 9% in developed countries but in general it has increased with the growth of indicated PTB and due to other factors, e.g., 9.5% in 1981 to 12.7% in 2005 for the United States [1]. Several studies reported that preterm has positive associations with maternal depression and stress [2–5]. Based on a review, preterm birth was a major risk factor for maternal depression, neurodevelopmental delay and childhood disability [2]. According to an animal experiment, prenatal maternal stress was a major cause of preterm birth and neonatal immunity in mice [3]. The results of the studies above agree with those of two other reviews stating that maternal depression and stress during pregnancy affect preterm birth [4, 5].
Likewise, temporomandibular disorder (TMD) and gastrointestinal disease are expected to have positive associations with depression and stress [6–11]. TMD can be defined as “disorder affecting joints and muscles” [6]. Its symptoms are joint pain, muscle pain and limited mouth opening, whereas its etiological causes are occlusion, trauma, pain stimulus, parafunctional activity and psychological stress [6]. A prospective study of female students before the university entrance reported a positive relationship between stress and TMD [7]. This positive relationship between stress and TMD was affirmed by a review of 33 original studies [8] and a survey of 112 participants in a general hospital [9]. Also, two other reviews highlighted a positive association between depression and gastrointestinal disease such as gastroesophageal reflux disease (GERD) and irritable bowel syndrome [10, 11]. Based on the findings above, one would expect a positive relationship among PTB, TMD and GERD. But no literature has been available on this topic. In this context, this study employs machine learning analysis with population data for the associations of PTB with TMD and gastrointestinal diseases.
Methods
Participants and variables
The source of the population-based retrospective cohort was Korea National Health Insurance claims for 489,893 primiparous women with delivery at the age of 25–40 in 2017. This retrospective cohort study was approved by the Institutional Review Board (IRB) of Korea University Anam Hospital on June 12, 2023 (2020AN0014). Informed consent was waived by the IRB. The data were accessed for the research during July 1, 2023-August 31, 2023. The authors did not have access to information that could identify individual participants during or after data collection. The dependent variables were four categories of PTB (birth before 37 weeks of gestation) in 2017 based on the ICD-10 Code: PTB 1—PTB with PROM only; PTB 2—preterm labor and birth without PROM; PTB 3—PTB 1 or PTB 2; PTB 4—PTB 3 or other indicated PTB (S2 Table). The 21 independent variables were: (1) two demographic/socioeconomic predictors in 2016, i.e., age, low socioeconomic status with the range of 1 (the highest) to 20 (the lowest) in terms of an insurance fee; (2) five dental diseases for any of the years 2002–2016, including dental cavity, periodontitis, salivary gland disease, tooth loss, TMD; (3) four gastrointestinal diseases for any of the years 2002–2016, i.e., Crohn’s disease, GERD, irritable bowel syndrome, ulcerative colitis; (4) four obstetric conditions for any of the years 2002–2016, that is, infertility, hypertension, diabetes, gestational diabetes; (5) six medication predictors in 2016, i.e., benzodiazepine, calcium channel blocker, nitrate, progesterone, sleeping pill, tricyclic antidepressant. These 21 independent variables were selected according to previous studies and data availability. The disease and medication data were screened from ICD-10 and ATC codes, respectively (S2 and S3 Tables).
Machine learning analysis
Logistic regression and the random forest were used for the prediction of PTB. A random forest is a group of decision trees which make majority votes on the dependent variable (“bootstrap aggregation”) [12, 13]. The 489,893 cases with full information were divided into training and validation sets with an 80:20 ratio (391,914 vs. 97,979 cases). The validation criteria were accuracy (a ratio of correct predictions among 97,979 cases) and the area under the receiver-operating-characteristic curve (AUC) (area under the plot of sensitivity vs. 1—specificity). Random forest variable importance was derived for finding important predictors of PTB and evaluating the strengths of its associations with the predictors. Random forest Shapley Additive Explanation (SHAP) values were calculated to analyze the directions of these associations. The permutation importance of a predictor indicates the decrease of model accuracy from data permutation on the predictor. It is an average over all trees with a value of 0 to 1 in the case of the random forest. The SHAP value of a predictor for a participant measures the difference between what machine learning predicts for the probability of PTB with and without the predictor. For example, let us assume that the SHAP values of TMD for PTB have the range of (−0.10, 0.23). Here, some participants have SHAP values as low as −0.10, and other participants have SHAP values as high as 0.23. The inclusion of a predictor (TMD) into machine learning will decrease or increase the probability of the dependent variable (PTB) by the range of −0.10 and 0.23. In other words, there exists a positive association between TMD and PTB in general [12].
In practice, experts in artificial intelligence use random forest permutation importance to derive the rankings and values of all predictors for the prediction of the dependent variable. Then, they employ the SHAP plots to evaluate the directions of associations between the predictors and the dependent variable. Linear or logistic regression used to play this role before the SHAP approach took it over. This is because the SHAP approach has a notable strength compared to linear or logistic regression: the former considers all realistic scenarios, unlike the latter with an unrealistic assumption of ceteris paribus, i.e., “all the other variables staying constant”. Let us assume that there are three predictors of PTB, i.e., low socioeconomic status, GERD and TMD. As defined above, the SHAP value of TMD for PTB for a particular participant is the difference between what machine learning predicts for the probability of PTB with and without TMD for the participant. Here, the SHAP value for the participant is the average of the following four scenarios for the participant: (1) low socioeconomic status excluded, GERD excluded; (2) low socioeconomic status included, GERD excluded; (3) low socioeconomic status excluded, GERD included; and (4) low socioeconomic status included, GERD included (13). Finally, it can be noted that R-Studio 1.3.959 (R-Studio Inc.: Boston, United States) was employed for the analysis during January 1, 2023-February 28, 2023.
Results
Descriptive statistics are shown in Table 1. The corresponding means and standard deviations of age and socioeconomic status in 2016 were 31.10 and 3.59. The respective proportions of those with GERD, TMD and tricyclic antidepressants during 2002–2016 were 42.1% (207,804), 3.9% (19,345) and 10.3% (50,305). Model performance is presented in Table 2. The random forest with oversampling registered much higher AUC compared to logistic regression with oversampling, i.e., 80.4% vs. 51.6% (PTB1), 86.2% vs. 57.2% (PTB2), 79.3% vs. 52.7% (PTB3), 79.3% vs. 53.1% (PTB4). Based on random forest variable importance values and rankings in Table 3, PTB1, PTB2, PTB3 and PTB4 have strong associations with low socioeconomic status, GERD, age, infertility, irritable bowel syndrome, diabetes, TMD, salivary gland disease, hypertension, tricyclic antidepressant and benzodiazepine. In terms of max SHAP values for PTB1, PTB2, PTB3 and PTB4 in Table 4, these associations were positive, e.g., low socioeconomic status (0.29), age (0.21), GERD (0.27) and TMD (0.23) for PTB4 in Table 4. The inclusion of low socioeconomic status, age, GERD or TMD into the random forest will increase the probability of PTB4 by 0.29, 0.21, 0.27 or 0.23.
[Figure omitted. See PDF.]
[Figure omitted. See PDF.]
[Figure omitted. See PDF.]
[Figure omitted. See PDF.]
The positive association between PTB and TMD was more apparent in Figs 1–4. Here, points with low TMD values and low SHAP values for PTB were positioned in the left bottom, while points with high TMD values and high SHAP values for PTB were positioned in the right top (Figs 1–4). These figures are called the SHAP dependence plots of TMD vs. PTB. In these figures, the blue (or red) color represents the absence (or presence) of tricyclic antidepressant for a participant, which was found to have the highest correlation with TMD for the prediction of PTB. Here, points with low TMD values, the absence of tricyclic antidepressant and low SHAP values for PTB were positioned in the left bottom, whereas points with high TMD values, the presence of tricyclic antidepressant and high SHAP values for PTB were positioned in the right top.
[Figure omitted. See PDF.]
Legend: The positive association between PTB (preterm birth) and TMD (temporomandibular disease) was more apparent in Figs 1–4. Here, points with low TMD values and low SHAP values for PTB were positioned in the left bottom, while points with high TMD values and high SHAP values for PTB were positioned in the right top. These figures are called the SHAP dependence plots of TMD vs. PTB. In these figures, the blue (or red) color represents the absence (or presence) of TCA (tricyclic antidepressant) for a participant, which was found to have the highest correlation with TMD for the prediction of PTB. Here, points with low TMD values, the absence of TCA and low SHAP values for PTB were positioned in the left bottom, whereas points with high TMD values, the presence of TCA and high SHAP values for PTB were positioned in the right top.
[Figure omitted. See PDF.]
[Figure omitted. See PDF.]
[Figure omitted. See PDF.]
Discussion
Summary
Based on random forest variable importance values and rankings, PTB has strong associations with low socioeconomic status, GERD, age, infertility, irritable bowel syndrome, diabetes, TMD, salivary gland disease, hypertension, tricyclic antidepressant and benzodiazepine. Specifically, the positive association among PTB, TMD and tricyclic antidepressant was more apparent in the SHAP dependence plot: points with low TMD values, the absence of tricyclic antidepressant and low SHAP values for PTB were positioned in the left bottom, whereas points with high TMD values, the presence of tricyclic antidepressant and high SHAP values for PTB were positioned in the right top.
Contributions
The results of previous studies based on self-reported questionnaires [14–17] has been mixed on an association between PTB and TMD. The relationship was not statistically significant between control and experimental adolescents in some examinations [14, 15], but positive in another investigation [16]. Indeed, a systematic review confirmed the former finding (no significance) [17]. The unique contribution of this study is that it used machine learning and population data for confirming the positive association between PTB and TMD. One plausible pathway between PTB and TMD would be vitamin deficiency as in the case of PTB-oral hygiene status [18]. Moreover, the findings of explainable artificial intelligence (SHAP) in this study sheds new light on an association among PTB, TMD and antidepressant medication (depression). A systematic review and a prospective cohort study reported that pregnant women experience intense stress and this becomes a significant risk factor for their PTB [19, 20]. Pregnant women would take antidepressants to relieve their mental burden but this is expected to cause a vicious cycle of furthering the risk of PTB: Three systematic reviews highlighted a positive relationship between antidepressant medication and PTB [21–23]. As addressed above, in a similar context, a prospective study of female students before the university entrance reported a positive association between stress and TMD [7]. This positive association between stress and TMD was affirmed by a review of 33 original studies [8] and a survey of 112 participants in a general hospital [9]. This study extends the horizon of existing literature by employing explainable artificial intelligence and population data for identifying an association among PTB, TMD, GERD and antidepressant medication (depression) together. Previous studies highlight behavioral, infectious, neuroendocrine and neuroinflammatory mechanisms between stress and PTB [24]. Likewise, one possible pathway between stress and TMD would be the hypothalamic–pituitary–adrenal axis, the serotoninergic and opioid systems [25]. These statements can be extended to include GERD. No examination has been done and more investigation is needed in this direction.
Limitations
This study had some limitations. Firstly, this study did not examine etiological differences between preterm birth due to maternal and fetal indication, and preterm birth due to spontaneous preterm labor. Secondly, this study did not investigate possible mediating effects among independent variables. Thirdly, this study did not use recurrent neural networks (deep learning models) for the limited capacity of the computer server in the Korea National Health Insurance Service data analysis center during the study period. Fourthly, this study did not cover parameter tuning, using the default hyper-parameters of the random forest, i.e., the number of trees 100, the splitting criterion of GINI, the max depth of trees undetermined.
Conclusions
A cutting-edge approach of explainable artificial intelligence highlights the strong associations of preterm birth with temporomandibular disorder, gastrointestinal diseases and antidepressant medication. Close surveillance is needed for pregnant women regarding these multiple risks at the same time.
Supporting information
S1 Table. Abbreviations.
https://doi.org/10.1371/journal.pone.0296329.s001
(DOC)
S2 Table. ICD-10 code for preterm birth, temporomandibular disorder and gastrointestinal diseases.
https://doi.org/10.1371/journal.pone.0296329.s002
(DOC)
S3 Table. ATC code for medication.
https://doi.org/10.1371/journal.pone.0296329.s003
(DOC)
Citation: Lee K-S, Song I-S, Kim ES, Kim J, Jung S, Nam S, et al. (2024) Machine learning analysis with population data for the associations of preterm birth with temporomandibular disorder and gastrointestinal diseases. PLoS ONE 19(1): e0296329. https://doi.org/10.1371/journal.pone.0296329
About the Authors:
Kwang-Sig Lee
Contributed equally to this work with: Kwang-Sig Lee, In-Seok Song
Roles: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Validation, Visualization, Writing – original draft, Writing – review & editing
Affiliation: AI Center, Korea University College of Medicine, Anam Hospital, Seoul, Korea
In-Seok Song
Contributed equally to this work with: Kwang-Sig Lee, In-Seok Song
Roles: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Project administration, Resources, Software, Validation, Visualization, Writing – original draft, Writing – review & editing
Affiliation: Department of Oral and Maxillofacial Surgery, Korea University Anam Hospital, Seoul, Korea
Eun Sun Kim
Roles: Conceptualization, Funding acquisition, Writing – original draft, Writing – review & editing
Affiliation: Department of Gastroenterology, Korea University College of Medicine, Anam Hospital, Seoul, Korea
Jisu Kim
Roles: Data curation, Formal analysis, Investigation, Methodology, Validation, Visualization
Affiliations: AI Center, Korea University College of Medicine, Anam Hospital, Seoul, Korea, Department of Statistics, Korea University College of Political Science and Economics, Seoul, Korea, Department of Obstetrics and Gynecology, Korea University College of Medicine, Anam Hospital, Seoul, Korea
Sohee Jung
Roles: Data curation, Formal analysis, Investigation, Methodology, Validation, Visualization
Affiliations: AI Center, Korea University College of Medicine, Anam Hospital, Seoul, Korea, Department of Statistics, Korea University College of Political Science and Economics, Seoul, Korea, Department of Obstetrics and Gynecology, Korea University College of Medicine, Anam Hospital, Seoul, Korea
Sunwoo Nam
Roles: Data curation, Formal analysis, Investigation, Methodology, Software, Validation, Visualization
Affiliations: AI Center, Korea University College of Medicine, Anam Hospital, Seoul, Korea, Department of Obstetrics and Gynecology, Korea University College of Medicine, Anam Hospital, Seoul, Korea
Ki Hoon Ahn
Roles: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing
E-mail: [email protected]
Affiliation: Department of Obstetrics and Gynecology, Korea University College of Medicine, Anam Hospital, Seoul, Korea
ORICD: https://orcid.org/0000-0002-6314-4621
1. Goldenberg RL, Culhane JF, Iams JD, Romero R. Epidemiology and causes of preterm birth. Lancet. 2008;371(9606):75–84. pmid:18177778
2. Anderson C, Cacola P. Implications of preterm birth for maternal mental health and infant development. MCN Am J Matern Child Nurs. 2017;42(2):108–14. pmid:27879500
3. Garcia-Flores V, Romero R, Furcron AE, Levenson D, Galaz J, Zou C, et al. Prenatal maternal stress causes preterm birth and affects neonatal adaptive immunity in mice. Front Immunol. 2020;11:254. pmid:32174914
4. Staneva A, Bogossian F, Pritchard M, Wittkowski A. The effects of maternal depression, anxiety, and perceived stress during pregnancy on preterm birth: A systematic review. Women Birth. 2015;28(3):179–93. pmid:25765470
5. Wadhwa PD, Entringer S, Buss C, Lu MC. The contribution of maternal stress to preterm birth: issues and considerations. Clin Perinatol. 2011;38(3):351–84. pmid:21890014
6. Minervini G, Franco R, Marrapodi MM, Crimi S, Badnjević A, Cervino G, et al. Correlation between temporomandibular disorders (TMD) and posture evaluated trough the diagnostic criteria for temporomandibular disorders (DC/TMD): A systematic review with meta-analysis. J Clin Med. 2023;12(7):2652. pmid:37048735
7. Mottaghi A, Razavi SM, Pozveh EZ, Jahangirmoghaddam M. Assessment of the relationship between stress and temporomandibular joint disorder in female students before university entrance exam (Konkour exam). Dent Res J (Isfahan). 2011;8(Suppl 1):S76–9. pmid:23372600
8. Dos Santos EA, Peinado BRR, Frazão DR, Né YGS, Fagundes NCF, Magno MB, et al. Association between temporomandibular disorders and anxiety: A systematic review. Front Psychiatry. 2022;13:990430. pmid:36311527
9. Lee YH, Auh QS. Clinical factors affecting depression in patients with painful temporomandibular disorders during the COVID-19 pandemic. Sci Rep. 2022;12(1):14667. pmid:36038574
10. Mizyed I, Fass SS, Fass R. Review article: gastro-oesophageal reflux disease and psychological comorbidity. Aliment Pharmacol Ther. 2009;29(4):351–8. pmid:19035971
11. Fond G, Loundou A, Hamdani N, Boukouaci W, Dargel A, Oliveira J, et al. Anxiety and depression comorbidities in irritable bowel syndrome (IBS): a systematic review and meta-analysis. Eur Arch Psychiatry Clin Neurosci. 2014;264(8):651–60. pmid:24705634
12. Lee KS, Ham BJ. Machine learning on early diagnosis of depression. Psychiatry Investig. 2022;19(8):597–605. pmid:36059048
13. Lee KS, Kim ES. Explainable artificial intelligence in the early diagnosis of gastrointestinal disease. Diagnostics (Basel). 2022;12(11). pmid:36359583
14. Nilsson IM, Brogårdh-Roth S, Månsson J, Ekberg E. Temporomandibular pain in adolescents with a history of preterm birth. J Oral Rehabil. 2019;46(7):589–96. pmid:30807654
15. Paulsson L, Ekberg E, Nilner M, Bondemark L. Mandibular function, temporomandibular disorders, and headache in prematurely born children. Acta Odontol Scand. 2009;67(1):30–7. pmid:18991148
16. Brogårdh-Roth S, Paulsson L, Larsson P, Ekberg E. Do preterm-born adolescents have a poorer oral health-related quality of life? BMC Oral Health. 2021;21(1):440. pmid:34503481
17. Minervini G, Franco R, Marrapodi MM, Fiorillo L, Cervino G, Cicciù M. Prevalence of temporomandibular disorders (TMD) in pregnancy: A systematic review with meta-analysis. J Oral Rehabil. 2023;50(7):627–634. pmid:37021601
18. Ferrillo M, Migliario M, Roccuzzo A, Molinero-Mourelle P, Falcicchio G, Umano GR, et al. Periodontal disease and vitamin D deficiency in pregnant women: Which correlation with preterm and low-weight birth? J Clin Med. 2021;10(19):4578. pmid:34640596
19. Shapiro GD, Fraser WD, Frasch MG, Séguin JR. Psychosocial stress in pregnancy and preterm birth: associations and mechanisms. J Perinat Med. 2013;41(6):631–45. pmid:24216160
20. Dole N, Savitz DA, Hertz-Picciotto I, Siega-Riz AM, McMahon MJ, Buekens P. Maternal stress and preterm birth. Am J Epidemiol. 2003;157(1):14–24. pmid:12505886
21. Huybrechts KF, Sanghani RS, Avorn J, Urato AC. Preterm birth and antidepressant medication use during pregnancy: a systematic review and meta-analysis. PLoS One. 2014;9(3):e92778. pmid:24671232
22. Huang H, Coleman S, Bridge JA, Yonkers K, Katon W. A meta-analysis of the relationship between antidepressant use in pregnancy and the risk of preterm birth and low birth weight. Gen Hosp Psychiatry. 2014;36(1):13–8. pmid:24094568
23. Chang Q, Ma XY, Xu XR, Su H, Wu QJ, Zhao YH. Antidepressant Use in Depressed Women During Pregnancy and the Risk of Preterm Birth: A Systematic Review and Meta-Analysis of 23 Cohort Studies. Front Pharmacol. 2020;11:659. pmid:32508635
24. Shapiro GD, Fraser WD, Frasch MG, Séguin JR. Psychosocial stress in pregnancy and preterm birth: associations and mechanisms. J Perinat Med. 2013;41(6):631–45. pmid:24216160
25. Gameiro GH, da Silva Andrade A, Nouer DF, Ferraz de Arruda Veiga MC. How may stressful experiences contribute to the development of temporomandibular disorders? Clin Oral Investig. 2006;10(4):261–8. pmid:16924558
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2024 Lee et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
This study employs machine learning analysis with population data for the associations of preterm birth (PTB) with temporomandibular disorder (TMD) and gastrointestinal diseases. The source of the population-based retrospective cohort was Korea National Health Insurance claims for 489,893 primiparous women with delivery at the age of 25–40 in 2017. The dependent variable was PTB in 2017. Twenty-one predictors were included, i.e., demographic, socioeconomic, disease and medication information during 2002–2016. Random forest variable importance was derived for finding important predictors of PTB and evaluating its associations with the predictors including TMD and gastroesophageal reflux disease (GERD). Shapley Additive Explanation (SHAP) values were calculated to analyze the directions of these associations. The random forest with oversampling registered a much higher area under the receiver-operating-characteristic curve compared to logistic regression with oversampling, i.e., 79.3% vs. 53.1%. According to random forest variable importance values and rankings, PTB has strong associations with low socioeconomic status, GERD, age, infertility, irritable bowel syndrome, diabetes, TMD, salivary gland disease, hypertension, tricyclic antidepressant and benzodiazepine. In terms of max SHAP values, these associations were positive, e.g., low socioeconomic status (0.29), age (0.21), GERD (0.27) and TMD (0.23). The inclusion of low socioeconomic status, age, GERD or TMD into the random forest will increase the probability of PTB by 0.29, 0.21, 0.27 or 0.23. A cutting-edge approach of explainable artificial intelligence highlights the strong associations of preterm birth with temporomandibular disorder, gastrointestinal diseases and antidepressant medication. Close surveillance is needed for pregnant women regarding these multiple risks at the same time.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer