Introduction
Root caries is a destructive lesion of the hard tissues that occurs on the root surface.1 Due to the lower mineral content of root surfaces, they are more prone to demineralization.2 The pathogenesis of root caries is closely associated with root surface exposure caused by gingival recession. Aging is accompanied by unique oral physiological and behavioral characteristics, such as gingival recession due to periodontitis, reduced salivary secretion, and changes in oral hygiene habits.3,4 These factors create an environment conducive to microbial colonization and biofilm formation,5 thus increasing the risk of root caries.6,7 With global aging, improved tooth retention has been achieved, yet the burden of root caries among older adults remains substantial, posing a major public health challenge in dentistry. This highlights the importance of root caries prevention and management.2,8 However, early-stage changes in root caries are often subtle, and the primary diagnostic method—visual-tactile examination—is subject to the subjectivity of examiner.2 Adjunctive diagnostic tools, such as X-ray and fluorescence-based methods, lack strong evidence to support their efficacy.9 Thus, the diagnosis of root caries, particularly in its early stages, remains controversial.9,10 Those show that the critical need for risk prediction in root caries. Identifying high-risk group can facilitate targeted prevention and management.
The k-prototypes clustering algorithm is an unsupervised machine learning approach for analyzing mixed-type data. This algorithm enables simultaneous clustering of both quantitative and qualitative variables, making it a novel application in the medical field in recent years.11 By identifying latent grouping patterns within the data, it allows direct recognition and classification of population characteristics without the need for predefined outcome indicators.12 This method is particularly advantageous for exploring complex factors associated with high-risk group and identifying associated factors.11 Unsupervised machine learning has been studied for identifying high-risk group in various diseases, such as acute exacerbation of chronic obstructive pulmonary disease,13 nonspecific neck pain,14 heart failure,15 and immune-related liver injury.16 However, there is no application of this method for identifying risk factors and high-risk group for root caries.
The objective of this study is to apply the k-prototypes clustering algorithm to identify high-risk group for root caries among older adults. By integrating binary logistic regression model, the study further investigates associated factors, providing a scientific basis for the precise identification of high-risk group and the optimization of oral health intervention strategies for older adults. The study hypothesizes that: (1) The k-prototypes clustering results are stable and effective in identifying high-risk group for root caries. (2) Multidimensional factors (such as periodontal and coronal conditions, oral hygiene behaviors) play significant roles in the risk of root caries.
Methods
Source of Data and Participants
The data for this study were derived from the 2021 oral health surveillance project for key populations in Guangdong Province, China, a representative cross-sectional study.17 Prior to participation, all individuals were informed about the study details and signed written informed consent forms. This study strictly was approved by the Stomatological Ethics Committee of the Chinese Stomatological Association (Permit Number: 2014–003), and complied with the Declaration of Helsinki.
423 participants were recruited from community populations. The inclusion criteria were: adults aged 65 to 74 years; residents of the communities covered by the study who had not been away from the community for more than six months; voluntary participation with signed informed consent; ability to undergo oral health examinations and complete the questionnaires; and absence of severe mental or cognitive impairments.
Exclusion criteria included: individuals with severe systemic diseases that prevented them from completing the study (such as advanced cancer, end-stage renal disease); those who had undergone severe oral surgeries or treatments affecting oral health within the past six months; individuals with severe cognitive impairments preventing comprehension of the study or completion of the questionnaires; and those who refused to sign the informed consent or withdrew from the study during the research process.
Study Design
The sample size was calculated using the formula: . In the formula, N represents the sample size, and the design effect (deff) was set at 4.5. The significance level α was set at 0.05, corresponding to a u value of 1.96 when the cumulative probability in the standard normal distribution is α/2. The relative allowable error (δ) for the expected prevalence (p) was controlled at 10%. Based on this calculation, the minimum required sample size for the survey was initially determined to be 390 participants. To account for potential non-responses, the non-response rate was estimated at 10%, resulting in a final sample size requirement of 429 participants.
A stratified random sampling method was employed to select the study samples. The stratification was based on geographic regions and population distribution, with 13 disease surveillance points designated as independent strata. Within each surveillance point, residential communities were used as the basic sampling units, and three communities were randomly selected using simple random sampling. In each selected community, 11 individuals aged 65 to 74 years were randomly chosen, resulting in a total of 33 individuals per surveillance point.
During the survey, six individuals could not complete the study due to reasons such as declining to participate, or health issues. Ultimately, 423 participants were included in the analysis. This exceeded the minimum sample size requirement.
Variables Design
The variables in this study were defined based on the results of two components: questionnaire survey and oral health examination. The questionnaire survey was conducted face-to-face on-site by two trained professional surveyors. The survey primarily gathered information on demographic characteristics, oral health behaviors, and health conditions. Demographic variables obtained included age (years) and gender. Additionally, qualitative variables potentially associated with root caries were extracted, including education level (low, medium, high), history of systemic diseases (yes, no), presence of xerostomia (yes, no), presence of non-standard denture restoration, presence of unrestored tooth loss, use of fluoride toothpaste (yes, no), toothbrushing frequency (≤1/day, ≥2/day), and oral health knowledge level (low, medium, high).
The oral health examination was performed under artificial light using a combination of visual inspection and probing. The instruments included a plane mouth mirror and a CPI probe. The examination followed a systematic sequence, beginning with the second permanent molar in the upper right quadrant and progressing tooth by tooth to the second permanent molar in the lower left quadrant, covering a total of 28 teeth. Third molars were not included in the examination. Diagnostic criteria were based on the basic methods for oral health surveys (5th edition) published by the World Health Organization.18
The clinical examination provided quantitative variables potentially related to root caries, including periodontal pockets (probing depth ≥4 mm), attachment loss (attachment loss ≥4 mm, examined at six specified sites: 17/16, 11, 26/27, 36/37, 31, 46/47), DMFT of coronal caries (including teeth with coronal caries, filled teeth with caries, filled teeth without caries, and teeth missing due to caries), and DMFT of root caries (including teeth with root caries, filled root caries, filled root caries without active lesions, and residual roots).
Data Analysis
Cluster analysis in this study was conducted using the ClustMixType package (version 0.4–2) in R (version 4.4.2), employing the k-proto function for k-prototypes clustering.19 K-prototypes is an unsupervised machine learning method that extends the classical k-means clustering approach, enabling the analysis of mixed data containing both qualitative and quantitative variables. Its core principle integrates euclidean distance and variable mismatch dissimilarity to comprehensively evaluate the similarity between samples. By iterative optimization, each sample is assigned to the most suitable cluster, ultimately achieving a rational grouping of samples. To ensure the robustness of the clustering results, the Silhouette index was calculated to determine the optimal number of clusters. This index evaluates clustering quality by assessing the distance of a sample within its own cluster and relative to other clusters.20 The Silhouette index (S) ranges from −1 to 1, where values close to 1 indicate well-clustered samples, values near 0 suggest samples on cluster boundaries, and negative values imply potential misclassification. The optimal number of clusters (K) was determined by identifying the K value that maximized the average Silhouette index.
The consistency between clustering results (predicted labels) and root caries distribution (true labels) was validated using a confusion matrix,21 and the differences in composition between clustering results and root caries distribution were visualized with an alluvial diagram.22
Associated factors were analyzed using SPSS software (version 25.0), with clustering results as the dependent variable and the quantitative and qualitative variables used for clustering as independent variables. For univariate analysis, the Wilcoxon rank-sum test was employed for non-normally distributed quantitative variables, while the Chi-square test was used for qualitative variables. Variables with a p-value < 0.05 were included in a binary logistic regression model, and the predictive performance of the binary logistic regression model was assessed using ROC (receiver operating characteristic) curves.23 To enhance the robustness of the findings, a random forest model was constructed using R. The most significant associated factors were identified by comparing and overlapping results from the logistic regression and Random Forest models.24
Figure 1 summarizes the research workflow.
Results
Machine Learning
Clustering Characteristics
This study included a total of 423 adults. After determining the optimal number of clusters as 2 based on the Silhouette index, k-prototypes clustering analysis categorized the samples into cluster 1 and cluster 2. Univariate analyses of both quantitative and qualitative variables revealed statistically significant differences (P<0.05) between the clusters for all variables except for the presence of non-standard denture restoration and use of fluoride toothpaste (Table 1). Compared to cluster 2, cluster 1 was characterized by a younger age, fewer numbers of periodontal pockets (teeth), attachment loss (teeth), and DMFT of coronal caries. Cluster 1 also had higher proportions of males, individuals with higher educational level, no history of systemic diseases, no presence of xerostomia, no unrestored of tooth loss, a toothbrushing frequency of ≥2 times/day, and a higher level of oral health knowledge.
Table 1 Comparison of Demographic, Clinical, and Behavioral Characteristics Between Cluster 1 and Cluster 2
Clustering Model Evaluation
Based on the clustering results and the distribution of root caries, the confusion matrix (Figure 2A) showed that among 221 samples with root caries, 175 were correctly predicted, while among 202 samples without root caries, 168 were correctly predicted. The accuracy of classification was 0.81 (95% CI: 0.77 to 0.85). The sensitivity and specificity of the predictions was 0.79 and 0.83.
Figure 2 Evaluation of the clustering results and root caries distribution. (A) The confusion matrix illustrating the prediction accuracy of the clustering model. (B) The alluvial diagram demonstrating the correspondence between clustering results and the distribution of root caries. 1 in root caries group represents the absence of root caries, and 2 represents the presence of root caries.
An alluvial diagram (Figure 2B) illustrated the correspondence between clustering results and the distribution of root caries. Cluster 2 consisted of 83.7% individuals with root caries and 16.3% without root caries, identifying it as a high-risk group for root caries. In contrast, cluster 1 consisted of 21.5% individuals with root caries and 78.5% without root caries, indicating it as a low-risk group for root caries.
Associated Analysis
Logistic Model
A binary logistic regression model (Table 2) was applied to further analyze four quantitative variables and seven qualitative variables that demonstrated statistical significance (P < 0.05) in the univariate analysis. The results indicated that the risk of being categorized into cluster 2 increased with age, the number of periodontal pockets and attachment losses. Additionally, female, having a history of systemic diseases, the presence of xerostomia, and the presence of unrestored tooth loss were associated with a higher risk of being included in cluster 2 (OR > 1). Conversely, brushing teeth ≥2 times/day and having a high level of oral health knowledge were associated with a lower risk of being included in cluster 2 (OR < 1).
Table 2 Variables Associated with Clusters Identified by the Binary Logistic Regression Model
Logistic Model Evaluation
The ROC curve of the logistic regression model in Figure 3 showed an AUC (area under the curve) of 0.84 (95% CI: 0.80 to 0.88), indicating good predictive performance and reliable discrimination between samples.
Figure 3 ROC curve of the binary logistic regression model showing predictive performance.
The results of the random forest model in Figure 4 highlighted the most important associated factors, all of which were included among the variables with statistically significant differences in the binary logistic regression model. This consistency reinforces the robustness of the identified factors in predicting cluster membership.
Discussion
Unsupervised clustering has demonstrated robust capabilities in handling complex datasets and plays a crucial role in uncovering intricate relationships between factors.11 In this study, the use of k-prototype clustering reduced manual intervention through automated feature extraction, allowing researchers to more intuitively interpret the characteristics of different clusters. Additionally, the algorithm enhances robustness to outliers by assigning them to marginal or independent clusters. Compared to traditional multifactor logistic regression models, k-prototype clustering provides a novel approach for classification that avoids subjective assumptions. It addresses the limitations of logistic regression models stemming from linearity assumptions, offering more comprehensive and structured inputs that mitigate the negative impact of outliers on overall model performance and enhance the interpretability of predictive results. This analytical method shows considerable potential for application in risk prediction fields.
In this study, high-risk group for root caries among the older adults were effectively identified, and important associated factors were determined. The clustering analysis revealed significant differences across various indicators between high-risk individuals (cluster 2) and low-risk individuals (cluster 1). The characteristics of cluster 2 in this study were generally consistent with previous literature,6,25–27 demonstrating a demographic profile of older age, a higher proportion of females, and lower educational level. Additionally, cluster 2 exhibited poorer oral and systemic health status, suboptimal oral hygiene behaviors, and lower levels of oral health knowledge. In contrast, cluster 1 showed distinct advantages across these variables. By integrating the results from the random forest model and the multivariable logistic regression analysis, key factors associated with the high-risk group were identified, including age, gender, the number of periodontal pockets and attachment loss, a history of systemic diseases, presence of xerostomia, presence of unrestored tooth loss, toothbrushing frequency, and the level of oral health knowledge.
Several studies have demonstrated that increasing age is associated with a higher risk of root caries, which is in line with the findings of this current study.28–30 Additionally, multiple studies have reported that gender influences the prevalence of root caries, although there is controversy as to whether the effect is greater in males or females.31–34 In this study, females were more likely to be clustered into high-risk groups. Analysis of gender differences in oral health knowledge level revealed a significantly higher proportion of females with lower level, which may partly explain this classification disparity. However, this could also be attributed to gender differences in factors such as socioeconomic and educational levels, smoking habits, and access to healthcare services in different regions. Further investigation is warranted.
The increased risk of root caries in older adults, influenced by periodontal pockets and attachment loss, which contribute to gingival recession, root surface exposure, and an increase in pathogenic bacteria, has been extensively studied and validated over the years.5,35–37 In this study, the high-risk group also exhibited characteristics such as lower rates of tooth restoration, poorer oral health knowledge, and lower toothbrushing frequency. These factors may contribute to an increased periodontal burden on the remaining teeth,38 plaque retention,39 and other adverse effects on periodontal health. Additionally, some studies have explored the negative impact of coronal caries on root caries.34,40,41 In this study, the number of DMFT for coronal caries was significantly higher in cluster 2 compared to cluster 1, with a statistically significant difference. However, in the multivariable analysis, the contribution of this difference to identifying groups at varying levels of risk was not significant. Future research could be designed to investigate the relationship between coronal caries and root caries in greater detail.
The relationship between systemic health and oral health has garnered significant research attention in recent years.42–44 Systemic diseases, particularly diabetes, are among the most common conditions that may exacerbate the prevalence and progression of periodontitis, which negatively impact the development of root caries.37 Patients with systemic diseases often take multiple medications, which can lead to a reduction in salivary flow, resulting in xerostomia. This condition further impairs oral hygiene, promoting plaque accumulation and acid erosion, thereby increasing the risk of root caries.45 Consequently, oral health management for patients with systemic diseases is crucial. Strategies such as educating older adults, caregivers, and family members to improve oral hygiene practices,46 along with interventions like the use of artificial saliva, immunomodulators to alleviate xerostomia symptoms,47 can provide benefits in reducing the risk of root caries.
Overall, the findings of this study provide valuable guidance for public health initiatives and clinical practices targeting root caries. The results of the cluster analysis offer a basis for optimizing resource allocation, enabling public health services to more precisely reach high-risk groups, thus conserving public resources. For high-risk groups, personalized intervention strategies are recommended, such as regular periodontal treatment, oral health support and education. Additionally, the study highlights the interplay between systemic health and oral health, underscoring the need to incorporate oral health interventions into comprehensive health management for older adults.
The study has some limitations. First, the cross-sectional design only reveals associations between variables and cannot establish causal relationships. Second, the variables included in the cluster analysis were selected based on expert opinions and literature review, a process that may causes subjectivity. Third, the dichotomous decision-making (root caries: yes or no) did not account for the severity, number, or activity of carious lesions, which are critical clinical considerations that might have influenced the findings. Additionally, some data were derived from questionnaires, which could be influenced by subjective perceptions, recall bias, or social desirability effects, introducing potential bias. Lastly, as the study sample was drawn from a homogenous Chinese ethnic group, the generalizability of the findings to diverse racial and ethnic populations worldwide is limited.
In light of the findings and limitations, several recommendations could be proposed to guide future research. Longitudinal studies are essential to establish causal relationships between the identified risk factors and the development of root caries. Future investigations may benefit from incorporating more refined clinical indicators, such as the severity and activity of root carious lesions, and examining the potential influence of salivary biomarkers. Expanding the study population through multicenter collaborations will enable the inclusion of more diverse demographic groups and enhance the generalizability of the results. Furthermore, integrating unsupervised machine learning approaches with additional data sources, such as radiographic imaging and salivary diagnostics, holds promise for developing more clinically relevant and targeted strategies for root caries prevention.
Conclusion
The findings of this study indicate that unsupervised machine learning clustering can effectively identify populations at varying risk of root caries. Factors such as age, gender, periodontal health status, systemic health, xerostomia, dental restorations, oral hygiene behaviors, and oral health awareness play critical roles in increasing the risk of root caries and warrant targeted interventions. Moreover, further optimization of machine learning models and the expansion of datasets in future research may uncover additional potential factors and interaction mechanisms.
Data Sharing Statement
The data cannot be made publicly available because it contains the private personal data of participants.
Ethics Approval and Informed Consent
This study was approved by the Stomatological Ethics Committee of the Chinese Stomatological Association (Permit Number: 2014-003). All participants provided written informed consent prior to participation.
Author Contributions
All authors made significant contributions to the work reported, whether in the conception, study design, execution, data acquisition, analysis and interpretation, or in all these areas. They drafted or written, or substantially revised or critically reviewed the article. They agreed on the journal to which the article will be submitted. They reviewed and agreed on all versions of the article before submission, during revision, the final version accepted for publication, and any significant changes introduced at the proofing stage. They all agree to take responsibility and be accountable for the contents of the article.
Funding
This research was funded by Guangdong General Higher Education Key Area Special Project, Guangdong Provincial Department of Education (2024ZDZX2031) and Science research cultivation program of stomatological hospital, Southern medical university (PY2023021). Linxin Jiang received doctoral study support from the China Scholarship Council (CSC) (CSC No. 202408440099) at University of Leipzig.
Disclosure
The authors report no conflicts of interests in this work.
1. Hellyer P. Root caries. Br Dent J. 2021;231(1):32. doi:10.1038/s41415-021-3240-4
2. AlQranei MS, Balhaddad AA, Melo MAS. The burden of root caries: updated perspectives and advances on management strategies. Gerodontology. 2020;38(2):136–153. doi:10.1111/ger.12511
3. Chavda RM, Patel E, Shah S, Shah S, Sheth S, Munia V. A study to assess the unique oral health challenges faced by elderly individuals, including denture use, dry mouth, and periodontal diseases. Inter J Oral Care Res. 2023;11(44):–47 . doi:10.4103/INJO.INJO_20_23
4. Hopcraft MS, Morgan MV, Satur JG, Wright FAC, Darby IB. Oral hygiene and periodontal disease in Victorian nursing homes. Gerodontology. 2012;29(2):e220–228. doi:10.1111/j.1741-2358.2010.00448.x
5. Do T, Damé-Teixeira N, Naginyte M, Marsh PD. Root surface biofilms and caries. Monographs Oral Sci. 2017;26:26–34.
6. Zhang J, Leung KCM, Sardana D, Wong MCM, Lo EC-M. Risk predictors of dental root caries: a systematic review. Journal of Dentistry. 2019;89:103166. doi:10.1016/j.jdent.2019.07.004
7. Gil-Montoya JA, Sánchez-Lara I, Carnero Pardo C, et al. Oral hygiene in the elderly with different degrees of cognitive impairment and dementia. J Am Geriatr Soc. 2017;65(3):642–647. doi:10.1111/jgs.14697
8. Hayes M, Burke F, Allen PF. Incidence, prevalence and global distribution of root caries. Monogr Oral Sci. 2017;26:1–8.
9. Fee PA, Macey R, Walsh T, Clarkson JE, Ricketts D. Tests to detect and inform the diagnosis of root caries. Cochrane Database Syst Rev. 2020;12(12):Cd013806. doi:10.1002/14651858.CD013806
10. Spielman R, Ameh G, Brandes I, Berkowitz L, Elson N, Blum IR. Challenges in differential diagnosis and treatment of cervical root resorption vs. root caries. Primary Dental J. 2024;13(2):65–70. doi:10.1177/20501684241256468
11. Eckhardt CM, Madjarova SJ, Williams RJ, et al. Unsupervised machine learning methods and emerging applications in healthcare. Knee Surge Sports Traumatol Arthro. 2023;31(2):376–381. doi:10.1007/s00167-022-07233-7
12. Friedman J. The elements of statistical learning: data mining, inference, and prediction. (No Title). 2009.
13. Chen Z, Wang J, Wang H, et al. Machine learning reveals sex differences in clinical features of acute exacerbation of chronic obstructive pulmonary disease: a multicenter cross-sectional study. Front Med. 2023;10:1105854. doi:10.3389/fmed.2023.1105854
14. Hwang UJ, Kwon OY, Kim JH. Unsupervised machine learning for clustering forward head posture, protraction and retraction movement patterns based on craniocervical angle data in individuals with nonspecific neck pain. BMC Musculoskeletal Disor. 2024;25(1):376. doi:10.1186/s12891-024-07485-z
15. Simonsen J, Modin D, Skaarup K, et al. Utilizing echocardiography and unsupervised machine learning for heart failure risk identification. Int J Cardiol. 2025;418:132636. doi:10.1016/j.ijcard.2024.132636
16. Yamamoto T, Morooka H, Ito T, Ishigami M. Clustering using unsupervised machine learning to stratify the risk of immune-related liver injury. J Gastroenterol Hepatol. 2023;38(2):251–258. doi:10.1111/jgh.16038
17. Tian T, Yang Z, Li S, et al. Cross-sectional survey and analysis of factors influencing the prevalence of dental caries among older individuals aged 65-74 in Guangdong Province in 2021. BMC Oral Health. 2024;24(1):24. doi:10.1186/s12903-023-03776-4
18. Organization WH. Oral Health Surveys: Basic Methods. World Health Organization; 2013.
19. Huang Z. Extensions to the k-means algorithm for clustering large data sets with categorical values. Data min Knowl Discov. 1998;2(3):283–304. doi:10.1023/A:1009769707641
20. Mur A, Dormido R, Duro N, Dormido-Canto S, Vega J. Determination of the optimal number of clusters using a spectral clustering optimization. Expert Syst Appl. 2016;65:304–314. doi:10.1016/j.eswa.2016.08.059
21. Susmaga R. Confusion matrix visualization. In: Kłopotek MA, Wierzchoń ST, Trojanowski K, editors. Intelligent Information Processing and Web Mining. Berlin, Heidelberg: Springer; 2004:107–116.
22. Yen PT, Chien TW. Using the alluvial diagram to display variable characteristics for COVID-19 patients and research achievements on the topic of COVID-19, epidemiology, pathogenesis, and vaccine (CEPV. Bibliometric Analysis. 2023;102:e33873.
23. Grey DR, Morgan BJT. Some aspects of ROC curve-fitting: normal and logistic models. J math psychol. 1972;9(1):128–139. doi:10.1016/0022-2496(72)90009-0
24. Fawagreh K, Gaber MM, Elyan E. Random forests: from early developments to recent advancements. Syst Sci Control Eng. 2014;2(1):602–609. doi:10.1080/21642583.2014.956265
25. Tokumoto K, Kimura-Ono A, Mino T, et al. Risk factors for root caries annual incidence and progression among older people requiring nursing care: a one-year prospective cohort study. J Prostho Res. 2022;66(2):250–257. doi:10.2186/jpr.JPR_D_20_00272
26. Zhang J, Leung KCM, Chu CH, Lo ECM. Risk indicators for root caries in older adults using long-term social care facilities in Hong Kong. Commun Dentis Oral Epidemiol. 2020;48(1):14–20. doi:10.1111/cdoe.12495
27. Hayes M, Da Mata C, Cole M, McKenna G, Burke F, Allen PF. Risk indicators associated with root caries in independently living older adults. J Dent. 2016;51:8–14. doi:10.1016/j.jdent.2016.05.006
28. Souza JGS, Costa Oliveira BE, Sampaio AA, Lages VA, Romão DA, Martins A. Contextual and individual determinants of root caries in older people. Caries Res. 2018;52(3):253–261. doi:10.1159/000479800
29. Sen S, Kumar S, Chakraborty R, Srivastava S, Mishra G, Choudhary A. Prevalence and risk factors of root caries in the geriatric population in the rural sector. J Family Med Primary Care. 2020;9(2):771–776. doi:10.4103/jfmpc.jfmpc_1053_19
30. Chen W, Zhu T, Zhang D. The prevalence and common risk indicators of root caries and oral health service utilization pattern among adults, a cross-sectional study. PeerJ. 2023;11:e16458. doi:10.7717/peerj.16458
31. Badr F, Sabbah W. Inequalities in untreated root caries and affordability of dental services among older american adults. Int J Environ Res Public Health. 2020;18(1):17. doi:10.3390/ijerph18010017
32. Gao YB, Hu T, Zhou XD, et al. How root caries differs between middle-aged people and the elderly: findings from the 4th national oral health survey of China. Chin J Dental Res. 2018;21(3):221–229. doi:10.3290/j.cjdr.a41078
33. Mamai-Homata E, Topitsoglou V, Oulis C, Margaritis V, Polychronopoulou A. Risk indicators of coronal and root caries in Greek middle aged adults and senior citizens. BMC Public Health. 2012;12(1):484. doi:10.1186/1471-2458-12-484
34. Tepox-Puga SM, Rocha-Ortiz JA, Medina-Solís CE, Castrejón-Pérez RC, Sánchez-García S, Borges-Yáñez SA. Prevalence, and risk indicators of coronal and root caries in mexican older adults in nursing homes. Clin Cosmetic Invest Dentis. 2023;15:333–347. doi:10.2147/CCIDE.S439342
35. Li Y, Xiang Y, Ren H, et al. Association between periodontitis and dental caries: a systematic review and meta-analysis. Clin Oral Invest. 2024;28(6):306. doi:10.1007/s00784-024-05687-2
36. Vehkalahti M, Paunio I. Association between root caries occurrence and periodontal state. Caries Res. 1994;28(4):301–306. doi:10.1159/000261990
37. Gavriilidou NN, Belibasakis GN. Root caries: the intersection between periodontal disease and dental caries in the course of ageing. Br Dent J. 2019;227(12):1063–1067. doi:10.1038/s41415-019-0973-4
38. Guarnieri R, Ippoliti S. Restoration of periodontally compromised dentitions using telescopic full-arch retrievable prosthesis supported by tooth-implant combination: a long-term retrospective study. Inter J Periodon Restor Den. 2018;38(2):217–224. doi:10.11607/prd.3102
39. Ruiz Núñez MDR, da Luz Raulino M, Goulart Castro R, Schaefer Ferreira de Mello AL. Dental plaque control strategies for the elderly population: a scoping review. Inter J Dental Hygiene. 2022;20(1):167–181. doi:10.1111/idh.12497
40. Vehkalahti MM. Relationship between root caries and coronal decay. J Dent Res. 1987;66(10):1608–1610. doi:10.1177/00220345870660102001
41. Sugihara N, Maki Y, Kurokawa A, Matsukubo T. Cohort study on incidence of coronal and root caries in Japanese adults. The Bulletin of Tokyo Dental College. 2014;55(3):125–130. doi:10.2209/tdcpublication.55.125
42. Coll PP, Lindsay A, Meng J, et al. The prevention of infections in older adults: oral health. J Am Geriatr Soc. 2020;68(2):411–416. doi:10.1111/jgs.16154
43. Weintraub JA, Kaeberlein M, Perissinotto C, Atchison KA, Chen X. Geroscience. Aging Oral Health Res. 2023;31:2–15.
44. Kapila YL. Oral health’s inextricable connection to systemic health: special populations bring to bear multimodal relationships and factors connecting periodontal disease to systemic diseases and conditions. Periodontology 2000. 2021;87(1):11–16. doi:10.1111/prd.12398
45. Chan AKY, Tamrakar M, Jiang CM, Ecm L, Leung KCM, Chu CH. Common medical and dental problems of older adults: a narrative review. 2021;6.
46. Todayama N, Hara R, Tabata T, Hatanaka Y. Systemic and oral characteristics of convalescent inpatients requiring oral-health management by a dental specialist during hospitalization. Geriatrics. 2024;9(3). doi:10.3390/geriatrics9030082
47. Baer AN, Walitt B. Update on Sjögren syndrome and other causes of sicca in older adults. Rheum Dis Clin North Am. 2018;44(3):419–436. doi:10.1016/j.rdc.2018.03.002
Linxin Jiang,1,2 Shaohong Huang,1 Daniel R Reissmann,2 Gerhard Schmalz,3 Jianbo Li1
1Stomatological Hospital, School of Stomatology, Southern Medical University, Guangzhou, Guangdong, People’s Republic of China; 2Department of Prosthodontics, University of Leipzig, Leipzig, Germany; 3Department of Conservative Dentistry and Periodontology, Brandenburg Medical School Theodor Fontane (MHB), Brandenburg/Havel, Germany
Correspondence: Jianbo Li, Stomatological Hospital, School of Stomatology, Southern Medical University, 366 Jiangnan South Avenue, Haizhu District, Guangzhou, Guangdong province, 510280, People’s Republic of China, Email [email protected] Gerhard Schmalz, Email [email protected]
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2025. This work is licensed under https://creativecommons.org/licenses/by-nc/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
Purpose: This study aimed to identify the high-risk group for root caries using unsupervised machine learning and to explore the associated factors.
Patients and Methods: This cross-sectional study included 423 adults aged 65 to 74 years, surveyed in 2021. Clusters representing root caries risk were identified using k-prototypes clustering, with the optimal number of clusters determined by the maximum silhouette index. The confusion matrix and alluvial diagram were used to visualize the predictive accuracy and composition of the clustering results. Binary logistic regression models further analyzed the associated factors, while ROC (receiver operating characteristic) curves and the random forest model visualized the predictive performance and the most important associated factors.
Results: Two clusters were identified: cluster 1, with low root caries risk (21.5% with and 78.5% without root caries), and cluster 2, with high root caries risk (83.7% with and 16.3% without root caries). The clustering results predicted root caries with an accuracy of 0.81, sensitivity of 0.79, and specificity of 0.83. Overlapping results from binary logistic regression and the random forest model indicated that older age, more periodontal pockets, more attachment loss, female, a history of systemic diseases, presence of xerostomia, and presence of unrestored tooth loss were positively associated with cluster 2. Brushing tooth ≥ 2 times per day and a high level of oral health knowledge were negatively associated with cluster 2. The ROC curve for the binary logistic regression model showed an AUC (area under the curve) of 0.84.
Conclusion: Individuals who are older, female, with poorer oral and systemic health status, suboptimal oral hygiene behaviors, and lower oral health knowledge levels are more likely to be identified as high-risk group. The identified factors, revealed through unsupervised machine learning, can facilitate personalized prevention and management strategies for root caries in older adults.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer