Content area

Abstract

Background:Cervical cancer has been ranked as the fourth most common cancer affecting women, contributing to approximately 660,000 new diagnoses and 350,000 fatalities worldwide. Effective early screening has been shown to reduce cervical cancer incidence by up to 80% and prevent more than 40% of new cases.

Objective:This study aims to assess a machine learning–based prediction model and identify the key predictors influencing cervical cancer screening uptake among women aged 30‐49 years in sub-Saharan Africa.

Methods:For this study, a weighted dataset of 33,952 individuals from the 2022 Demographic and Health Survey in Ghana, Kenya, Mozambique, and Tanzania was used. STATA version 17 (StataCorp) and Python 3.10 (Python Software Foundation) were used for data preprocessing and analysis. MinMax and standard scaler were applied for feature scaling, and recursive feature elimination was used for feature selection. An 80:20 ratio was applied for data splitting. Tomek links with random oversampling were used for handling class imbalance. A total of 7 models were selected and trained using both balanced and unbalanced datasets. Model evaluation was performed using area under the receiver operating characteristic curve, accuracy, and a confusion matrix.

Results:The proportion of cervical cancer screening in sub-Saharan Africa was 13%, which is lower than reported in previous studies. Random forest was the best-performing model, achieving an accuracy of 78%, an area under the curve of 86%, an F1-score of 79%, a recall of 81%, and a precision of 77%. The waterfall plot’s Shapley Additive Explanations analysis showed that wealth status, awareness of sexually transmitted infections, HIV testing exposure, age at first sexual intercourse, educational level, residency, smartphone ownership, having a single sexual partner, and previous health status were predictors of cervical cancer screening.

Conclusions:Improving education and awareness, expanding access to screening (especially in rural areas), leveraging both digital health and community-based outreach, integrating screening with other health services, and addressing socioeconomic barriers are recommended strategies to increase cervical cancer screening rates in sub-Saharan Africa.

Details

1009240
Title
Identifying Predictors of Cervical Cancer Screening Uptake in Sub-Saharan Africa Using Machine Learning: Cross-Sectional Study
Publication title
Volume
11
First page
e71677
Number of pages
15
Publication year
2025
Publication date
2025
Section
Public Health Informatics
Publisher
JMIR Publications
Place of publication
Toronto
Country of publication
Canada
Publication subject
e-ISSN
23692960
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2025-09-17
Milestone dates
2025-01-23 (Submitted); 2025-05-28 (Revised version received); 2025-06-06 (Accepted); 2025-09-17 (Published)
Publication history
 
 
   First posting date
17 Sep 2025
ProQuest document ID
3252497839
Document URL
https://www.proquest.com/scholarly-journals/identifying-predictors-cervical-cancer-screening/docview/3252497839/se-2?accountid=208611
Copyright
© 2025. This work is licensed under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-09-26
Database
ProQuest One Academic