Content area

Abstract

COVID-19 mortality is a complex phenomenon influenced by multiple factors. This study aimed to identify factors associated with death in COVID-19 patients by considering clinical, demographic, environmental, and socioeconomic conditions, using machine learning models and a national dataset from Mexico covering all pandemic waves. We integrated data from the national COVID-19 dataset, municipal-level socioeconomic indicators, and water quality contaminants (physicochemical and microbiological). Patients were assigned to one of four datasets (groundwater, lentic, lotic, and coastal) based on their municipality of residence. We trained XGBoost models to predict patient death or survival on balanced subsets of each dataset. Hyperparameters were optimized using a grid search and cross-validation, and feature importance was analyzed using SHAP values, point-biserial correlation, and XGBoost metrics. The models achieved strong predictive performance (F1 score > 0.97). Key risk factors included older age (≥50 years), pneumonia, intubation, obesity, diabetes, hypertension, and chronic kidney disease, while outpatient status, younger age (<40 years), contact with a confirmed case, and care in private medical units were associated with survival. Female sex showed a protective trend. Higher socioeconomic levels appeared protective, whereas lower levels increased risk. Water quality contaminants (e.g., manganese, hardness, fluoride, dissolved oxygen, fecal coliforms) ranked among the top 30 features, suggesting an association between environmental factors and COVID-19 mortality.

Details

1009240
Business indexing term
Location
Title
Factors Associated with COVID-19 Mortality in Mexico: A Machine Learning Approach Using Clinical, Socioeconomic, and Environmental Data
Author
Díaz-González Lorena 1   VIAFID ORCID Logo  ; Toribio-Colin, Yael Sharim 2 ; Pérez-Sansalvador, Julio César 3 ; Lakouari Noureddine 3   VIAFID ORCID Logo 

 Centro de Investigación en Ciencias, Universidad Autónoma del Estado de Morelos, Cuernavaca 62209, Morelos, Mexico 
 Licenciatura en Ciencias, Instituto de Investigación en Ciencias Básicas Aplicadas (IICBA), Universidad Autónoma del Estado de Morelos, Cuernavaca 62209, Morelos, Mexico; [email protected] 
 Department of Computer Science, Instituto Nacional de Astrofísica, Óptica y Electrónica, Luis Enrique Erro 1, Tonantzintla 72840, Puebla, Mexico, Secretaría de Ciencia, Humanidades, Tecnología e Innovación (SECIHTI), Insurgentes Sur 1582, Ciudad de Mexico 03940, Mexico 
Volume
7
Issue
2
First page
55
Number of pages
41
Publication year
2025
Publication date
2025
Publisher
MDPI AG
Place of publication
Basel
Country of publication
Switzerland
e-ISSN
25044990
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2025-06-15
Milestone dates
2025-04-26 (Received); 2025-06-12 (Accepted)
Publication history
 
 
   First posting date
15 Jun 2025
ProQuest document ID
3223924824
Document URL
https://www.proquest.com/scholarly-journals/factors-associated-with-covid-19-mortality-mexico/docview/3223924824/se-2?accountid=208611
Copyright
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-11-17
Database
2 databases
  • Coronavirus Research Database
  • ProQuest One Academic