Full Text

Turn on search term navigation

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Cardiovascular disease (CVD) is a significant global health concern and the leading cause of death in many countries. Early detection and diagnosis of CVD can significantly reduce the risk of complications and mortality. Machine learning methods, particularly classification algorithms, have demonstrated their potential to accurately predict the risk of cardiovascular disease (CVD) by analyzing patient data. This study evaluates seven binary classification algorithms, including Random Forests, Logistic Regression, Naive Bayes, K-Nearest Neighbors (kNN), Support Vector Machines, Gradient Boosting, and Artificial Neural Networks, to understand their effectiveness in predicting CVD. Advanced preprocessing techniques, such as SMOTE–ENN for addressing class imbalance and hyperparameter optimization through Grid Search Cross-Validation, were applied to enhance the reliability and performance of these models. Standard evaluation metrics, including accuracy, precision, recall, F1-score, and Area Under the Receiver Operating Characteristic Curve (ROC-AUC), were used to assess predictive capabilities. The results show that kNN achieved the highest accuracy (99%) and AUC (0.99), surpassing traditional models like Logistic Regression and Gradient Boosting. The study examines the challenges encountered when working with datasets related to cardiovascular diseases, such as class imbalance and feature selection. It demonstrates how addressing these issues enhances the reliability and applicability of predictive models. These findings emphasize the potential of kNN as a reliable tool for early CVD prediction, offering significant improvements over previous studies. This research highlights the value of advanced machine learning techniques in healthcare, addressing key challenges and laying a foundation for future studies aimed at improving predictive models for CVD prevention.

Details

Title
Evaluating Binary Classifiers for Cardiovascular Disease Prediction: Enhancing Early Diagnostic Capabilities
Author
Iacobescu, Paul 1   VIAFID ORCID Logo  ; Marina, Virginia 2   VIAFID ORCID Logo  ; Anghel, Catalin 1   VIAFID ORCID Logo  ; Aurelian-Dumitrache Anghele 3   VIAFID ORCID Logo 

 Department of Computer Science and Information Technology, “Dunărea de Jos” University of Galati, 800201 Galati, Romania; [email protected] (P.I.); [email protected] (C.A.) 
 Medical Department of Occupational Health, Faculty of Medicine and Pharmacy, “Dunărea de Jos” University of Galati, 800201 Galati, Romania 
 Doctoral School of “Dunărea de Jos” University of Galati,800201 Galati, Romania; [email protected] 
First page
396
Publication year
2024
Publication date
2024
Publisher
MDPI AG
e-ISSN
23083425
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3149635547
Copyright
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.