Content area

Abstract

Machine Learning (ML), a sub field of Artificial Intelligence (AL), has been used successfully in the healthcare domain for disease diagnosis. Thyroid disorders and diabetes are two of the most prevalent and interconnected chronic diseases, as both play critical roles in regulating various physiological processes in the body. This study aims to predict thyroid disorders in diabetes patients using six machine learning algorithms: Random Forest (RF), Decision Tree (DT), K-Nearest Neighbors (KNN), Logistic Regression (LR), Naive Bayes (NB), and Support Vector Machine (SVM). A locally sourced dataset comprising 44,539 instances of diabetic patients was utilized, undergoing preprocessing steps including data cleaning, encoding, and balancing. Two balancing techniques were employed: manual balancing andRandomUnderSampler. The dataset was partitioned into training and testing sets using a Stratified K-Fold cross-validation approach with 10 folds to ensure robust evaluation. Each algorithm's performance was assessed using metrics such as accuracy and Fl-score. Among the models, the RF algorithm outperformed the others, achieving the highest accuracy of 95% on the manually balanced dataset and 84% when the RandomUnderSampler technique was employed. Additionally, the Fl-scores for RF were 95% and 82%, respectively, indicating its robustness in handling imbalanced datasets. This study highlights the importance of selecting appropriate preprocessing techniques and machine learning methods for healthcare datasets. The findings can assist healthcare providers in making early diagnoses and interventions for thyroid disorders in diabetic patients, potentially improving their quality of life and overall healthcare outcomes.

Details

1009240
Business indexing term
Title
Comparison of Machine Learning Algorithms for Predicting Thyroid Disorders in Diabetic Patients
Author
Sayyid, Hiba O 1 ; Mahmood, Salma A 2 ; Hamadi, Saad S 3 

 Department of Computer Science, University of Basrah, College of Computer Sciences and Information Technology, Basrah, Iraq 
 Department of Intelligent Medical Systems, University of Basrah, College of Computer Sciences and Information Technology, Basrah, Iraq 
 Department of Internal Medicine, University of Basrah, College of Medicine, Basrah, Iraq 
Publication title
Informatica; Ljubljana
Volume
49
Issue
12
Pages
105-114
Publication year
2025
Publication date
Feb 2025
Publisher
Slovenian Society Informatika / Slovensko drustvo Informatika
Place of publication
Ljubljana
Country of publication
Slovenia
Publication subject
ISSN
03505596
e-ISSN
18543871
Source type
Scholarly Journal
Language of publication
English
Document type
Feature
ProQuest document ID
3188467301
Document URL
https://www.proquest.com/scholarly-journals/comparison-machine-learning-algorithms-predicting/docview/3188467301/se-2?accountid=208611
Copyright
© 2025. This work is published under https://creativecommons.org/licenses/by/3.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-07-22
Database
ProQuest One Academic