Full text

Turn on search term navigation

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Simple Summary

The study presents a thyroid disease prediction approach which utilizes random forest-based features to obtain high accuracy. The approach can obtain a 0.99 accuracy to predict ten thyroid diseases.

Abstract

Thyroid disease prediction has emerged as an important task recently. Despite existing approaches for its diagnosis, often the target is binary classification, the used datasets are small-sized and results are not validated either. Predominantly, existing approaches focus on model optimization and the feature engineering part is less investigated. To overcome these limitations, this study presents an approach that investigates feature engineering for machine learning and deep learning models. Forward feature selection, backward feature elimination, bidirectional feature elimination, and machine learning-based feature selection using extra tree classifiers are adopted. The proposed approach can predict Hashimoto’s thyroiditis (primary hypothyroid), binding protein (increased binding protein), autoimmune thyroiditis (compensated hypothyroid), and non-thyroidal syndrome (NTIS) (concurrent non-thyroidal illness). Extensive experiments show that the extra tree classifier-based selected feature yields the best results with 0.99 accuracy and an F1 score when used with the random forest classifier. Results suggest that the machine learning models are a better choice for thyroid disease detection regarding the provided accuracy and the computational complexity. K-fold cross-validation and performance comparison with existing studies corroborate the superior performance of the proposed approach.

Details

Title
Thyroid Disease Prediction Using Selective Features and Machine Learning Techniques
Author
Chaganti, Rajasekhar 1   VIAFID ORCID Logo  ; Furqan Rustam 2   VIAFID ORCID Logo  ; Isabel De La Torre Díez 3   VIAFID ORCID Logo  ; Vidal Mazón, Juan Luis 4   VIAFID ORCID Logo  ; Carmen Lili Rodríguez 5 ; Imran Ashraf 6   VIAFID ORCID Logo 

 Toyota Research Institute, Los Altos, CA 94022, USA 
 Department of Software Engineering, School of System Sciences, University of Management and Technology, Lahore 54770, Pakistan 
 Department of Signal Theory and Communications and Telematic Engineering, University of Valladolid, Paseo de Belén 15, 47011 Valladolid, Spain 
 Higher Polytechnic School, Universidad Europea del Atlántico, Parque Científico y Tecnológico de Cantabria, Isabel Torres 21, 39011 Santander, Spain; Project Department, Universidade Internacional do Cuanza, Cuito EN250, Bié, Angola; Department of Project Management, Universidad Internacional Iberoamericana, Arecibo, PR 00613, USA 
 Higher Polytechnic School, Universidad Europea del Atlántico, Parque Científico y Tecnológico de Cantabria, Isabel Torres 21, 39011 Santander, Spain; Department of Project Management, Universidad Internacional Iberoamericana, Campeche 24560, Mexico 
 Department of Information and Communication Engineering, Yeungnam University, Gyeongsan 38541, Korea 
First page
3914
Publication year
2022
Publication date
2022
Publisher
MDPI AG
e-ISSN
20726694
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2706125883
Copyright
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.