Abstract

The primary goal of this study was to evaluate the major roles of health-related quality of life (HRQOL) in a 5-year lung cancer survival prediction model using machine learning techniques (MLTs). The predictive performances of the models were compared with data from 809 survivors who underwent lung cancer surgery. Each of the modeling technique was applied to two feature sets: feature set 1 included clinical and sociodemographic variables, and feature set 2 added HRQOL factors to the variables from feature set 1. One of each developed prediction model was trained with the decision tree (DT), logistic regression (LR), bagging, random forest (RF), and adaptive boosting (AdaBoost) methods, and then, the best algorithm for modeling was determined. The models’ performances were compared using fivefold cross-validation. For feature set 1, there were no significant differences in model accuracies (ranging from 0.647 to 0.713). Among the models in feature set 2, the AdaBoost and RF models outperformed the other prognostic models [area under the curve (AUC) = 0.850, 0.898, 0.981, 0.966, and 0.949 for the DT, LR, bagging, RF and AdaBoost models, respectively] in the test set. Overall, 5-year disease-free lung cancer survival prediction models with MLTs that included HRQOL as well as clinical variables improved predictive performance.

Details

Title
The major effects of health-related quality of life on 5-year survival prediction among lung cancer survivors: applications of machine learning
Author
Jin-ah, Sim 1 ; Kim Young Ae 2 ; Han, Kim Ju 3 ; Lee, Jong Mog 4 ; Kim Moon Soo 4 ; Shim Young Mog 5 ; Zo Jae Ill 5 ; Yun Young Ho 6 

 Seoul National University College of Medicine, Department of Biomedical Science, Seoul, Korea (GRID:grid.31501.36) (ISNI:0000 0004 0470 5905) 
 National Cancer Center, National Cancer Control Institute, Goyang, Korea (GRID:grid.410914.9) (ISNI:0000 0004 0628 9810) 
 Seoul National University College of Medicine, Department of Biomedical Informatics, Seoul, Korea (GRID:grid.31501.36) (ISNI:0000 0004 0470 5905) 
 National Cancer Center, Center for Lung Cancer, Goyang, Korea (GRID:grid.410914.9) (ISNI:0000 0004 0628 9810) 
 Samsung Comprehensive Cancer Center, Samsung Medical Center, Lung and Esophageal Cancer Center, Seoul, Korea (GRID:grid.414964.a) (ISNI:0000 0001 0640 5613) 
 Seoul National University College of Medicine, Department of Biomedical Science, Seoul, Korea (GRID:grid.31501.36) (ISNI:0000 0004 0470 5905); Seoul National University College of Medicine, Department of Biomedical Informatics, Seoul, Korea (GRID:grid.31501.36) (ISNI:0000 0004 0470 5905); Seoul National University College of Medicine, Department of Family Medicine, Seoul, Korea (GRID:grid.31501.36) (ISNI:0000 0004 0470 5905) 
Publication year
2020
Publication date
2020
Publisher
Nature Publishing Group
e-ISSN
20452322
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2419204525
Copyright
© The Author(s) 2020. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.