Abstract

The Cox proportional hazards model is commonly used in evaluating risk factors in cancer survival data. The model assumes an additive, linear relationship between the risk factors and the log hazard. However, this assumption may be too simplistic. Further, failure to take time-varying covariates into account, if present, may lower prediction accuracy. In this retrospective, population-based, prognostic study of data from patients diagnosed with cancer from 2008 to 2015 in Ontario, Canada, we applied machine learning-based time-to-event prediction methods and compared their predictive performance in two sets of analyses: (1) yearly-cohort-based time-invariant and (2) fully time-varying covariates analysis. Machine learning-based methods—gradient boosting model (gbm), random survival forest (rsf), elastic net (enet), lasso and ridge—were compared to the traditional Cox proportional hazards (coxph) model and the prior study which used the yearly-cohort-based time-invariant analysis. Using Harrell’s C index as our primary measure, we found that using both machine learning techniques and incorporating time-dependent covariates can improve predictive performance. Gradient boosting machine showed the best performance on test data in both time-invariant and time-varying covariates analysis.

Details

Title
Comparing machine learning approaches to incorporate time-varying covariates in predicting cancer survival time
Author
Cygu, Steve 1 ; Seow, Hsien 2 ; Dushoff, Jonathan 3 ; Bolker, Benjamin M. 3 

 McMaster University, School of Computational Science and Engineering, Hamilton, Hamilton, Canada (GRID:grid.25073.33) (ISNI:0000 0004 1936 8227) 
 McMaster University, Department of Oncology, Hamilton, Hamilton, Canada (GRID:grid.25073.33) (ISNI:0000 0004 1936 8227) 
 McMaster University, School of Computational Science and Engineering, Hamilton, Hamilton, Canada (GRID:grid.25073.33) (ISNI:0000 0004 1936 8227); McMaster University, Department of Biology, Hamilton, Hamilton, Canada (GRID:grid.25073.33) (ISNI:0000 0004 1936 8227) 
Pages
1370
Publication year
2023
Publication date
2023
Publisher
Nature Publishing Group
e-ISSN
20452322
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2768984743
Copyright
© The Author(s) 2023. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.