This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
1. Introduction
Early-stage lung cancer is usually asymptomatic. Hence, lung cancer is frequently diagnosed at a late stage [1, 2]. Non-small-cell lung cancer (NSCLC) is the most common histological subtype of lung cancer, with about 40% of cases harboring distant metastases at the first diagnosis [3]. Bone metastases (BM) occurs in 30-40% of NSCLC patients, which is one of the most frequent distant metastasis events [4]. It is known that distant metastases are the leading cause of cancer-related death [5, 6]. For NSCLC patients with BM, the reported median survival is less than 1 year in different populations [7]. Such poor prognosis highlights the significant demand for accurate tools for predicting the prognosis of NSCLC with BM.
A TNM staging system is a tool based on pathological anatomy, which can assist clinicians to develop effective treatment strategies and improve the patients’ prognosis [8]. However, the prognosis of patients with the same stage is notably different, indicating significant limitations of using the TNM staging system as the prognosis predicting model. More importantly, many other factors should be considered and involved in predicting the prognosis of patients [9, 10]. Survival prediction models designed for lung cancer with BM specifically have been reported previously [11–13]. However, the performance of these models is barely satisfactory as these models are based on the simple Cox regression model but not established as a survival prediction model for NSCLC with BM particularly. Given the impact of histological changes in prognostic determination, we propose narrowing down the scope of the study objects. For example, for patients with a certain histological type of NSCLC, it is necessary to improve the accuracy of the predictive model. Currently, artificial intelligence (AI) models based on machine learning (ML) algorithms are increasingly applied for clinical practice. Most models including random forest (RF), support vector machine (SVM), Bayesian network, and decision tree are created based on traditional ML algorithms [14]. Extreme gradient boosting (XGBoost) is a typical boosting algorithm designed to be highly efficient, flexible, and portable. Boosting is an ensemble technique with which new models can adjust the errors produced by existing models [15]. These advantages guarantee the high performance of XGBoost which provides satisfactory results in machine learning competitions and has been successfully used in other studies and domains [16].
Therefore, in the current study, we extracted the NSCLC patients with BM from the Surveillance, Epidemiology, and End Results (SEER) database and searched for an ideal AI model to predict the 1-year survival of NSCLC with BM by testing the XGBoost and other traditional algorithms.
2. Methods and Materials
2.1. Patients
All NSCLC patients with confirmed BM in the SEER database between 2010 and 2015 were selected for this study. The inclusion criteria were as follows: (a) patients diagnosed with lung cancer on histology, (b) the histologic type of NSCLC, and (c) patients with BM. The exclusion criteria were as follows: (a) lung cancer not the primary cancer, (b) patients without complete clinicopathological characteristics, demographic information, or follow-up information, and (c) follow-up
2.2. Data Collection
Based on the specific patient information available in the SEER database, we selected 19 variables that may affect the prognosis of NSCLC with BM, including age, sex, race, tumor size, tumor site, histological type, grade, laterality, surgery, chemotherapy, radiotherapy, TNM staging, distant metastasis sites (lung, brain, and liver), insurance status, and marital status.
The primary site is defined according to the International Classification of Diseases for Oncology (ICD-O) codes: main bronchus (C34.0), lobe (C34.1-C34.3), overlapping lesion of the lung (C34.8), and lung, if not otherwise specified (C34.9). The histological type is defined in accordance with the following ICD-O-3 codes: adenocarcinoma (8140, 8141, 8144, 8244, 8250–8255, 8260, 8290, 8310, 8323, 8333, 8470, 8480, 8481, 8490, 8507, 8550, 8551, 8570, 8571, 8574, and 8576), squamous cell carcinoma (8052, 8070-8076, 8078, 8083, 8084, and 8123), and other NSCLC (8004, 8012-8014, 8022, 8030, 8035, 8046, 8082, 8200, 8240, 8249, 8430, 8560, and 8562). Regarding marital status, we excluded the misleading data of unmarried or domestic partner, and then, “unmarried,” “separated,” “single,” and “widow” were all included in the unmarried group. The insurance status was divided into insured and uninsured; “any Medicaid,” “insured,” and “insured/no specific” were included in the insured group. All cases in the present study were staged using the 7th edition of the AJCC TNM staging system.
2.3. Prognostic Nomogram
The variables that might be related to prognosis were analyzed by the univariate analysis. Then, variables with
2.4. Construction of the XGBoost Model
Before the sample feature data is put into the model for classification, the data was preprocessed first. In the dataset used in the study, age and size are continuous variables, and the rest is classified. For continuous, we adopted standardization for age and size to speed up the training. The formula is as follows:
To calculate the distance accurately in some machine learning models, we used one-hot encoding for multiclassification variables. The tree-based model has an excellent performance to calculate the importance of features. XGBoost was used to rank feature importance, and eventually, significant variables were included in our model building. After variable selection, there were 17 feature variables left. We also used XGBoost, an ensemble machine learning method predicting the residuals of prior models, and then combined together to make the final prediction. XGBoost uses second-order Taylor series to estimate the value of the loss function and further reduces the likelihood of overfitting by application of regularization. The objective function is as follows:
2.5. Model Evaluation
We have also established three other prediction models based on RF, SVN, and logistic algorithms, respectively. To evaluate the performance of each prediction model, receiver operating characteristic (ROC) curves were used to quantify and compare the predictive performance of the XGBoost model and other prediction models.
3. Results
3.1. Features of Patients
According to the inclusion and exclusion criteria, 5973 NSCLC patients with BM were selected from the SEER database, and an additional 114 NSCLC patients with BM were identified from the AHOCMU for this study. In addition, 4183 patients were enrolled in the training set; the rest 1790 patients were included in the validation set. Patient demographic and clinicopathologic features are presented in Table 1. Briefly, 4657 patients (78.0%) were white and 740 (12.4%) were black. Male (57.4%) had a slight predominance over female (42.6%). Regarding the tumor characteristics, 90.7% were located in the lung lobe; adenocarcinoma (64.5%) accounted for the majority, most of which were moderately or poorly differentiated. For therapy, 231 (3.9%) of the patients received surgery, 3853 (64.5%) received chemotherapy, and 3525 (59.0%) underwent radiotherapy. Lung metastases (28.6%) were more common than liver metastases (20.4%) and brain metastases (23.1%).
Table 1
Demographic and clinicopathologic features of 5973 NSCLC patients with BM in the SEER database.
Variables | Training set | Validation set | ||
Age ( | 0.336 | 0.737 | ||
Size ( | 0.423 | 0.673 | ||
Race | 1.191 | 0.551 | ||
Black | 519 (12.4%) | 221 (12.3%) | ||
Other | 392 (9.4%) | 184 (10.3%) | ||
White | 3272 (78.2%) | 1385 (77.4%) | ||
Sex | 0.146 | 0.702 | ||
Female | 1789 (42.8%) | 756 (42.2%) | ||
Male | 2394 (57.2%) | 1034 (57.8%) | ||
Primary site | 0.350 | 0.950 | ||
Main bronchus | 173 (4.1%) | 79 (4.4%) | ||
Overlapping lesion of lung | 31 (0.7%) | 12 (0.7%) | ||
Lung, NOS | 182 (4.4%) | 76 (4.2%) | ||
Lobe | 3797 (90.8%) | 1623 (90.7%) | ||
Histologic type | 6.010 | 0.050 | ||
ADC | 2688 (64.3%) | 1163 (65.0%) | ||
Others | 558 (13.3%) | 269 (15.0%) | ||
SCC | 937 (22.4%) | 358 (20.0%) | ||
Grade | 2.367 | 0.500 | ||
I | 217 (5.2%) | 86 (4.8%) | ||
II | 1259 (30.1%) | 536 (29.9%) | ||
III | 2596 (62.1%) | 1131 (63.2%) | ||
IV | 111 (2.7%) | 37 (2.1%) | ||
Laterality | 0.277 | 0.599 | ||
Left—origin of primary | 1750 (41.8%) | 762 (42.6%) | ||
Right—origin of primary | 2433 (58.2%) | 1028 (57.4%) | ||
T stage | 5.837 | 0.120 | ||
T1 | 446 (10.7%) | 184 (10.3%) | ||
T2 | 1168 (27.9%) | 555 (31.0%) | ||
T3 | 1139 (27.2%) | 469 (26.2%) | ||
T4 | 1430 (34.2%) | 582 (32.5%) | ||
N stage | 3.715 | 0.294 | ||
N0 | 887 (21.2%) | 343 (19.2%) | ||
N1 | 373 (8.9%) | 170 (9.5%) | ||
N2 | 2039 (48.7%) | 902 (50.4%) | ||
N3 | 884 (21.1%) | 375 (20.9%) | ||
M stage | 0.916 | 0.339 | ||
M1a | 110 (2.6%) | 55 (3.1%) | ||
M1b | 4073 (97.4%) | 1735 (96.9%) | ||
Radiotherapy | 1.270 | 0.260 | ||
No | 1734 (41.5%) | 714 (39.9%) | ||
Yes | 2449 (58.5%) | 1076 (60.1%) | ||
Chemotherapy | 0.099 | 0.753 | ||
No | 1490 (35.6%) | 630 (35.2%) | ||
Yes | 2693 (64.4%) | 1160 (64.8%) | ||
Surgery | 1.452 | 0.228 | ||
No | 4013 (95.9%) | 1729 (96.6%) | ||
Yes | 170 (4.1%) | 61 (3.4%) | ||
Brain metastasis | 2.261 | 0.133 | ||
No | 3239 (77.4%) | 1354 (75.6%) | ||
Yes | 944 (22.6%) | 436 (24.4%) | ||
Liver metastasis | 0.002 | 0.960 | ||
No | 3330 (79.6%) | 1426 (79.7%) | ||
Yes | 853 (20.4%) | 364 (20.3%) | ||
Lung metastasis | 0.576 | 0.448 | ||
No | 2999 (71.7%) | 1266 (70.7%) | ||
Yes | 1184 (28.3%) | 524 (29.3%) | ||
Insurance status | 1.036 | 0.309 | ||
Insured | 4059 (97.0%) | 1728 (96.5%) | ||
Uninsured | 124 (3.0%) | 62 (3.5%) | ||
Marital status | 0.099 | 0.753 | ||
Married | 2412 (57.7%) | 1040 (58.1%) | ||
Unmarried | 1771 (42.3%) | 750 (41.9%) |
NSCLC: non-small-cell lung cancer; BM: bone metastasis; ADC: adenocarcinoma; SCC: squamous cell carcinoma.
3.2. Prognostic Nomogram for 1-Year Survival
The univariate analysis is presented in Table 2. The results of the multivariate logistic analysis indicated that tumor size, age, race, sex, histological type, grade, N stage, surgery, chemotherapy, and liver metastases were OS-related prognostic factors (Table 2). Next, these prognostic factors were integrated to build a nomogram for predicting the prognosis of NSCLC with BM (Figure 1). As shown in Figure 1, tumor size is the most important prognostic factor followed by chemotherapy, age, race, grade, surgery, and liver metastases, which affected the prognosis moderately, while N stage, histologic type, and sex had little effect on prognosis. Furthermore, each prognostic factor was given a corresponding score for the nomogram. The total score was obtained by summing the scores of each relevant factor, and we used the total score to draw a vertical line to obtain the individual probability of NSCLC with BM survival.
Table 2
Univariate analysis and multivariate logistic analysis based on all variables for 1-year survival (training cohort).
Characteristics | Univariate analysis | Multivariate logistic analysis | |
HR (95% CI) | |||
Age | <0.001 | 1.015 (1.008–1.022) | <0.001 |
Size | <0.001 | 1.008 (1.004–1.011) | <0.001 |
Race | |||
Black | <0.001 | Reference | |
Other | 0.423 (0.311–0.575) | <0.001 | |
White | 1.073 (0.851–1.352) | 0.552 | |
Sex | |||
Female | <0.001 | Reference | |
Male | 1.265 (1.088–1.470) | <0.05 | |
Primary site | |||
Main bronchus | <0.05 | ||
Overlapping lesion of lung | |||
Lung, NOS | |||
Lobe | |||
Histologic type | |||
ADC | <0.001 | Reference | |
Others | 1.746 (1.351–2.255) | <0.001 | |
SCC | 1.524 (1.246–1.865) | <0.001 | |
Grade | |||
I | <0.001 | Reference | |
II | 1.037 (0.751–1.433) | 0.824 | |
III | 1.653 (1.206-2.266) | <0.05 | |
IV | 2.454 (1.292-4.659) | <0.05 | |
Laterality | |||
Left—origin of primary | 0.752 | ||
Right—origin of primary | |||
T stage | |||
T1 | <0.001 | ||
T2 | |||
T3 | |||
T4 | |||
N stage | |||
N0 | <0.001 | Reference | |
N1 | 1.198 (0.901-1.592) | 0.215 | |
N2 | 1.636 (1.351-1.982) | <0.001 | |
N3 | 1.816 (1.443-2.284) | <0.001 | |
M stage | |||
M1a | 0.791 | ||
M1b | |||
Radiotherapy | |||
No | 0.162 | ||
Yes | |||
Surgery | |||
No | <0.001 | Reference | |
Yes | 0.438 (0.311-0.617) | <0.001 | |
Chemotherapy | |||
No | <0.001 | Reference | |
Yes | 0.211 (0.174 -0.256) | <0.001 | |
Brain metastasis | |||
No | 0.866 | ||
Yes | |||
Liver metastasis | |||
No | <0.001 | Reference | |
Yes | 1.948 (1.589-2.388) | <0.001 | |
Lung metastasis | |||
No | 0.949 | ||
Yes | |||
Insurance status | |||
Insured | 0.732 | ||
Uninsured | |||
Marital status | |||
Married | <0.001 | ||
Unmarried |
ADC: adenocarcinoma; SCC: squamous cell carcinoma.
[figure omitted; refer to PDF]3.3. Establishment of the XGBoost Model
Correlated features are redundant and may decrease the performance of ML algorithms. The correlations between the features are depicted in Figure 2. Thus, it was necessary to perform the feature reduction. The nineteen features were ranked using the XGBoost Classifier based on feature importance. The ranking is shown in Figure 3. A cut-off point was determined to select the top-ranked features for the best trade-off between model performance and simplicity, according to the accuracy of the model when using different thresholds. After this selection, the M stage and insurance status were removed, and 17 features were fitted into our model.
[figure omitted; refer to PDF][figure omitted; refer to PDF]After grid search, the parameters of the best model were determined (
[figures omitted; refer to PDF]
3.4. Validation of Predictive Accuracy of the XGBoost Model
We depicted the ROC curves for the XGBoost model and the single prognostic factor in training and validation sets, respectively. As shown in Figure 5, the AUC of the XGBoost model was significantly bigger than the single prognostic factor, indicating a much higher prediction accuracy of the XGBoost model. At the same time, the XGBoost model had an AUC of 0.764 in the external validation set, demonstrating a better discriminative ability (Figure 6).
[figures omitted; refer to PDF]
[figure omitted; refer to PDF]3.5. Comparison of Predictive Accuracy between Various Prediction Models
In order to assess the advantage of the prediction model generated by the XGBoost algorithm, we also compared it with other models. The training and validation sets of each model were depicted with ROC curves, and the corresponding AUCs were calculated. In the training set, the accuracy of the XGBoost model for predicting survival (
4. Discussion
Although NSCLC with BM may obtain longer survival than before with advancement of various treatment methods and drugs, accurate prediction of survival for NSCLC with BM remains to be necessary and a challenge for clinicians. This study established and validated the XGBoost model as the most appropriate model for predicting 1-year survival of NSCLC with BM. In essence, the XGBoost model achieved an AUC of 0.792, 0.786, and 0.764 in the training, internal validation, and external validation sets, respectively. Compared with other models, it showed better reliability and accuracy (Figure 4), which could be utilized to predict the 1-year mortality of NSCLC with BM, thus facilitating a reasonable individualized drug treatment program determination.
To our knowledge, this study is the first research to establish a prognostic model for NSCLC with BM by using AI-based models on large-scale populations. The major differences between our study and the others could be summarized as follows. First, we scaled down the research objects to NSCLC instead of the entire lung cancer patient population. It was important as the histological subtype affected the prognosis dramatically, which was in line as previously reported [17–19]. Second, our study only included patients with BM, but not with other metastases, as the prognosis of patients with different metastatic sites was quite different [20, 21]. Therefore, the accuracy of using a prognostic model based on patients with any metastatic NSCLC to predict the prognosis of NSCLC with BM was questionable. More importantly, most of those previous predictive models were based on the Cox regression model or logistic regression. Logistic regression and Cox regression are regular algorithms that can be replaced by more sophisticated algorithms. For instance, XGBoost has excellent performance for processing large-scale and high-dimensional data [22]. Taken together, after defining NSCLC patients with BM, we then constructed and validated a prediction model based on the XGBoost algorithm, avoiding the shortcomings of the other models, and achieved the best performance in prediction among all models (Figure 2).
According to our study, tumor size is a significant factor affecting the patient prognosis, which has not been recognized previously [11–13]. T stage roughly classifies tumor size or depth of invasion, but it cannot reflect the specific character of NSCLC patients with BM and accurately predict the prognosis, because the tumor size of the same period varies greatly [23–25]. Age, tumor size, race, sex, histological types, grade, T stage, N stage, surgery, chemotherapy, liver metastases, and radiotherapy were related to prognosis, which was similar to the previous research [26–30]. The presence of liver metastases significantly decreased survival in lung cancer patients with BM [17]. Firstly, it may be related to the liver being an immunosuppressive organ, thus hindering the immune surveillance of the growing metastases of the liver; secondly, the worse response to chemotherapy caused by metastatic liver cancer leads to a worse prognosis [31]. Undifferentiated, late-stage patients did show worse prognosis as expected, which is consistent with general cognition. We also find that surgery and chemotherapy are generally beneficial for patients. However, due to the lack of specific surgical procedures, chemotherapy-specific drugs, and specific programs, we were unable to further explore the relationship between treatment methods and prognosis in more depth. Of note, molecular targeted therapy and immunotherapy may provide new options in addition to traditional surgery, radiotherapy, and chemotherapy; whether these new treatments affect NSCLC with BM requires further investigation although some recent studies show that epidermal growth factor receptor-targeted drugs can improve prognosis in lung cancer [31, 32].
Although deep learning is accessible in the academics and industry, the boosting algorithm based on a tree model still plays a significant role in some subjects, showing dominance in structured information. The boosting algorithm has been proved to be effective in the predictive model of classification and regression tasks in practice. Traditional regression model and machine learning methods, such as Cox regression and SVM, are limited in terms of learning capacities, which need many artificial feature engineering. The tree-based model is a nonlinear model, which has the advantages of a natural feature combination and strong feature expressive capacity [33]. Tree-based classifiers, including RF and XGBoost, based on homogeneity, fit the characteristics of the data set for the present study. We speculated that the application of regularization, using Taylor expansion to estimate the loss function, and high flexibility to allow for fine-tuning might enable XGBoost to perform better than RF [34]. Taken together, our findings suggested that the XGBoost approach can reflect the feature importance and set up a mortality prediction model with extreme accuracy. Furthermore, this approach has extreme potential for practical implementation because it can be incorporated into existing healthcare information systems.
Our research had some advantages. First, the SEER database provided complete follow-up information of patients covering a large scope. Second, our AI model could provide personalized survival prediction for patients, thereby improving personalized treatment. Finally, our AI model can be used to predict the survival of other NSCLC patients with BM, as all the information used to predict survival is easily accessible, and our model can be performed as software-based or web-based tool optimization. However, this study has certain limitations. First, this study was a retrospective study while prospective randomized clinical trials are needed to provide high-level evidence for clinical application. Second, we could not obtain specific information about the treatment, such as chemotherapy drugs and protocols, radiation doses, and specific surgical procedures.
5. Conclusion
We used the XGBoost algorithm to build an AI model that predicts the 1-year survival of NSCLC with BM. The XGBoost model has higher accuracy and better performance than models generated from other algorithms. Furthermore, the XGBoost model can be integrated into existing healthcare information systems, so we propose that the XGBoost model could be used as a practical clinical prediction model to help clinicians develop better and more reasonable treatment programs.
Authors’ Contributions
Authors CLZ, ZHH, and CH designed the study. Author ZHH wrote the manuscript. Authors CH, CXC, and ZJ collected the data. Authors ZJ and YXT conducted the statistical analysis. Authors CLZ and ZHH revised the manuscript. All authors critically read the manuscript to improve intellectual content. All authors read and approved the final manuscript. Zhangheng Huang, Chuan Hu, and Changxing Chi contributed equally to this work.
Acknowledgments
We are thankful for the contribution of the SEER database and the 18 registries supplying cancer research information and thank all colleagues involved in the study for their contributions.
Glossary
Abbreviations
NSCLC:Non-small-cell lung cancer
BM:Bone metastases
AI:Artificial intelligence
ML:Machine learning
RF:Random forest
SVM:Support vector machine
XGBoost:Extreme gradient boosting
SEER:Surveillance, Epidemiology, and End Results
ROC:Receiver operating characteristic
AUC:Area under the curve.
[1] N. K. Altorki, G. J. Markowitz, D. Gao, J. L. Port, A. Saxena, B. Stiles, T. McGraw, V. Mittal, "The lung microenvironment: an important regulator of tumour growth and metastasis," Nature Reviews. Cancer, vol. 19 no. 1,DOI: 10.1038/s41568-018-0081-9, 2019.
[2] M. Esposito, Y. Kang, "Targeting tumor-stromal interactions in bone metastasis," Pharmacology & Therapeutics, vol. 141 no. 2, pp. 222-233, DOI: 10.1016/j.pharmthera.2013.10.006, 2014.
[3] D. Morgensztern, S. H. Ng, F. Gao, R. Govindan, "Trends in stage distribution for patients with non-small cell lung cancer: a National Cancer Database survey," Journal of thoracic oncology : official publication of the International Association for the Study of Lung Cancer, vol. 5 no. 1, pp. 29-33, DOI: 10.1097/JTO.0b013e3181c5920c, 2010.
[4] N. LeVasseur, M. Clemons, B. Hutton, R. Shorr, C. Jacobs, "Bone-targeted therapy use in patients with bone metastases from lung cancer: a systematic review of randomized controlled trials," Cancer Treatment Reviews, vol. 50, pp. 183-193, DOI: 10.1016/j.ctrv.2016.09.013, 2016.
[5] L. L. Chen, N. Blumm, N. A. Christakis, A. L. Barabási, T. S. Deisboeck, "Cancer metastasis networks and the prediction of progression patterns," British Journal of Cancer, vol. 101 no. 5, pp. 749-758, DOI: 10.1038/sj.bjc.6605214, 2009.
[6] R. R. Langley, I. J. Fidler, "The seed and soil hypothesis revisited--the role of tumor-stroma interactions in metastasis to different organs," International Journal of Cancer, vol. 128 no. 11, pp. 2527-2535, DOI: 10.1002/ijc.26031, 2011.
[7] C. Decroisette, I. Monnet, H. Berard, G. Quere, H. le Caer, S. Bota, C. Audigier-Valette, L. Geriniere, J. M. Vernejoux, C. Chouaid, Groupe Français de Pneumo-Cancérologie 0601 Team, "Epidemiology and treatment costs of bone metastases from lung cancer: a French prospective, observational, multicenter study (GFPC 0601)," Journal of thoracic oncology, vol. 6 no. 3, pp. 576-582, DOI: 10.1097/JTO.0b013e318206a1e3, 2011.
[8] O. Lababede, M. A. Meziane, "The eighth edition of TNM staging of lung cancer: reference chart and diagrams," The Oncologist, vol. 23 no. 7, pp. 844-848, DOI: 10.1634/theoncologist.2017-0659, 2018.
[9] K. Chansky, J. P. Sculier, J. J. Crowley, D. Giroux, J. van Meerbeeck, P. Goldstraw, International Staging Committee and Participating Institutions, "The International Association for the Study of Lung Cancer Staging Project: prognostic factors and pathologic TNM stage in surgically managed non-small cell lung cancer," Journal of thoracic oncology : official publication of the International Association for the Study of Lung Cancer, vol. 4 no. 7, pp. 792-801, DOI: 10.1097/JTO.0b013e3181a7716e, 2009.
[10] J. P. Sculier, K. Chansky, J. J. Crowley, J. van Meerbeeck, P. Goldstraw, International Staging Committee and Participating Institutions, "The impact of additional prognostic factors on survival and their relationship with the anatomical extent of disease expressed by the 6th edition of the TNM classification of malignant tumors and the proposals for the 7th edition," Journal of thoracic oncology : official publication of the International Association for the Study of Lung Cancer, vol. 3 no. 5, pp. 457-466, DOI: 10.1097/JTO.0b013e31816de2b8, 2008.
[11] J. Deng, Z. Ren, J. Wen, B. Wang, X. Hou, Z. Xue, X. Chu, "Construction of a nomogram predicting the overall survival of patients with distantly metastatic non-small-cell lung cancer," Cancer Management and Research, vol. Volume 10, pp. 6143-6156, DOI: 10.2147/CMAR.S183878, 2018.
[12] Y. Wang, Z. Pang, X. Chen, F. Bie, Y. Wang, G. Wang, Q. Liu, J. du, "Survival nomogram for patients with initially diagnosed metastatic non-small-cell lung cancer: a SEER-based study," Future Oncology, vol. 15 no. 29, pp. 3395-3409, DOI: 10.2217/fon-2019-0007, 2019.
[13] X. Q. Zheng, J. F. Huang, J. L. Lin, L. Chen, T. T. Zhou, D. Chen, D. D. Lin, J. F. Shen, A. M. Wu, "Incidence, prognostic factors, and a nomogram of lung cancer with bone metastasis at initial diagnosis: a population-based study," Translational lung cancer research, vol. 8 no. 4, pp. 367-379, DOI: 10.21037/tlcr.2019.08.16, 2019.
[14] M. A. Kazem, "Predictive models in cancer management: a guide for clinicians," The Surgeon, vol. 15 no. 2, pp. 93-97, DOI: 10.1016/j.surge.2016.06.002, 2017.
[15] A. A. Ogunleye, W. Qing-Guo, "XGBoost model for chronic kidney disease diagnosis," IEEE/ACM trans,actions on computational biology and bioinformatics,DOI: 10.1109/tcbb.2019.2911071, 2019.
[16] D. Tamayo, A. Silburt, D. Valencia, K. Menou, M. Ali-Dib, C. Petrovich, C. X. Huang, H. Rein, C. . Laerhoven, A. Paradise, A. Obertas, N. Murray, "A machine learns to predict the stability of tightly packed planetary systems," The Astrophysical Journal, vol. 832 no. 2,DOI: 10.3847/2041-8205/832/2/l22, 2016.
[17] H. Sugiura, K. Yamada, T. Sugiura, T. Hida, T. Mitsudomi, "Predictors of survival in patients with bone metastasis of lung cancer," Clinical Orthopaedics and Related Research, vol. 466 no. 3, pp. 729-736, DOI: 10.1007/s11999-007-0051-0, 2008.
[18] L. Zhang, Z. Gong, "Clinical characteristics and prognostic factors in bone metastases from lung cancer," Medical science monitor : international medical journal of experimental and clinical research, vol. 23, pp. 4087-4094, DOI: 10.12659/MSM.902971, 2017.
[19] T. Fukui, T. Taniguchi, K. Kawaguchi, K. Fukumoto, S. Nakamura, Y. Sakao, K. Yokoi, "Comparisons of the clinicopathological features and survival outcomes between lung cancer patients with adenocarcinoma and squamous cell carcinoma," General Thoracic and Cardiovascular Surgery, vol. 63 no. 9, pp. 507-513, DOI: 10.1007/s11748-015-0564-5, 2015.
[20] M. Riihimäki, A. Hemminki, M. Fallah, H. Thomsen, K. Sundquist, J. Sundquist, K. Hemminki, "Metastatic sites and survival in lung cancer," Lung cancer, vol. 86 no. 1, pp. 78-84, DOI: 10.1016/j.lungcan.2014.07.020, 2014.
[21] V. P. Balachandran, M. Gonen, J. J. Smith, R. P. DeMatteo, "Nomograms in oncology: more than meets the eye," The Lancet. Oncology, vol. 16 no. 4, pp. e173-e180, DOI: 10.1016/S1470-2045(14)71116-7, 2015.
[22] L. Torlay, M. Perrone-Bertolotti, E. Thomas, M. Baciu, "Machine learning-XGBoost analysis of language networks to classify patients with epilepsy," Brain informatics, vol. 4 no. 3, pp. 159-169, DOI: 10.1007/s40708-017-0065-7, 2017.
[23] H. Asamura, T. Goya, Y. Koshiishi, Y. Sohara, K. Eguchi, M. Mori, Y. Nakanishi, R. Tsuchiya, K. Shimokata, H. Inoue, T. Nukiwa, E. Miyaoka, "A Japanese Lung Cancer Registry study: prognosis of 13,010 resected lung cancers," Journal of thoracic oncology : official publication of the International Association for the Study of Lung Cancer, vol. 3 no. 1, pp. 46-52, DOI: 10.1097/JTO.0b013e31815e8577, 2008.
[24] B. C. Cho, T. D. Pas, H. Kalofonos, Q. Wang, R. Ramlau, Y. Cheng, F. Vitiello, T. Laisaar, E. Vallières, B. Kubisa, S. Orlov, K. Park, C. Debruyne, J. Vansteenkiste, "Prognostic factors in early-stage NSCLC: analysis of the placebo group in the MAGRIT study," Anticancer Research, vol. 39 no. 3, pp. 1403-1409, DOI: 10.21873/anticanres.13255, 2019.
[25] H. Li, Z. Sun, F. Yang, X. Sui, T. Liu, J. Wang, "Primary tumour resection in non-small-cell lung cancer patients with ipsilateral pleural dissemination (M1a): a population-based study," European journal of cardio-thoracic surgery : official journal of the European Association for Cardio-thoracic Surgery, vol. 55 no. 6, pp. 1121-1129, DOI: 10.1093/ejcts/ezy439, 2019.
[26] W. Liu, J. Wu, "Lung cancer with bone metastases in the United States: an analysis from the Surveillance, Epidemiologic, and End Results database," Clinical & Experimental Metastasis, vol. 35 no. 8, pp. 753-761, DOI: 10.1007/s10585-018-9943-5, 2018.
[27] B. Wang, L. Chen, C. Huang, J. Lin, X. Pan, Z. Shao, S. Hu, X. Zhang, X. Wang, "The homogeneous and heterogeneous risk factors for occurrence and prognosis in lung cancer patients with bone metastasis," Journal of bone oncology, vol. 17,DOI: 10.1016/j.jbo.2019.100251, 2019.
[28] Q. Song, J. Shang, C. Zhang, L. Zhang, X. Wu, "Impact of the homogeneous and heterogeneous risk factors on the incidence and survival outcome of bone metastasis in NSCLC patients," Journal of Cancer Research and Clinical Oncology, vol. 145 no. 3, pp. 737-746, DOI: 10.1007/s00432-018-02826-7, 2019.
[29] N. Sathiakumar, E. Delzell, M. A. Morrisey, C. Falkson, M. Yong, M. L. Kilgore, V. Chia, J. Blackburn, T. Arora, "Mortality following bone metastasis and skeletal-related events among patients 65 years and above with lung cancer: A population-based analysis of U.S. Medicare beneficiaries, 1999-2006," Lung India : official organ of Indian Chest Society, vol. 30 no. 1, pp. 20-26, DOI: 10.4103/0970-2113.106127, 2013.
[30] D. Pruksakorn, A. Phanphaisarn, J. Settakorn, U. Arpornchayanon, A. Tantraworasin, P. Chaiyawat, J. klangjorhor, P. Teeyakasem, "Prognostic score for life expectancy evaluation of lung cancer patients after bone metastasis," Journal of bone oncology, vol. 10,DOI: 10.1016/j.jbo.2017.10.001, 2018.
[31] F. Cappuzzo, T. Ciuleanu, L. Stelmakh, S. Cicenas, A. Szczésna, E. Juhász, E. Esteban, O. Molinier, W. Brugger, I. Melezínek, G. Klingelschmitt, B. Klughammer, G. Giaccone, SATURN investigators, "Erlotinib as maintenance treatment in advanced non-small-cell lung cancer: a multicentre, randomised, placebo-controlled phase 3 study," The Lancet. Oncology, vol. 11 no. 6, pp. 521-529, DOI: 10.1016/S1470-2045(10)70112-1, 2010.
[32] N. Thatcher, A. Chang, P. Parikh, J. Rodrigues Pereira, T. Ciuleanu, J. von Pawel, S. Thongprasert, E. H. Tan, K. Pemberton, V. Archer, K. Carroll, "Gefitinib plus best supportive care in previously treated patients with refractory advanced non-small-cell lung cancer: results from a randomised, placebo-controlled, multicentre study (Iressa Survival Evaluation in Lung Cancer)," Lancet, vol. 366 no. 9496, pp. 1527-1537, DOI: 10.1016/S0140-6736(05)67625-8, 2005.
[33] S. W. Choi, T. Ko, K. J. Hong, K. H. Kim, "Machine learning-based prediction of Korean triage and acuity scale level in emergency department patients," Healthcare informatics research, vol. 25 no. 4, pp. 305-312, DOI: 10.4258/hir.2019.25.4.305, 2019.
[34] C. A. Hu, C. M. Chen, Y. C. Fang, S. J. Liang, H. C. Wang, W. F. Fang, C. C. Sheu, W. C. Perng, K. Y. Yang, K. C. Kao, C. L. Wu, C. S. Tsai, M. Y. Lin, W. C. Chao, TSIRC (Taiwan Severe Influenza Research Consortium), "Using a machine learning approach to predict mortality in critically ill influenza patients: a cross-sectional retrospective multicentre study in Taiwan," BMJ Open, vol. 10 no. 2,DOI: 10.1136/bmjopen-2019-033898, 2020.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Copyright © 2020 Zhangheng Huang et al. This is an open access article distributed under the Creative Commons Attribution License (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. http://creativecommons.org/licenses/by/4.0/
Abstract
Non-small-cell lung cancer (NSCLC) patients often develop bone metastases (BM), and the overall survival for these patients is usually perishing. However, a model with high accuracy for predicting the survival of NSCLC with BM is still lacking. Here, we aimed to establish a model based on artificial intelligence for predicting the 1-year survival rate of NSCLC with BM by using extreme gradient boosting (XGBoost), a large-scale machine learning algorithm. We selected NSCLC patients with BM between 2010 and 2015 from the Surveillance, Epidemiology, and End Results database. In total, 5973 cases were enrolled and divided into the training (
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details

1 Department of Spine Surgery, Affiliated Hospital of Chengde Medical University, Shuangqiao District, Chengde, Hebei Province, China
2 Department of Spine Surgery, Affiliated Hospital of Chengde Medical University, Shuangqiao District, Chengde, Hebei Province, China; Department of Orthopedic, The Affiliated Hospital of Qingdao University, Shinan District, Qingdao, Shandong Province, China
3 Department of Radiotherapy, The Third Affiliated Hospital of Kunming Medical University, Chenggong District, Kunming, Yunnan Province, China
4 School of Public Health, Jilin University, Chaoyang District, Changchun, Jilin Province, China