Abstract

This study aimed to develop a machine learning (ML) model for predicting pulmonary embolism (PE) in patients with gastrointestinal cancers, a group at increased risk for PE. We conducted a retrospective, multicenter study analyzing patients who underwent computed tomographic pulmonary angiography (CTPA) between 2010 and 2020. The study utilized demographic and clinical data, including the Wells score and D-dimer levels, to train a random forest ML model. The model’s effectiveness was assessed using the area under the receiver operating curve (AUROC). In total, 446 patients from hospital A and 139 from hospital B were included. The training set consisted of 356 patients from hospital A, with internal validation on 90 and external validation on 139 patients from hospital B. The model achieved an AUROC of 0.736 in hospital A and 0.669 in hospital B. The ML model significantly reduced the number of patients recommended for CTPA compared to the conventional diagnostic strategy (hospital A; 100.0% vs. 91.1%, P < 0.001, hospital B; 100.0% vs. 93.5%, P = 0.003). The results indicate that an ML-based prediction model can reduce unnecessary CTPA procedures in gastrointestinal cancer patients, highlighting its potential to enhance diagnostic efficiency and reduce patient burden.

Details

Title
Machine learning-based prediction of pulmonary embolism to reduce unnecessary computed tomography scans in gastrointestinal cancer patients: a retrospective multicenter study
Author
Kim, Joo Seong 1 ; Kwon, Doyun 2 ; Kim, Kyungdo 3 ; Lee, Sang Hyub 4 ; Lee, Seung-Bo 5 ; Kim, Kwangsoo 6 ; Kim, Dongmin 7 ; Lee, Min Woo 4 ; Park, Namyoung 8 ; Choi, Jin Ho 4 ; Jang, Eun Sun 9 ; Cho, In Rae 4 ; Paik, Woo Hyun 4 ; Lee, Jun Kyu 10 ; Ryu, Ji Kon 4 ; Kim, Yong-Tae 4 

 Seoul National University Hospital, Seoul National University College of Medicine, Department of Internal Medicine and Liver Research Institute, Seoul, Korea (GRID:grid.31501.36) (ISNI:0000 0004 0470 5905); Dongguk University College of Medicine, Dongguk University Ilsan Hospital, Department of Internal Medicine, Goyang-si, Korea (GRID:grid.470090.a) (ISNI:0000 0004 1792 3864) 
 Seoul National University College of Medicine, Interdisciplinary Program of Medical Informatics, Seoul, Korea (GRID:grid.31501.36) (ISNI:0000 0004 0470 5905) 
 Duke University, Department of Biomedical Engineering, Pratt School of Engineering, Durham, USA (GRID:grid.26009.3d) (ISNI:0000 0004 1936 7961); Seoul National University Hospital, Transdisciplinary Department of Medicine & Advanced Technology, Seoul, Korea (GRID:grid.412484.f) (ISNI:0000 0001 0302 820X) 
 Seoul National University Hospital, Seoul National University College of Medicine, Department of Internal Medicine and Liver Research Institute, Seoul, Korea (GRID:grid.31501.36) (ISNI:0000 0004 0470 5905) 
 Keimyung University School of Medicine, Department of Medical Informatics, Daegu, Republic of Korea (GRID:grid.412091.f) (ISNI:0000 0001 0669 3109) 
 Seoul National University Hospital, Transdisciplinary Department of Medicine & Advanced Technology, Seoul, Korea (GRID:grid.412484.f) (ISNI:0000 0001 0302 820X); Seoul National University College of Medicine, Department of Medicine, Seoul, Korea (GRID:grid.31501.36) (ISNI:0000 0004 0470 5905) 
 Seoul National University Hospital, Biomedical Research Institute, Seoul, Korea (GRID:grid.412484.f) (ISNI:0000 0001 0302 820X) 
 Kyung Hee University Gangdong Hospital, Department of Medicine, Seoul, Korea (GRID:grid.289247.2) (ISNI:0000 0001 2171 7818) 
 Seoul National University Bundang Hospital, Department of Internal Medicine, Seongnam-si, Korea (GRID:grid.412480.b) (ISNI:0000 0004 0647 3378) 
10  Dongguk University College of Medicine, Dongguk University Ilsan Hospital, Department of Internal Medicine, Goyang-si, Korea (GRID:grid.470090.a) (ISNI:0000 0004 1792 3864) 
Pages
25359
Publication year
2024
Publication date
2024
Publisher
Nature Publishing Group
e-ISSN
20452322
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3120698857
Copyright
© The Author(s) 2024. This work is published under http://creativecommons.org/licenses/by-nc-nd/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.