Abstract

Histopathologic diagnosis and classification of cancer plays a critical role in guiding treatment. Advances in next-generation sequencing have ushered in new complementary molecular frameworks. However, existing approaches do not independently assess both site-of-origin (e.g. prostate) and lineage (e.g. adenocarcinoma) and have minimal validation in metastatic disease, where classification is more difficult. Utilizing gradient-boosted machine learning, we developed ATLAS, a pair of separate AI Tumor Lineage and Site-of-origin models from RNA expression data on 8249 tumor samples. We assessed performance independently in 10,376 total tumor samples, including 1490 metastatic samples, achieving an accuracy of 91.4% for cancer site-of-origin and 97.1% for cancer lineage. High confidence predictions (encompassing the majority of cases) were accurate 98–99% of the time in both localized and remarkably even in metastatic samples. We also identified emergent properties of our lineage scores for tumor types on which the model was never trained (zero-shot learning). Adenocarcinoma/sarcoma lineage scores differentiated epithelioid from biphasic/sarcomatoid mesothelioma. Also, predicted lineage de-differentiation identified neuroendocrine/small cell tumors and was associated with poor outcomes across tumor types. Our platform-independent single-sample approach can be easily translated to existing RNA-seq platforms. ATLAS can complement and guide traditional histopathologic assessment in challenging situations and tumors of unknown primary.

ATLAS is a pair of AI Tumor Lineage and Site-of-origin machine learning models, that can accurately classify both primary and metastatic tumors using high-throughput RNA expression data and can identify de-differentiated anaplastic tumors.

Details

Title
A platform-independent AI tumor lineage and site (ATLAS) classifier
Author
Rydzewski, Nicholas R. 1 ; Shi, Yue 2 ; Li, Chenxuan 2 ; Chrostek, Matthew R. 2   VIAFID ORCID Logo  ; Bakhtiar, Hamza 2   VIAFID ORCID Logo  ; Helzer, Kyle T. 2 ; Bootsma, Matthew L. 2 ; Berg, Tracy J. 2 ; Harari, Paul M. 3 ; Floberg, John M. 3   VIAFID ORCID Logo  ; Blitzer, Grace C. 3 ; Kosoff, David 4 ; Taylor, Amy K. 4 ; Sharifi, Marina N. 4 ; Yu, Menggang 5 ; Lang, Joshua M. 4   VIAFID ORCID Logo  ; Patel, Krishnan R. 6   VIAFID ORCID Logo  ; Citrin, Deborah E. 6   VIAFID ORCID Logo  ; Sundling, Kaitlin E. 7   VIAFID ORCID Logo  ; Zhao, Shuang G. 8   VIAFID ORCID Logo 

 National Cancer Institute, National Institutes of Health, Radiation Oncology Branch, Bethesda, USA (GRID:grid.48336.3a) (ISNI:0000 0004 1936 8075); University of Wisconsin, Department of Human Oncology, Madison, USA (GRID:grid.14003.36) (ISNI:0000 0001 2167 3675) 
 University of Wisconsin, Department of Human Oncology, Madison, USA (GRID:grid.14003.36) (ISNI:0000 0001 2167 3675) 
 University of Wisconsin, Department of Human Oncology, Madison, USA (GRID:grid.14003.36) (ISNI:0000 0001 2167 3675); University of Wisconsin, Carbone Cancer Center, Madison, USA (GRID:grid.412639.b) (ISNI:0000 0001 2191 1477) 
 University of Wisconsin, Carbone Cancer Center, Madison, USA (GRID:grid.412639.b) (ISNI:0000 0001 2191 1477); University of Wisconsin, Department of Medicine, Madison, USA (GRID:grid.14003.36) (ISNI:0000 0001 2167 3675) 
 University of Wisconsin, Department of Biostatistics and Medical Informatics, Madison, USA (GRID:grid.14003.36) (ISNI:0000 0001 2167 3675) 
 National Cancer Institute, National Institutes of Health, Radiation Oncology Branch, Bethesda, USA (GRID:grid.48336.3a) (ISNI:0000 0004 1936 8075) 
 University of Wisconsin, Department of Pathology and Laboratory Medicine, Madison, USA (GRID:grid.14003.36) (ISNI:0000 0001 2167 3675); University of Wisconsin, Wisconsin State Laboratory of Hygiene, Madison, USA (GRID:grid.28803.31) (ISNI:0000 0001 0701 8607) 
 University of Wisconsin, Department of Human Oncology, Madison, USA (GRID:grid.14003.36) (ISNI:0000 0001 2167 3675); University of Wisconsin, Carbone Cancer Center, Madison, USA (GRID:grid.412639.b) (ISNI:0000 0001 2191 1477); William S. Middleton Veterans Hospital, Madison, USA (GRID:grid.417123.2) (ISNI:0000 0004 0420 6882) 
Pages
314
Publication year
2024
Publication date
2024
Publisher
Nature Publishing Group
e-ISSN
23993642
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2956512354
Copyright
© This is a U.S. Government work and not under copyright protection in the US; foreign copyright protection may apply 2024. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.