Abstract

Drug response differs substantially in cancer patients due to inter- and intra-tumor heterogeneity. Particularly, transcriptome context, especially tumor microenvironment, has been shown playing a significant role in shaping the actual treatment outcome. In this study, we develop a deep variational autoencoder (VAE) model to compress thousands of genes into latent vectors in a low-dimensional space. We then demonstrate that these encoded vectors could accurately impute drug response, outperform standard signature-gene based approaches, and appropriately control the overfitting problem. We apply rigorous quality assessment and validation, including assessing the impact of cell line lineage, cross-validation, cross-panel evaluation, and application in independent clinical data sets, to warrant the accuracy of the imputed drug response in both cell lines and cancer samples. Specifically, the expression-regulated component (EReX) of the observed drug response achieves high correlation across panels. Using the well-trained models, we impute drug response of The Cancer Genome Atlas data and investigate the features and signatures associated with the imputed drug response, including cell line origins, somatic mutations and tumor mutation burdens, tumor microenvironment, and confounding factors. In summary, our deep learning method and the results are useful for the study of signatures and markers of drug response.

Drug response in cancer patients vary dramatically due to inter- and intra-tumor heterogeneity and transcriptome context plays a significant role in shaping the actual treatment outcome. Here, the authors develop a deep variational autoencoder model to compress gene signatures into latent vectors and accurately impute drug response.

Details

Title
Deep generative neural network for accurate drug response imputation
Author
Jia Peilin 1   VIAFID ORCID Logo  ; Hu Ruifeng 1   VIAFID ORCID Logo  ; Guangsheng, Pei 1   VIAFID ORCID Logo  ; Dai Yulin 1   VIAFID ORCID Logo  ; Yin-Ying, Wang 1 ; Zhao, Zhongming 2   VIAFID ORCID Logo 

 The University of Texas Health Science Center at Houston, Center for Precision Health, School of Biomedical Informatics, Houston, USA (GRID:grid.267308.8) (ISNI:0000 0000 9206 2401) 
 The University of Texas Health Science Center at Houston, Center for Precision Health, School of Biomedical Informatics, Houston, USA (GRID:grid.267308.8) (ISNI:0000 0000 9206 2401); The University of Texas Health Science Center at Houston, Human Genetics Center, School of Public Health, Houston, USA (GRID:grid.267308.8) (ISNI:0000 0000 9206 2401); MD Anderson Cancer Center UTHealth Graduate School of Biomedical Sciences, Houston, USA (GRID:grid.240145.6) (ISNI:0000 0001 2291 4776); Vanderbilt University Medical Center, Department of Biomedical Informatics, Nashville, USA (GRID:grid.412807.8) (ISNI:0000 0004 1936 9916) 
Publication year
2021
Publication date
2021
Publisher
Nature Publishing Group
e-ISSN
20411723
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2503047512
Copyright
© The Author(s) 2021. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.