Abstract

We introduce a new computational method named EMeth to estimate cell type proportions using DNA methylation data. EMeth is a reference-based method that requires cell type-specific DNA methylation data from relevant cell types. EMeth improves on the existing reference-based methods by detecting the CpGs whose DNA methylation are inconsistent with the deconvolution model and reducing their contributions to cell type decomposition. Another novel feature of EMeth is that it allows a cell type with known proportions but unknown reference and estimates its methylation. This is motivated by the case of studying methylation in tumor cells while bulk tumor samples include tumor cells as well as other cell types such as infiltrating immune cells, and tumor cell proportion can be estimated by copy number data. We demonstrate that EMeth delivers more accurate estimates of cell type proportions than several other methods using simulated data and in silico mixtures. Applications in cancer studies show that the proportions of T regulatory cells estimated by DNA methylation have expected associations with mutation load and survival time, while the estimates from gene expression miss such associations.

Details

Title
EMeth: An EM algorithm for cell type decomposition based on DNA methylation data
Author
Zhang Hanyu 1 ; Cai Ruoyi 2 ; Dai, James 3 ; Sun, Wei 4 

 University of Washington, Department of Statistics, Seattle, USA (GRID:grid.34477.33) (ISNI:0000000122986657) 
 University of Washington, Department of Biostatistics, Seattle, USA (GRID:grid.34477.33) (ISNI:0000000122986657) 
 Fred Hutchinson Cancer Research Center, Public Health Science Division, Seattle, USA (GRID:grid.270240.3) (ISNI:0000 0001 2180 1622) 
 University of Washington, Department of Biostatistics, Seattle, USA (GRID:grid.34477.33) (ISNI:0000000122986657); Fred Hutchinson Cancer Research Center, Public Health Science Division, Seattle, USA (GRID:grid.270240.3) (ISNI:0000 0001 2180 1622); University of North Carolina, Department of Biostatistics, Chapel Hill, USA (GRID:grid.410711.2) (ISNI:0000 0001 1034 1720) 
Publication year
2021
Publication date
2021
Publisher
Nature Publishing Group
e-ISSN
20452322
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2500163190
Copyright
© The Author(s) 2021. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.