Abstract

Submegabase-size topologically associating domains (TAD) have been observed in high-throughput chromatin interaction data (Hi-C). However, accurate detection of TADs depends on ultra-deep sequencing and sophisticated normalization procedures. Here we propose a fast and normalization-free method to decode the domains of chromosomes (deDoc) that utilizes structural information theory. By treating Hi-C contact matrix as a representation of a graph, deDoc partitions the graph into segments with minimal structural entropy. We show that structural entropy can also be used to determine the proper bin size of the Hi-C data. By applying deDoc to pooled Hi-C data from 10 single cells, we detect megabase-size TAD-like domains. This result implies that the modular structure of the genome spatial organization may be fundamental to even a small cohort of single cells. Our algorithms may facilitate systematic investigations of chromosomal domains on a larger scale than hitherto have been possible.

Details

Title
Decoding topologically associating domains with ultra-low resolution Hi-C data by graph structural entropy
Author
Li, Angsheng 1 ; Yin, Xianchen 2 ; Xu, Bingxiang 3 ; Wang, Danyang 3 ; Han, Jimin 4 ; Wei, Yi 5 ; Deng, Yun 6 ; Xiong, Ying 7 ; Zhang, Zhihua 3 

 State Key Laboratory of Software Development Environment, School of Computer Science, Beihang University, Beijing, P.R. China; State Key Laboratory of Computer Science, Institute of Software, Chinese Academy of Sciences, Beijing, P.R. China; School of Computer Science, University of Chinese Academy of Sciences, Beijing, P.R. China 
 State Key Laboratory of Computer Science, Institute of Software, Chinese Academy of Sciences, Beijing, P.R. China; School of Computer Science, University of Chinese Academy of Sciences, Beijing, P.R. China 
 CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, P.R. China; School of Life Science, University of Chinese Academy of Sciences, Beijing, P.R. China 
 School of Computer Science, University of Chinese Academy of Sciences, Beijing, P.R. China 
 School of Mathematics, University of Chinese Academy of Sciences, Beijing, P.R. China 
 School of Life Science, University of Chinese Academy of Sciences, Beijing, P.R. China 
 School of Physics, University of Chinese Academy of Sciences, Beijing, P.R. China 
Pages
1-12
Publication year
2018
Publication date
Aug 2018
Publisher
Nature Publishing Group
e-ISSN
20411723
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2088919237
Copyright
© 2018. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.