Cargando…

Decoding topologically associating domains with ultra-low resolution Hi-C data by graph structural entropy

Submegabase-size topologically associating domains (TAD) have been observed in high-throughput chromatin interaction data (Hi-C). However, accurate detection of TADs depends on ultra-deep sequencing and sophisticated normalization procedures. Here we propose a fast and normalization-free method to d...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Angsheng, Yin, Xianchen, Xu, Bingxiang, Wang, Danyang, Han, Jimin, Wei, Yi, Deng, Yun, Xiong, Ying, Zhang, Zhihua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6093941/
https://www.ncbi.nlm.nih.gov/pubmed/30111883
http://dx.doi.org/10.1038/s41467-018-05691-7
Descripción
Sumario:Submegabase-size topologically associating domains (TAD) have been observed in high-throughput chromatin interaction data (Hi-C). However, accurate detection of TADs depends on ultra-deep sequencing and sophisticated normalization procedures. Here we propose a fast and normalization-free method to decode the domains of chromosomes (deDoc) that utilizes structural information theory. By treating Hi-C contact matrix as a representation of a graph, deDoc partitions the graph into segments with minimal structural entropy. We show that structural entropy can also be used to determine the proper bin size of the Hi-C data. By applying deDoc to pooled Hi-C data from 10 single cells, we detect megabase-size TAD-like domains. This result implies that the modular structure of the genome spatial organization may be fundamental to even a small cohort of single cells. Our algorithms may facilitate systematic investigations of chromosomal domains on a larger scale than hitherto have been possible.