Cargando…

A computational method to predict topologically associating domain boundaries combining histone Marks and sequence information

BACKGROUND: The three-dimensional (3D) structure of chromatins plays significant roles during cell differentiation and development. Hi-C and other 3C-based technologies allow us to look deep into the chromatin architectures. Many studies have suggested that topologically associating domains (TAD), a...

Descripción completa

Detalles Bibliográficos
Autores principales: Gan, Wei, Luo, Juan, Li, Yi Zhou, Guo, Jia Li, Zhu, Min, Li, Meng Long
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6933632/
https://www.ncbi.nlm.nih.gov/pubmed/31881832
http://dx.doi.org/10.1186/s12864-019-6303-z
_version_ 1783483245658112000
author Gan, Wei
Luo, Juan
Li, Yi Zhou
Guo, Jia Li
Zhu, Min
Li, Meng Long
author_facet Gan, Wei
Luo, Juan
Li, Yi Zhou
Guo, Jia Li
Zhu, Min
Li, Meng Long
author_sort Gan, Wei
collection PubMed
description BACKGROUND: The three-dimensional (3D) structure of chromatins plays significant roles during cell differentiation and development. Hi-C and other 3C-based technologies allow us to look deep into the chromatin architectures. Many studies have suggested that topologically associating domains (TAD), as the structure and functional unit, are conserved across different organs. However, our understanding about the underlying mechanism of the TAD boundary formation is still limited. RESULTS: We developed a computational method, TAD–Lactuca, to infer this structure by taking the contextual information of the epigenetic modification signals and the primary DNA sequence information on the genome. TAD–Lactuca is found stable in the case of multi-resolutions and different datasets. It could achieve high accuracy and even outperforms the state-of-art methods when the sequence patterns were incorporated. Moreover, several transcript factor binding motifs, besides the well-known CCCTC-binding factor (CTCF) motif, were found significantly enriched on the boundaries. CONCLUSIONS: We provided a low cost, effective method to predict TAD boundaries. Above results suggested the incorporation of sequence features could significantly improve the performance. The sequence motif enrichment analysis indicates several gene regulation motifs around the boundaries, which is consistent with TADs may serve as the functional units of gene regulation and implies the sequence patterns would be important in chromatin folding.
format Online
Article
Text
id pubmed-6933632
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-69336322019-12-30 A computational method to predict topologically associating domain boundaries combining histone Marks and sequence information Gan, Wei Luo, Juan Li, Yi Zhou Guo, Jia Li Zhu, Min Li, Meng Long BMC Genomics Research BACKGROUND: The three-dimensional (3D) structure of chromatins plays significant roles during cell differentiation and development. Hi-C and other 3C-based technologies allow us to look deep into the chromatin architectures. Many studies have suggested that topologically associating domains (TAD), as the structure and functional unit, are conserved across different organs. However, our understanding about the underlying mechanism of the TAD boundary formation is still limited. RESULTS: We developed a computational method, TAD–Lactuca, to infer this structure by taking the contextual information of the epigenetic modification signals and the primary DNA sequence information on the genome. TAD–Lactuca is found stable in the case of multi-resolutions and different datasets. It could achieve high accuracy and even outperforms the state-of-art methods when the sequence patterns were incorporated. Moreover, several transcript factor binding motifs, besides the well-known CCCTC-binding factor (CTCF) motif, were found significantly enriched on the boundaries. CONCLUSIONS: We provided a low cost, effective method to predict TAD boundaries. Above results suggested the incorporation of sequence features could significantly improve the performance. The sequence motif enrichment analysis indicates several gene regulation motifs around the boundaries, which is consistent with TADs may serve as the functional units of gene regulation and implies the sequence patterns would be important in chromatin folding. BioMed Central 2019-12-27 /pmc/articles/PMC6933632/ /pubmed/31881832 http://dx.doi.org/10.1186/s12864-019-6303-z Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Gan, Wei
Luo, Juan
Li, Yi Zhou
Guo, Jia Li
Zhu, Min
Li, Meng Long
A computational method to predict topologically associating domain boundaries combining histone Marks and sequence information
title A computational method to predict topologically associating domain boundaries combining histone Marks and sequence information
title_full A computational method to predict topologically associating domain boundaries combining histone Marks and sequence information
title_fullStr A computational method to predict topologically associating domain boundaries combining histone Marks and sequence information
title_full_unstemmed A computational method to predict topologically associating domain boundaries combining histone Marks and sequence information
title_short A computational method to predict topologically associating domain boundaries combining histone Marks and sequence information
title_sort computational method to predict topologically associating domain boundaries combining histone marks and sequence information
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6933632/
https://www.ncbi.nlm.nih.gov/pubmed/31881832
http://dx.doi.org/10.1186/s12864-019-6303-z
work_keys_str_mv AT ganwei acomputationalmethodtopredicttopologicallyassociatingdomainboundariescombininghistonemarksandsequenceinformation
AT luojuan acomputationalmethodtopredicttopologicallyassociatingdomainboundariescombininghistonemarksandsequenceinformation
AT liyizhou acomputationalmethodtopredicttopologicallyassociatingdomainboundariescombininghistonemarksandsequenceinformation
AT guojiali acomputationalmethodtopredicttopologicallyassociatingdomainboundariescombininghistonemarksandsequenceinformation
AT zhumin acomputationalmethodtopredicttopologicallyassociatingdomainboundariescombininghistonemarksandsequenceinformation
AT limenglong acomputationalmethodtopredicttopologicallyassociatingdomainboundariescombininghistonemarksandsequenceinformation
AT ganwei computationalmethodtopredicttopologicallyassociatingdomainboundariescombininghistonemarksandsequenceinformation
AT luojuan computationalmethodtopredicttopologicallyassociatingdomainboundariescombininghistonemarksandsequenceinformation
AT liyizhou computationalmethodtopredicttopologicallyassociatingdomainboundariescombininghistonemarksandsequenceinformation
AT guojiali computationalmethodtopredicttopologicallyassociatingdomainboundariescombininghistonemarksandsequenceinformation
AT zhumin computationalmethodtopredicttopologicallyassociatingdomainboundariescombininghistonemarksandsequenceinformation
AT limenglong computationalmethodtopredicttopologicallyassociatingdomainboundariescombininghistonemarksandsequenceinformation