Cargando…
A computational method to predict topologically associating domain boundaries combining histone Marks and sequence information
BACKGROUND: The three-dimensional (3D) structure of chromatins plays significant roles during cell differentiation and development. Hi-C and other 3C-based technologies allow us to look deep into the chromatin architectures. Many studies have suggested that topologically associating domains (TAD), a...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6933632/ https://www.ncbi.nlm.nih.gov/pubmed/31881832 http://dx.doi.org/10.1186/s12864-019-6303-z |
_version_ | 1783483245658112000 |
---|---|
author | Gan, Wei Luo, Juan Li, Yi Zhou Guo, Jia Li Zhu, Min Li, Meng Long |
author_facet | Gan, Wei Luo, Juan Li, Yi Zhou Guo, Jia Li Zhu, Min Li, Meng Long |
author_sort | Gan, Wei |
collection | PubMed |
description | BACKGROUND: The three-dimensional (3D) structure of chromatins plays significant roles during cell differentiation and development. Hi-C and other 3C-based technologies allow us to look deep into the chromatin architectures. Many studies have suggested that topologically associating domains (TAD), as the structure and functional unit, are conserved across different organs. However, our understanding about the underlying mechanism of the TAD boundary formation is still limited. RESULTS: We developed a computational method, TAD–Lactuca, to infer this structure by taking the contextual information of the epigenetic modification signals and the primary DNA sequence information on the genome. TAD–Lactuca is found stable in the case of multi-resolutions and different datasets. It could achieve high accuracy and even outperforms the state-of-art methods when the sequence patterns were incorporated. Moreover, several transcript factor binding motifs, besides the well-known CCCTC-binding factor (CTCF) motif, were found significantly enriched on the boundaries. CONCLUSIONS: We provided a low cost, effective method to predict TAD boundaries. Above results suggested the incorporation of sequence features could significantly improve the performance. The sequence motif enrichment analysis indicates several gene regulation motifs around the boundaries, which is consistent with TADs may serve as the functional units of gene regulation and implies the sequence patterns would be important in chromatin folding. |
format | Online Article Text |
id | pubmed-6933632 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-69336322019-12-30 A computational method to predict topologically associating domain boundaries combining histone Marks and sequence information Gan, Wei Luo, Juan Li, Yi Zhou Guo, Jia Li Zhu, Min Li, Meng Long BMC Genomics Research BACKGROUND: The three-dimensional (3D) structure of chromatins plays significant roles during cell differentiation and development. Hi-C and other 3C-based technologies allow us to look deep into the chromatin architectures. Many studies have suggested that topologically associating domains (TAD), as the structure and functional unit, are conserved across different organs. However, our understanding about the underlying mechanism of the TAD boundary formation is still limited. RESULTS: We developed a computational method, TAD–Lactuca, to infer this structure by taking the contextual information of the epigenetic modification signals and the primary DNA sequence information on the genome. TAD–Lactuca is found stable in the case of multi-resolutions and different datasets. It could achieve high accuracy and even outperforms the state-of-art methods when the sequence patterns were incorporated. Moreover, several transcript factor binding motifs, besides the well-known CCCTC-binding factor (CTCF) motif, were found significantly enriched on the boundaries. CONCLUSIONS: We provided a low cost, effective method to predict TAD boundaries. Above results suggested the incorporation of sequence features could significantly improve the performance. The sequence motif enrichment analysis indicates several gene regulation motifs around the boundaries, which is consistent with TADs may serve as the functional units of gene regulation and implies the sequence patterns would be important in chromatin folding. BioMed Central 2019-12-27 /pmc/articles/PMC6933632/ /pubmed/31881832 http://dx.doi.org/10.1186/s12864-019-6303-z Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Gan, Wei Luo, Juan Li, Yi Zhou Guo, Jia Li Zhu, Min Li, Meng Long A computational method to predict topologically associating domain boundaries combining histone Marks and sequence information |
title | A computational method to predict topologically associating domain boundaries combining histone Marks and sequence information |
title_full | A computational method to predict topologically associating domain boundaries combining histone Marks and sequence information |
title_fullStr | A computational method to predict topologically associating domain boundaries combining histone Marks and sequence information |
title_full_unstemmed | A computational method to predict topologically associating domain boundaries combining histone Marks and sequence information |
title_short | A computational method to predict topologically associating domain boundaries combining histone Marks and sequence information |
title_sort | computational method to predict topologically associating domain boundaries combining histone marks and sequence information |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6933632/ https://www.ncbi.nlm.nih.gov/pubmed/31881832 http://dx.doi.org/10.1186/s12864-019-6303-z |
work_keys_str_mv | AT ganwei acomputationalmethodtopredicttopologicallyassociatingdomainboundariescombininghistonemarksandsequenceinformation AT luojuan acomputationalmethodtopredicttopologicallyassociatingdomainboundariescombininghistonemarksandsequenceinformation AT liyizhou acomputationalmethodtopredicttopologicallyassociatingdomainboundariescombininghistonemarksandsequenceinformation AT guojiali acomputationalmethodtopredicttopologicallyassociatingdomainboundariescombininghistonemarksandsequenceinformation AT zhumin acomputationalmethodtopredicttopologicallyassociatingdomainboundariescombininghistonemarksandsequenceinformation AT limenglong acomputationalmethodtopredicttopologicallyassociatingdomainboundariescombininghistonemarksandsequenceinformation AT ganwei computationalmethodtopredicttopologicallyassociatingdomainboundariescombininghistonemarksandsequenceinformation AT luojuan computationalmethodtopredicttopologicallyassociatingdomainboundariescombininghistonemarksandsequenceinformation AT liyizhou computationalmethodtopredicttopologicallyassociatingdomainboundariescombininghistonemarksandsequenceinformation AT guojiali computationalmethodtopredicttopologicallyassociatingdomainboundariescombininghistonemarksandsequenceinformation AT zhumin computationalmethodtopredicttopologicallyassociatingdomainboundariescombininghistonemarksandsequenceinformation AT limenglong computationalmethodtopredicttopologicallyassociatingdomainboundariescombininghistonemarksandsequenceinformation |