Cargando…
Deciphering epigenomic code for cell differentiation using deep learning
BACKGROUND: Although DNA sequence plays a crucial role in establishing the unique epigenome of a cell type, little is known about the sequence determinants that lead to the unique epigenomes of different cell types produced during cell differentiation. To fill this gap, we employed two types of deep...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6739944/ https://www.ncbi.nlm.nih.gov/pubmed/31510916 http://dx.doi.org/10.1186/s12864-019-6072-8 |
_version_ | 1783451017899147264 |
---|---|
author | Ni, Pengyu Su, Zhengchang |
author_facet | Ni, Pengyu Su, Zhengchang |
author_sort | Ni, Pengyu |
collection | PubMed |
description | BACKGROUND: Although DNA sequence plays a crucial role in establishing the unique epigenome of a cell type, little is known about the sequence determinants that lead to the unique epigenomes of different cell types produced during cell differentiation. To fill this gap, we employed two types of deep convolutional neural networks (CNNs) constructed for each of differentially related cell types and for each of histone marks measured in the cells, to learn the sequence determinants of various histone modification patterns in each cell type. RESULTS: We applied our models to four differentially related human CD(4)(+) T cell types and six histone marks measured in each cell type. The cell models can accurately predict the histone marks in each cell type, while the mark models can also accurately predict the cell types based on a single mark. Sequence motifs learned by both the cell or mark models are highly similar to known binding motifs of transcription factors known to play important roles in CD(4)(+) T cell differentiation. Both the unique histone mark patterns in each cell type and the different patterns of the same histone mark in different cell types are determined by a set of motifs with unique combinations. Interestingly, the level of sharing motifs learned in the different cell models reflects the lineage relationships of the cells, while the level of sharing motifs learned in the different histone mark models reflects their functional relationships. These models can also enable the prediction of the importance of learned motifs and their interactions in determining specific histone mark patterns in the cell types. CONCLUSION: Sequence determinants of various histone modification patterns in different cell types can be revealed by comparative analysis of motifs learned in the CNN models for multiple cell types and histone marks. The learned motifs are interpretable and may provide insights into the underlying molecular mechanisms of establishing the unique epigenomes in different cell types. Thus, our results support the hypothesis that DNA sequences ultimately determine the unique epigenomes of different cell types through their interactions with transcriptional factors, epigenome remodeling system and extracellular cues during cell differentiation. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-019-6072-8) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-6739944 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-67399442019-09-16 Deciphering epigenomic code for cell differentiation using deep learning Ni, Pengyu Su, Zhengchang BMC Genomics Research Article BACKGROUND: Although DNA sequence plays a crucial role in establishing the unique epigenome of a cell type, little is known about the sequence determinants that lead to the unique epigenomes of different cell types produced during cell differentiation. To fill this gap, we employed two types of deep convolutional neural networks (CNNs) constructed for each of differentially related cell types and for each of histone marks measured in the cells, to learn the sequence determinants of various histone modification patterns in each cell type. RESULTS: We applied our models to four differentially related human CD(4)(+) T cell types and six histone marks measured in each cell type. The cell models can accurately predict the histone marks in each cell type, while the mark models can also accurately predict the cell types based on a single mark. Sequence motifs learned by both the cell or mark models are highly similar to known binding motifs of transcription factors known to play important roles in CD(4)(+) T cell differentiation. Both the unique histone mark patterns in each cell type and the different patterns of the same histone mark in different cell types are determined by a set of motifs with unique combinations. Interestingly, the level of sharing motifs learned in the different cell models reflects the lineage relationships of the cells, while the level of sharing motifs learned in the different histone mark models reflects their functional relationships. These models can also enable the prediction of the importance of learned motifs and their interactions in determining specific histone mark patterns in the cell types. CONCLUSION: Sequence determinants of various histone modification patterns in different cell types can be revealed by comparative analysis of motifs learned in the CNN models for multiple cell types and histone marks. The learned motifs are interpretable and may provide insights into the underlying molecular mechanisms of establishing the unique epigenomes in different cell types. Thus, our results support the hypothesis that DNA sequences ultimately determine the unique epigenomes of different cell types through their interactions with transcriptional factors, epigenome remodeling system and extracellular cues during cell differentiation. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-019-6072-8) contains supplementary material, which is available to authorized users. BioMed Central 2019-09-12 /pmc/articles/PMC6739944/ /pubmed/31510916 http://dx.doi.org/10.1186/s12864-019-6072-8 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Ni, Pengyu Su, Zhengchang Deciphering epigenomic code for cell differentiation using deep learning |
title | Deciphering epigenomic code for cell differentiation using deep learning |
title_full | Deciphering epigenomic code for cell differentiation using deep learning |
title_fullStr | Deciphering epigenomic code for cell differentiation using deep learning |
title_full_unstemmed | Deciphering epigenomic code for cell differentiation using deep learning |
title_short | Deciphering epigenomic code for cell differentiation using deep learning |
title_sort | deciphering epigenomic code for cell differentiation using deep learning |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6739944/ https://www.ncbi.nlm.nih.gov/pubmed/31510916 http://dx.doi.org/10.1186/s12864-019-6072-8 |
work_keys_str_mv | AT nipengyu decipheringepigenomiccodeforcelldifferentiationusingdeeplearning AT suzhengchang decipheringepigenomiccodeforcelldifferentiationusingdeeplearning |