Cargando…
Assessing comparative importance of DNA sequence and epigenetic modifications on gene expression using a deep convolutional neural network
Gene expression is regulated at both transcriptional and post-transcriptional levels. DNA sequence and epigenetic modifications are key factors which regulate gene transcription. Understanding their complex interactions and their respective contributions to gene expression regulation remains a chall...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Research Network of Computational and Structural Biotechnology
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9307602/ https://www.ncbi.nlm.nih.gov/pubmed/35891778 http://dx.doi.org/10.1016/j.csbj.2022.07.014 |
_version_ | 1784752798741561344 |
---|---|
author | Gao, Shang Rehman, Jalees Dai, Yang |
author_facet | Gao, Shang Rehman, Jalees Dai, Yang |
author_sort | Gao, Shang |
collection | PubMed |
description | Gene expression is regulated at both transcriptional and post-transcriptional levels. DNA sequence and epigenetic modifications are key factors which regulate gene transcription. Understanding their complex interactions and their respective contributions to gene expression regulation remains a challenge in biological studies. We have developed iSEGnet, a framework of deep convolutional neural network to predict mRNA abundance using the information on DNA sequences as well as epigenetic modifications within genes and their cis-regulatory regions. We demonstrate that our framework outperforms other machine learning models in terms of predicting mRNA abundance using transcriptional and epigenetic profiles from six distinct cell lines/types chosen from the ENCODE. The analysis from the learned models also reveals that specific regions around promotors and transcription termination sites are most important for gene expression regulation. Using the method of Integrated Gradients, we identify narrow segments in these regions which are most likely to impact gene expression for a specific epigenetic modification. We further show that these identified segments are enriched in known active regulatory regions by comparing the transcription factor binding sites obtained via ChIP-seq. Moreover, we demonstrate how iSEGnet can uncover potential transcription factors that have regulatory functions in cancer using two cancer multi-omics data. |
format | Online Article Text |
id | pubmed-9307602 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Research Network of Computational and Structural Biotechnology |
record_format | MEDLINE/PubMed |
spelling | pubmed-93076022022-07-25 Assessing comparative importance of DNA sequence and epigenetic modifications on gene expression using a deep convolutional neural network Gao, Shang Rehman, Jalees Dai, Yang Comput Struct Biotechnol J Research Article Gene expression is regulated at both transcriptional and post-transcriptional levels. DNA sequence and epigenetic modifications are key factors which regulate gene transcription. Understanding their complex interactions and their respective contributions to gene expression regulation remains a challenge in biological studies. We have developed iSEGnet, a framework of deep convolutional neural network to predict mRNA abundance using the information on DNA sequences as well as epigenetic modifications within genes and their cis-regulatory regions. We demonstrate that our framework outperforms other machine learning models in terms of predicting mRNA abundance using transcriptional and epigenetic profiles from six distinct cell lines/types chosen from the ENCODE. The analysis from the learned models also reveals that specific regions around promotors and transcription termination sites are most important for gene expression regulation. Using the method of Integrated Gradients, we identify narrow segments in these regions which are most likely to impact gene expression for a specific epigenetic modification. We further show that these identified segments are enriched in known active regulatory regions by comparing the transcription factor binding sites obtained via ChIP-seq. Moreover, we demonstrate how iSEGnet can uncover potential transcription factors that have regulatory functions in cancer using two cancer multi-omics data. Research Network of Computational and Structural Biotechnology 2022-07-13 /pmc/articles/PMC9307602/ /pubmed/35891778 http://dx.doi.org/10.1016/j.csbj.2022.07.014 Text en https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Research Article Gao, Shang Rehman, Jalees Dai, Yang Assessing comparative importance of DNA sequence and epigenetic modifications on gene expression using a deep convolutional neural network |
title | Assessing comparative importance of DNA sequence and epigenetic modifications on gene expression using a deep convolutional neural network |
title_full | Assessing comparative importance of DNA sequence and epigenetic modifications on gene expression using a deep convolutional neural network |
title_fullStr | Assessing comparative importance of DNA sequence and epigenetic modifications on gene expression using a deep convolutional neural network |
title_full_unstemmed | Assessing comparative importance of DNA sequence and epigenetic modifications on gene expression using a deep convolutional neural network |
title_short | Assessing comparative importance of DNA sequence and epigenetic modifications on gene expression using a deep convolutional neural network |
title_sort | assessing comparative importance of dna sequence and epigenetic modifications on gene expression using a deep convolutional neural network |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9307602/ https://www.ncbi.nlm.nih.gov/pubmed/35891778 http://dx.doi.org/10.1016/j.csbj.2022.07.014 |
work_keys_str_mv | AT gaoshang assessingcomparativeimportanceofdnasequenceandepigeneticmodificationsongeneexpressionusingadeepconvolutionalneuralnetwork AT rehmanjalees assessingcomparativeimportanceofdnasequenceandepigeneticmodificationsongeneexpressionusingadeepconvolutionalneuralnetwork AT daiyang assessingcomparativeimportanceofdnasequenceandepigeneticmodificationsongeneexpressionusingadeepconvolutionalneuralnetwork |