Cargando…
DeepITEH: a deep learning framework for identifying tissue-specific eRNAs from the human genome
MOTIVATION: Enhancers are vital cis-regulatory elements that regulate gene expression. Enhancer RNAs (eRNAs), a type of long noncoding RNAs, are transcribed from enhancer regions in the genome. The tissue-specific expression of eRNAs is crucial in the regulation of gene expression and cancer develop...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10281860/ https://www.ncbi.nlm.nih.gov/pubmed/37294799 http://dx.doi.org/10.1093/bioinformatics/btad375 |
_version_ | 1785061072606068736 |
---|---|
author | Zhang, Tianjiao Li, Liangyu Sun, Hailong Wang, Guohua |
author_facet | Zhang, Tianjiao Li, Liangyu Sun, Hailong Wang, Guohua |
author_sort | Zhang, Tianjiao |
collection | PubMed |
description | MOTIVATION: Enhancers are vital cis-regulatory elements that regulate gene expression. Enhancer RNAs (eRNAs), a type of long noncoding RNAs, are transcribed from enhancer regions in the genome. The tissue-specific expression of eRNAs is crucial in the regulation of gene expression and cancer development. The methods that identify eRNAs based solely on genomic sequence data have high error rates because they do not account for tissue specificity. Specific histone modifications associated with eRNAs offer valuable information for their identification. However, identification of eRNAs using histone modification data requires the use of both RNA-seq and histone modification data. Unfortunately, many public datasets contain only one of these components, which impedes the accurate identification of eRNAs. RESULTS: We introduce DeepITEH, a deep learning framework that leverages RNA-seq data and histone modification data from multiple samples of the same tissue to enhance the accuracy of identifying eRNAs. Specifically, deepITEH initially categorizes eRNAs into two classes, namely, regularly expressed eRNAs and accidental eRNAs, using histone modification data from multiple samples of the same tissue. Thereafter, it integrates both sequence and histone modification features to identify eRNAs in specific tissues. To evaluate the performance of DeepITEH, we compared it with four existing state-of-the-art enhancer prediction methods, SeqPose, iEnhancer-RD, LSTMAtt, and FRL, on four normal tissues and four cancer tissues. Remarkably, seven of these tissues demonstrated a substantially improved specific eRNA prediction performance with DeepITEH, when compared with other methods. Our findings suggest that DeepITEH can effectively predict potential eRNAs on the human genome, providing insights for studying the eRNA function in cancer. AVAILABILITY AND IMPLEMENTATION: The source code and dataset of DeepITEH have been uploaded to https://github.com/lyli1013/DeepITEH. |
format | Online Article Text |
id | pubmed-10281860 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-102818602023-06-22 DeepITEH: a deep learning framework for identifying tissue-specific eRNAs from the human genome Zhang, Tianjiao Li, Liangyu Sun, Hailong Wang, Guohua Bioinformatics Original Paper MOTIVATION: Enhancers are vital cis-regulatory elements that regulate gene expression. Enhancer RNAs (eRNAs), a type of long noncoding RNAs, are transcribed from enhancer regions in the genome. The tissue-specific expression of eRNAs is crucial in the regulation of gene expression and cancer development. The methods that identify eRNAs based solely on genomic sequence data have high error rates because they do not account for tissue specificity. Specific histone modifications associated with eRNAs offer valuable information for their identification. However, identification of eRNAs using histone modification data requires the use of both RNA-seq and histone modification data. Unfortunately, many public datasets contain only one of these components, which impedes the accurate identification of eRNAs. RESULTS: We introduce DeepITEH, a deep learning framework that leverages RNA-seq data and histone modification data from multiple samples of the same tissue to enhance the accuracy of identifying eRNAs. Specifically, deepITEH initially categorizes eRNAs into two classes, namely, regularly expressed eRNAs and accidental eRNAs, using histone modification data from multiple samples of the same tissue. Thereafter, it integrates both sequence and histone modification features to identify eRNAs in specific tissues. To evaluate the performance of DeepITEH, we compared it with four existing state-of-the-art enhancer prediction methods, SeqPose, iEnhancer-RD, LSTMAtt, and FRL, on four normal tissues and four cancer tissues. Remarkably, seven of these tissues demonstrated a substantially improved specific eRNA prediction performance with DeepITEH, when compared with other methods. Our findings suggest that DeepITEH can effectively predict potential eRNAs on the human genome, providing insights for studying the eRNA function in cancer. AVAILABILITY AND IMPLEMENTATION: The source code and dataset of DeepITEH have been uploaded to https://github.com/lyli1013/DeepITEH. Oxford University Press 2023-06-09 /pmc/articles/PMC10281860/ /pubmed/37294799 http://dx.doi.org/10.1093/bioinformatics/btad375 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Paper Zhang, Tianjiao Li, Liangyu Sun, Hailong Wang, Guohua DeepITEH: a deep learning framework for identifying tissue-specific eRNAs from the human genome |
title | DeepITEH: a deep learning framework for identifying tissue-specific eRNAs from the human genome |
title_full | DeepITEH: a deep learning framework for identifying tissue-specific eRNAs from the human genome |
title_fullStr | DeepITEH: a deep learning framework for identifying tissue-specific eRNAs from the human genome |
title_full_unstemmed | DeepITEH: a deep learning framework for identifying tissue-specific eRNAs from the human genome |
title_short | DeepITEH: a deep learning framework for identifying tissue-specific eRNAs from the human genome |
title_sort | deepiteh: a deep learning framework for identifying tissue-specific ernas from the human genome |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10281860/ https://www.ncbi.nlm.nih.gov/pubmed/37294799 http://dx.doi.org/10.1093/bioinformatics/btad375 |
work_keys_str_mv | AT zhangtianjiao deepitehadeeplearningframeworkforidentifyingtissuespecificernasfromthehumangenome AT liliangyu deepitehadeeplearningframeworkforidentifyingtissuespecificernasfromthehumangenome AT sunhailong deepitehadeeplearningframeworkforidentifyingtissuespecificernasfromthehumangenome AT wangguohua deepitehadeeplearningframeworkforidentifyingtissuespecificernasfromthehumangenome |