Cargando…

DeepITEH: a deep learning framework for identifying tissue-specific eRNAs from the human genome

MOTIVATION: Enhancers are vital cis-regulatory elements that regulate gene expression. Enhancer RNAs (eRNAs), a type of long noncoding RNAs, are transcribed from enhancer regions in the genome. The tissue-specific expression of eRNAs is crucial in the regulation of gene expression and cancer develop...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Tianjiao, Li, Liangyu, Sun, Hailong, Wang, Guohua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10281860/
https://www.ncbi.nlm.nih.gov/pubmed/37294799
http://dx.doi.org/10.1093/bioinformatics/btad375
_version_ 1785061072606068736
author Zhang, Tianjiao
Li, Liangyu
Sun, Hailong
Wang, Guohua
author_facet Zhang, Tianjiao
Li, Liangyu
Sun, Hailong
Wang, Guohua
author_sort Zhang, Tianjiao
collection PubMed
description MOTIVATION: Enhancers are vital cis-regulatory elements that regulate gene expression. Enhancer RNAs (eRNAs), a type of long noncoding RNAs, are transcribed from enhancer regions in the genome. The tissue-specific expression of eRNAs is crucial in the regulation of gene expression and cancer development. The methods that identify eRNAs based solely on genomic sequence data have high error rates because they do not account for tissue specificity. Specific histone modifications associated with eRNAs offer valuable information for their identification. However, identification of eRNAs using histone modification data requires the use of both RNA-seq and histone modification data. Unfortunately, many public datasets contain only one of these components, which impedes the accurate identification of eRNAs. RESULTS: We introduce DeepITEH, a deep learning framework that leverages RNA-seq data and histone modification data from multiple samples of the same tissue to enhance the accuracy of identifying eRNAs. Specifically, deepITEH initially categorizes eRNAs into two classes, namely, regularly expressed eRNAs and accidental eRNAs, using histone modification data from multiple samples of the same tissue. Thereafter, it integrates both sequence and histone modification features to identify eRNAs in specific tissues. To evaluate the performance of DeepITEH, we compared it with four existing state-of-the-art enhancer prediction methods, SeqPose, iEnhancer-RD, LSTMAtt, and FRL, on four normal tissues and four cancer tissues. Remarkably, seven of these tissues demonstrated a substantially improved specific eRNA prediction performance with DeepITEH, when compared with other methods. Our findings suggest that DeepITEH can effectively predict potential eRNAs on the human genome, providing insights for studying the eRNA function in cancer. AVAILABILITY AND IMPLEMENTATION: The source code and dataset of DeepITEH have been uploaded to https://github.com/lyli1013/DeepITEH.
format Online
Article
Text
id pubmed-10281860
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-102818602023-06-22 DeepITEH: a deep learning framework for identifying tissue-specific eRNAs from the human genome Zhang, Tianjiao Li, Liangyu Sun, Hailong Wang, Guohua Bioinformatics Original Paper MOTIVATION: Enhancers are vital cis-regulatory elements that regulate gene expression. Enhancer RNAs (eRNAs), a type of long noncoding RNAs, are transcribed from enhancer regions in the genome. The tissue-specific expression of eRNAs is crucial in the regulation of gene expression and cancer development. The methods that identify eRNAs based solely on genomic sequence data have high error rates because they do not account for tissue specificity. Specific histone modifications associated with eRNAs offer valuable information for their identification. However, identification of eRNAs using histone modification data requires the use of both RNA-seq and histone modification data. Unfortunately, many public datasets contain only one of these components, which impedes the accurate identification of eRNAs. RESULTS: We introduce DeepITEH, a deep learning framework that leverages RNA-seq data and histone modification data from multiple samples of the same tissue to enhance the accuracy of identifying eRNAs. Specifically, deepITEH initially categorizes eRNAs into two classes, namely, regularly expressed eRNAs and accidental eRNAs, using histone modification data from multiple samples of the same tissue. Thereafter, it integrates both sequence and histone modification features to identify eRNAs in specific tissues. To evaluate the performance of DeepITEH, we compared it with four existing state-of-the-art enhancer prediction methods, SeqPose, iEnhancer-RD, LSTMAtt, and FRL, on four normal tissues and four cancer tissues. Remarkably, seven of these tissues demonstrated a substantially improved specific eRNA prediction performance with DeepITEH, when compared with other methods. Our findings suggest that DeepITEH can effectively predict potential eRNAs on the human genome, providing insights for studying the eRNA function in cancer. AVAILABILITY AND IMPLEMENTATION: The source code and dataset of DeepITEH have been uploaded to https://github.com/lyli1013/DeepITEH. Oxford University Press 2023-06-09 /pmc/articles/PMC10281860/ /pubmed/37294799 http://dx.doi.org/10.1093/bioinformatics/btad375 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Zhang, Tianjiao
Li, Liangyu
Sun, Hailong
Wang, Guohua
DeepITEH: a deep learning framework for identifying tissue-specific eRNAs from the human genome
title DeepITEH: a deep learning framework for identifying tissue-specific eRNAs from the human genome
title_full DeepITEH: a deep learning framework for identifying tissue-specific eRNAs from the human genome
title_fullStr DeepITEH: a deep learning framework for identifying tissue-specific eRNAs from the human genome
title_full_unstemmed DeepITEH: a deep learning framework for identifying tissue-specific eRNAs from the human genome
title_short DeepITEH: a deep learning framework for identifying tissue-specific eRNAs from the human genome
title_sort deepiteh: a deep learning framework for identifying tissue-specific ernas from the human genome
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10281860/
https://www.ncbi.nlm.nih.gov/pubmed/37294799
http://dx.doi.org/10.1093/bioinformatics/btad375
work_keys_str_mv AT zhangtianjiao deepitehadeeplearningframeworkforidentifyingtissuespecificernasfromthehumangenome
AT liliangyu deepitehadeeplearningframeworkforidentifyingtissuespecificernasfromthehumangenome
AT sunhailong deepitehadeeplearningframeworkforidentifyingtissuespecificernasfromthehumangenome
AT wangguohua deepitehadeeplearningframeworkforidentifyingtissuespecificernasfromthehumangenome