Cargando…

A two-stream convolutional neural network for microRNA transcription start site feature integration and identification

MicroRNAs (miRNAs) play important roles in post-transcriptional gene regulation and phenotype development. Understanding the regulation of miRNA genes is critical to understand gene regulation. One of the challenges to study miRNA gene regulation is the lack of condition-specific annotation of miRNA...

Descripción completa

Detalles Bibliográficos
Autores principales: Cha, Mingyu, Zheng, Hansi, Talukder, Amlan, Barham, Clayton, Li, Xiaoman, Hu, Haiyan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7952457/
https://www.ncbi.nlm.nih.gov/pubmed/33707582
http://dx.doi.org/10.1038/s41598-021-85173-x
_version_ 1783663733985247232
author Cha, Mingyu
Zheng, Hansi
Talukder, Amlan
Barham, Clayton
Li, Xiaoman
Hu, Haiyan
author_facet Cha, Mingyu
Zheng, Hansi
Talukder, Amlan
Barham, Clayton
Li, Xiaoman
Hu, Haiyan
author_sort Cha, Mingyu
collection PubMed
description MicroRNAs (miRNAs) play important roles in post-transcriptional gene regulation and phenotype development. Understanding the regulation of miRNA genes is critical to understand gene regulation. One of the challenges to study miRNA gene regulation is the lack of condition-specific annotation of miRNA transcription start sites (TSSs). Unlike protein-coding genes, miRNA TSSs can be tens of thousands of nucleotides away from the precursor miRNAs and they are hard to be detected by conventional RNA-Seq experiments. A number of studies have been attempted to computationally predict miRNA TSSs. However, high-resolution condition-specific miRNA TSS prediction remains a challenging problem. Recently, deep learning models have been successfully applied to various bioinformatics problems but have not been effectively created for condition-specific miRNA TSS prediction. Here we created a two-stream deep learning model called D-miRT for computational prediction of condition-specific miRNA TSSs (http://hulab.ucf.edu/research/projects/DmiRT/). D-miRT is a natural fit for the integration of low-resolution epigenetic features (DNase-Seq and histone modification data) and high-resolution sequence features. Compared with alternative computational models on different sets of training data, D-miRT outperformed all baseline models and demonstrated high accuracy for condition-specific miRNA TSS prediction tasks. Comparing with the most recent approaches on cell-specific miRNA TSS identification using cell lines that were unseen to the model training processes, D-miRT also showed superior performance.
format Online
Article
Text
id pubmed-7952457
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-79524572021-03-12 A two-stream convolutional neural network for microRNA transcription start site feature integration and identification Cha, Mingyu Zheng, Hansi Talukder, Amlan Barham, Clayton Li, Xiaoman Hu, Haiyan Sci Rep Article MicroRNAs (miRNAs) play important roles in post-transcriptional gene regulation and phenotype development. Understanding the regulation of miRNA genes is critical to understand gene regulation. One of the challenges to study miRNA gene regulation is the lack of condition-specific annotation of miRNA transcription start sites (TSSs). Unlike protein-coding genes, miRNA TSSs can be tens of thousands of nucleotides away from the precursor miRNAs and they are hard to be detected by conventional RNA-Seq experiments. A number of studies have been attempted to computationally predict miRNA TSSs. However, high-resolution condition-specific miRNA TSS prediction remains a challenging problem. Recently, deep learning models have been successfully applied to various bioinformatics problems but have not been effectively created for condition-specific miRNA TSS prediction. Here we created a two-stream deep learning model called D-miRT for computational prediction of condition-specific miRNA TSSs (http://hulab.ucf.edu/research/projects/DmiRT/). D-miRT is a natural fit for the integration of low-resolution epigenetic features (DNase-Seq and histone modification data) and high-resolution sequence features. Compared with alternative computational models on different sets of training data, D-miRT outperformed all baseline models and demonstrated high accuracy for condition-specific miRNA TSS prediction tasks. Comparing with the most recent approaches on cell-specific miRNA TSS identification using cell lines that were unseen to the model training processes, D-miRT also showed superior performance. Nature Publishing Group UK 2021-03-11 /pmc/articles/PMC7952457/ /pubmed/33707582 http://dx.doi.org/10.1038/s41598-021-85173-x Text en © The Author(s) 2021 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Cha, Mingyu
Zheng, Hansi
Talukder, Amlan
Barham, Clayton
Li, Xiaoman
Hu, Haiyan
A two-stream convolutional neural network for microRNA transcription start site feature integration and identification
title A two-stream convolutional neural network for microRNA transcription start site feature integration and identification
title_full A two-stream convolutional neural network for microRNA transcription start site feature integration and identification
title_fullStr A two-stream convolutional neural network for microRNA transcription start site feature integration and identification
title_full_unstemmed A two-stream convolutional neural network for microRNA transcription start site feature integration and identification
title_short A two-stream convolutional neural network for microRNA transcription start site feature integration and identification
title_sort two-stream convolutional neural network for microrna transcription start site feature integration and identification
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7952457/
https://www.ncbi.nlm.nih.gov/pubmed/33707582
http://dx.doi.org/10.1038/s41598-021-85173-x
work_keys_str_mv AT chamingyu atwostreamconvolutionalneuralnetworkformicrornatranscriptionstartsitefeatureintegrationandidentification
AT zhenghansi atwostreamconvolutionalneuralnetworkformicrornatranscriptionstartsitefeatureintegrationandidentification
AT talukderamlan atwostreamconvolutionalneuralnetworkformicrornatranscriptionstartsitefeatureintegrationandidentification
AT barhamclayton atwostreamconvolutionalneuralnetworkformicrornatranscriptionstartsitefeatureintegrationandidentification
AT lixiaoman atwostreamconvolutionalneuralnetworkformicrornatranscriptionstartsitefeatureintegrationandidentification
AT huhaiyan atwostreamconvolutionalneuralnetworkformicrornatranscriptionstartsitefeatureintegrationandidentification
AT chamingyu twostreamconvolutionalneuralnetworkformicrornatranscriptionstartsitefeatureintegrationandidentification
AT zhenghansi twostreamconvolutionalneuralnetworkformicrornatranscriptionstartsitefeatureintegrationandidentification
AT talukderamlan twostreamconvolutionalneuralnetworkformicrornatranscriptionstartsitefeatureintegrationandidentification
AT barhamclayton twostreamconvolutionalneuralnetworkformicrornatranscriptionstartsitefeatureintegrationandidentification
AT lixiaoman twostreamconvolutionalneuralnetworkformicrornatranscriptionstartsitefeatureintegrationandidentification
AT huhaiyan twostreamconvolutionalneuralnetworkformicrornatranscriptionstartsitefeatureintegrationandidentification