Cargando…
A two-stream convolutional neural network for microRNA transcription start site feature integration and identification
MicroRNAs (miRNAs) play important roles in post-transcriptional gene regulation and phenotype development. Understanding the regulation of miRNA genes is critical to understand gene regulation. One of the challenges to study miRNA gene regulation is the lack of condition-specific annotation of miRNA...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7952457/ https://www.ncbi.nlm.nih.gov/pubmed/33707582 http://dx.doi.org/10.1038/s41598-021-85173-x |
_version_ | 1783663733985247232 |
---|---|
author | Cha, Mingyu Zheng, Hansi Talukder, Amlan Barham, Clayton Li, Xiaoman Hu, Haiyan |
author_facet | Cha, Mingyu Zheng, Hansi Talukder, Amlan Barham, Clayton Li, Xiaoman Hu, Haiyan |
author_sort | Cha, Mingyu |
collection | PubMed |
description | MicroRNAs (miRNAs) play important roles in post-transcriptional gene regulation and phenotype development. Understanding the regulation of miRNA genes is critical to understand gene regulation. One of the challenges to study miRNA gene regulation is the lack of condition-specific annotation of miRNA transcription start sites (TSSs). Unlike protein-coding genes, miRNA TSSs can be tens of thousands of nucleotides away from the precursor miRNAs and they are hard to be detected by conventional RNA-Seq experiments. A number of studies have been attempted to computationally predict miRNA TSSs. However, high-resolution condition-specific miRNA TSS prediction remains a challenging problem. Recently, deep learning models have been successfully applied to various bioinformatics problems but have not been effectively created for condition-specific miRNA TSS prediction. Here we created a two-stream deep learning model called D-miRT for computational prediction of condition-specific miRNA TSSs (http://hulab.ucf.edu/research/projects/DmiRT/). D-miRT is a natural fit for the integration of low-resolution epigenetic features (DNase-Seq and histone modification data) and high-resolution sequence features. Compared with alternative computational models on different sets of training data, D-miRT outperformed all baseline models and demonstrated high accuracy for condition-specific miRNA TSS prediction tasks. Comparing with the most recent approaches on cell-specific miRNA TSS identification using cell lines that were unseen to the model training processes, D-miRT also showed superior performance. |
format | Online Article Text |
id | pubmed-7952457 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-79524572021-03-12 A two-stream convolutional neural network for microRNA transcription start site feature integration and identification Cha, Mingyu Zheng, Hansi Talukder, Amlan Barham, Clayton Li, Xiaoman Hu, Haiyan Sci Rep Article MicroRNAs (miRNAs) play important roles in post-transcriptional gene regulation and phenotype development. Understanding the regulation of miRNA genes is critical to understand gene regulation. One of the challenges to study miRNA gene regulation is the lack of condition-specific annotation of miRNA transcription start sites (TSSs). Unlike protein-coding genes, miRNA TSSs can be tens of thousands of nucleotides away from the precursor miRNAs and they are hard to be detected by conventional RNA-Seq experiments. A number of studies have been attempted to computationally predict miRNA TSSs. However, high-resolution condition-specific miRNA TSS prediction remains a challenging problem. Recently, deep learning models have been successfully applied to various bioinformatics problems but have not been effectively created for condition-specific miRNA TSS prediction. Here we created a two-stream deep learning model called D-miRT for computational prediction of condition-specific miRNA TSSs (http://hulab.ucf.edu/research/projects/DmiRT/). D-miRT is a natural fit for the integration of low-resolution epigenetic features (DNase-Seq and histone modification data) and high-resolution sequence features. Compared with alternative computational models on different sets of training data, D-miRT outperformed all baseline models and demonstrated high accuracy for condition-specific miRNA TSS prediction tasks. Comparing with the most recent approaches on cell-specific miRNA TSS identification using cell lines that were unseen to the model training processes, D-miRT also showed superior performance. Nature Publishing Group UK 2021-03-11 /pmc/articles/PMC7952457/ /pubmed/33707582 http://dx.doi.org/10.1038/s41598-021-85173-x Text en © The Author(s) 2021 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Cha, Mingyu Zheng, Hansi Talukder, Amlan Barham, Clayton Li, Xiaoman Hu, Haiyan A two-stream convolutional neural network for microRNA transcription start site feature integration and identification |
title | A two-stream convolutional neural network for microRNA transcription start site feature integration and identification |
title_full | A two-stream convolutional neural network for microRNA transcription start site feature integration and identification |
title_fullStr | A two-stream convolutional neural network for microRNA transcription start site feature integration and identification |
title_full_unstemmed | A two-stream convolutional neural network for microRNA transcription start site feature integration and identification |
title_short | A two-stream convolutional neural network for microRNA transcription start site feature integration and identification |
title_sort | two-stream convolutional neural network for microrna transcription start site feature integration and identification |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7952457/ https://www.ncbi.nlm.nih.gov/pubmed/33707582 http://dx.doi.org/10.1038/s41598-021-85173-x |
work_keys_str_mv | AT chamingyu atwostreamconvolutionalneuralnetworkformicrornatranscriptionstartsitefeatureintegrationandidentification AT zhenghansi atwostreamconvolutionalneuralnetworkformicrornatranscriptionstartsitefeatureintegrationandidentification AT talukderamlan atwostreamconvolutionalneuralnetworkformicrornatranscriptionstartsitefeatureintegrationandidentification AT barhamclayton atwostreamconvolutionalneuralnetworkformicrornatranscriptionstartsitefeatureintegrationandidentification AT lixiaoman atwostreamconvolutionalneuralnetworkformicrornatranscriptionstartsitefeatureintegrationandidentification AT huhaiyan atwostreamconvolutionalneuralnetworkformicrornatranscriptionstartsitefeatureintegrationandidentification AT chamingyu twostreamconvolutionalneuralnetworkformicrornatranscriptionstartsitefeatureintegrationandidentification AT zhenghansi twostreamconvolutionalneuralnetworkformicrornatranscriptionstartsitefeatureintegrationandidentification AT talukderamlan twostreamconvolutionalneuralnetworkformicrornatranscriptionstartsitefeatureintegrationandidentification AT barhamclayton twostreamconvolutionalneuralnetworkformicrornatranscriptionstartsitefeatureintegrationandidentification AT lixiaoman twostreamconvolutionalneuralnetworkformicrornatranscriptionstartsitefeatureintegrationandidentification AT huhaiyan twostreamconvolutionalneuralnetworkformicrornatranscriptionstartsitefeatureintegrationandidentification |