Cargando…

Predicting Drosha and Dicer Cleavage Sites with DeepMirCut

MicroRNAs are a class of small RNAs involved in post-transcriptional gene silencing with roles in disease and development. Many computational tools have been developed to identify novel microRNAs. However, there have been no attempts to predict cleavage sites for Drosha from primary sequence, or to...

Descripción completa

Detalles Bibliográficos
Autores principales: Bell, Jimmy, Hendrix, David A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8819831/
https://www.ncbi.nlm.nih.gov/pubmed/35141278
http://dx.doi.org/10.3389/fmolb.2021.799056
_version_ 1784646132234715136
author Bell, Jimmy
Hendrix, David A.
author_facet Bell, Jimmy
Hendrix, David A.
author_sort Bell, Jimmy
collection PubMed
description MicroRNAs are a class of small RNAs involved in post-transcriptional gene silencing with roles in disease and development. Many computational tools have been developed to identify novel microRNAs. However, there have been no attempts to predict cleavage sites for Drosha from primary sequence, or to identify cleavage sites using deep neural networks. Here, we present DeepMirCut, a recurrent neural network-based software that predicts both Dicer and Drosha cleavage sites. We built a microRNA primary sequence database including flanking genomic sequences for 34,713 microRNA annotations. We compare models trained on sequence data, sequence and secondary structure data, as well as input data with annotated structures. Our best model is able to predict cuts within closer average proximity than results reported for other methods. We show that a guanine nucleotide before and a uracil nucleotide after Dicer cleavage sites on the 3′ arm of the microRNA precursor had a positive effect on predictions while the opposite order (U before, G after) had a negative effect. Our analysis was also able to predict several positions where bulges had either positive or negative effects on the score. We expect that our approach and the data we have curated will enable several future studies.
format Online
Article
Text
id pubmed-8819831
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-88198312022-02-08 Predicting Drosha and Dicer Cleavage Sites with DeepMirCut Bell, Jimmy Hendrix, David A. Front Mol Biosci Molecular Biosciences MicroRNAs are a class of small RNAs involved in post-transcriptional gene silencing with roles in disease and development. Many computational tools have been developed to identify novel microRNAs. However, there have been no attempts to predict cleavage sites for Drosha from primary sequence, or to identify cleavage sites using deep neural networks. Here, we present DeepMirCut, a recurrent neural network-based software that predicts both Dicer and Drosha cleavage sites. We built a microRNA primary sequence database including flanking genomic sequences for 34,713 microRNA annotations. We compare models trained on sequence data, sequence and secondary structure data, as well as input data with annotated structures. Our best model is able to predict cuts within closer average proximity than results reported for other methods. We show that a guanine nucleotide before and a uracil nucleotide after Dicer cleavage sites on the 3′ arm of the microRNA precursor had a positive effect on predictions while the opposite order (U before, G after) had a negative effect. Our analysis was also able to predict several positions where bulges had either positive or negative effects on the score. We expect that our approach and the data we have curated will enable several future studies. Frontiers Media S.A. 2022-01-24 /pmc/articles/PMC8819831/ /pubmed/35141278 http://dx.doi.org/10.3389/fmolb.2021.799056 Text en Copyright © 2022 Bell and Hendrix. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Molecular Biosciences
Bell, Jimmy
Hendrix, David A.
Predicting Drosha and Dicer Cleavage Sites with DeepMirCut
title Predicting Drosha and Dicer Cleavage Sites with DeepMirCut
title_full Predicting Drosha and Dicer Cleavage Sites with DeepMirCut
title_fullStr Predicting Drosha and Dicer Cleavage Sites with DeepMirCut
title_full_unstemmed Predicting Drosha and Dicer Cleavage Sites with DeepMirCut
title_short Predicting Drosha and Dicer Cleavage Sites with DeepMirCut
title_sort predicting drosha and dicer cleavage sites with deepmircut
topic Molecular Biosciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8819831/
https://www.ncbi.nlm.nih.gov/pubmed/35141278
http://dx.doi.org/10.3389/fmolb.2021.799056
work_keys_str_mv AT belljimmy predictingdroshaanddicercleavagesiteswithdeepmircut
AT hendrixdavida predictingdroshaanddicercleavagesiteswithdeepmircut