Cargando…
Predicting Drosha and Dicer Cleavage Sites with DeepMirCut
MicroRNAs are a class of small RNAs involved in post-transcriptional gene silencing with roles in disease and development. Many computational tools have been developed to identify novel microRNAs. However, there have been no attempts to predict cleavage sites for Drosha from primary sequence, or to...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8819831/ https://www.ncbi.nlm.nih.gov/pubmed/35141278 http://dx.doi.org/10.3389/fmolb.2021.799056 |
_version_ | 1784646132234715136 |
---|---|
author | Bell, Jimmy Hendrix, David A. |
author_facet | Bell, Jimmy Hendrix, David A. |
author_sort | Bell, Jimmy |
collection | PubMed |
description | MicroRNAs are a class of small RNAs involved in post-transcriptional gene silencing with roles in disease and development. Many computational tools have been developed to identify novel microRNAs. However, there have been no attempts to predict cleavage sites for Drosha from primary sequence, or to identify cleavage sites using deep neural networks. Here, we present DeepMirCut, a recurrent neural network-based software that predicts both Dicer and Drosha cleavage sites. We built a microRNA primary sequence database including flanking genomic sequences for 34,713 microRNA annotations. We compare models trained on sequence data, sequence and secondary structure data, as well as input data with annotated structures. Our best model is able to predict cuts within closer average proximity than results reported for other methods. We show that a guanine nucleotide before and a uracil nucleotide after Dicer cleavage sites on the 3′ arm of the microRNA precursor had a positive effect on predictions while the opposite order (U before, G after) had a negative effect. Our analysis was also able to predict several positions where bulges had either positive or negative effects on the score. We expect that our approach and the data we have curated will enable several future studies. |
format | Online Article Text |
id | pubmed-8819831 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-88198312022-02-08 Predicting Drosha and Dicer Cleavage Sites with DeepMirCut Bell, Jimmy Hendrix, David A. Front Mol Biosci Molecular Biosciences MicroRNAs are a class of small RNAs involved in post-transcriptional gene silencing with roles in disease and development. Many computational tools have been developed to identify novel microRNAs. However, there have been no attempts to predict cleavage sites for Drosha from primary sequence, or to identify cleavage sites using deep neural networks. Here, we present DeepMirCut, a recurrent neural network-based software that predicts both Dicer and Drosha cleavage sites. We built a microRNA primary sequence database including flanking genomic sequences for 34,713 microRNA annotations. We compare models trained on sequence data, sequence and secondary structure data, as well as input data with annotated structures. Our best model is able to predict cuts within closer average proximity than results reported for other methods. We show that a guanine nucleotide before and a uracil nucleotide after Dicer cleavage sites on the 3′ arm of the microRNA precursor had a positive effect on predictions while the opposite order (U before, G after) had a negative effect. Our analysis was also able to predict several positions where bulges had either positive or negative effects on the score. We expect that our approach and the data we have curated will enable several future studies. Frontiers Media S.A. 2022-01-24 /pmc/articles/PMC8819831/ /pubmed/35141278 http://dx.doi.org/10.3389/fmolb.2021.799056 Text en Copyright © 2022 Bell and Hendrix. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Molecular Biosciences Bell, Jimmy Hendrix, David A. Predicting Drosha and Dicer Cleavage Sites with DeepMirCut |
title | Predicting Drosha and Dicer Cleavage Sites with DeepMirCut |
title_full | Predicting Drosha and Dicer Cleavage Sites with DeepMirCut |
title_fullStr | Predicting Drosha and Dicer Cleavage Sites with DeepMirCut |
title_full_unstemmed | Predicting Drosha and Dicer Cleavage Sites with DeepMirCut |
title_short | Predicting Drosha and Dicer Cleavage Sites with DeepMirCut |
title_sort | predicting drosha and dicer cleavage sites with deepmircut |
topic | Molecular Biosciences |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8819831/ https://www.ncbi.nlm.nih.gov/pubmed/35141278 http://dx.doi.org/10.3389/fmolb.2021.799056 |
work_keys_str_mv | AT belljimmy predictingdroshaanddicercleavagesiteswithdeepmircut AT hendrixdavida predictingdroshaanddicercleavagesiteswithdeepmircut |