Cargando…

MirLocPredictor: A ConvNet-Based Multi-Label MicroRNA Subcellular Localization Predictor by Incorporating k-Mer Positional Information

MicroRNAs (miRNA) are small noncoding RNA sequences consisting of about 22 nucleotides that are involved in the regulation of almost 60% of mammalian genes. Presently, there are very limited approaches for the visualization of miRNA locations present inside cells to support the elucidation of pathwa...

Descripción completa

Detalles Bibliográficos
Autores principales: Asim, Muhammad Nabeel, Malik, Muhammad Imran, Zehe, Christoph, Trygg, Johan, Dengel, Andreas, Ahmed, Sheraz
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7763197/
https://www.ncbi.nlm.nih.gov/pubmed/33316943
http://dx.doi.org/10.3390/genes11121475
_version_ 1783627960725536768
author Asim, Muhammad Nabeel
Malik, Muhammad Imran
Zehe, Christoph
Trygg, Johan
Dengel, Andreas
Ahmed, Sheraz
author_facet Asim, Muhammad Nabeel
Malik, Muhammad Imran
Zehe, Christoph
Trygg, Johan
Dengel, Andreas
Ahmed, Sheraz
author_sort Asim, Muhammad Nabeel
collection PubMed
description MicroRNAs (miRNA) are small noncoding RNA sequences consisting of about 22 nucleotides that are involved in the regulation of almost 60% of mammalian genes. Presently, there are very limited approaches for the visualization of miRNA locations present inside cells to support the elucidation of pathways and mechanisms behind miRNA function, transport, and biogenesis. MIRLocator, a state-of-the-art tool for the prediction of subcellular localization of miRNAs makes use of a sequence-to-sequence model along with pretrained k-mer embeddings. Existing pretrained k-mer embedding generation methodologies focus on the extraction of semantics of k-mers. However, in RNA sequences, positional information of nucleotides is more important because distinct positions of the four nucleotides define the function of an RNA molecule. Considering the importance of the nucleotide position, we propose a novel approach (kmerPR2vec) which is a fusion of positional information of k-mers with randomly initialized neural k-mer embeddings. In contrast to existing k-mer-based representation, the proposed kmerPR2vec representation is much more rich in terms of semantic information and has more discriminative power. Using novel kmerPR2vec representation, we further present an end-to-end system (MirLocPredictor) which couples the discriminative power of kmerPR2vec with Convolutional Neural Networks (CNNs) for miRNA subcellular location prediction. The effectiveness of the proposed kmerPR2vec approach is evaluated with deep learning-based topologies (i.e., Convolutional Neural Networks (CNN) and Recurrent Neural Network (RNN)) and by using 9 different evaluation measures. Analysis of the results reveals that MirLocPredictor outperform state-of-the-art methods with a significant margin of 18% and 19% in terms of precision and recall.
format Online
Article
Text
id pubmed-7763197
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-77631972020-12-27 MirLocPredictor: A ConvNet-Based Multi-Label MicroRNA Subcellular Localization Predictor by Incorporating k-Mer Positional Information Asim, Muhammad Nabeel Malik, Muhammad Imran Zehe, Christoph Trygg, Johan Dengel, Andreas Ahmed, Sheraz Genes (Basel) Article MicroRNAs (miRNA) are small noncoding RNA sequences consisting of about 22 nucleotides that are involved in the regulation of almost 60% of mammalian genes. Presently, there are very limited approaches for the visualization of miRNA locations present inside cells to support the elucidation of pathways and mechanisms behind miRNA function, transport, and biogenesis. MIRLocator, a state-of-the-art tool for the prediction of subcellular localization of miRNAs makes use of a sequence-to-sequence model along with pretrained k-mer embeddings. Existing pretrained k-mer embedding generation methodologies focus on the extraction of semantics of k-mers. However, in RNA sequences, positional information of nucleotides is more important because distinct positions of the four nucleotides define the function of an RNA molecule. Considering the importance of the nucleotide position, we propose a novel approach (kmerPR2vec) which is a fusion of positional information of k-mers with randomly initialized neural k-mer embeddings. In contrast to existing k-mer-based representation, the proposed kmerPR2vec representation is much more rich in terms of semantic information and has more discriminative power. Using novel kmerPR2vec representation, we further present an end-to-end system (MirLocPredictor) which couples the discriminative power of kmerPR2vec with Convolutional Neural Networks (CNNs) for miRNA subcellular location prediction. The effectiveness of the proposed kmerPR2vec approach is evaluated with deep learning-based topologies (i.e., Convolutional Neural Networks (CNN) and Recurrent Neural Network (RNN)) and by using 9 different evaluation measures. Analysis of the results reveals that MirLocPredictor outperform state-of-the-art methods with a significant margin of 18% and 19% in terms of precision and recall. MDPI 2020-12-09 /pmc/articles/PMC7763197/ /pubmed/33316943 http://dx.doi.org/10.3390/genes11121475 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Asim, Muhammad Nabeel
Malik, Muhammad Imran
Zehe, Christoph
Trygg, Johan
Dengel, Andreas
Ahmed, Sheraz
MirLocPredictor: A ConvNet-Based Multi-Label MicroRNA Subcellular Localization Predictor by Incorporating k-Mer Positional Information
title MirLocPredictor: A ConvNet-Based Multi-Label MicroRNA Subcellular Localization Predictor by Incorporating k-Mer Positional Information
title_full MirLocPredictor: A ConvNet-Based Multi-Label MicroRNA Subcellular Localization Predictor by Incorporating k-Mer Positional Information
title_fullStr MirLocPredictor: A ConvNet-Based Multi-Label MicroRNA Subcellular Localization Predictor by Incorporating k-Mer Positional Information
title_full_unstemmed MirLocPredictor: A ConvNet-Based Multi-Label MicroRNA Subcellular Localization Predictor by Incorporating k-Mer Positional Information
title_short MirLocPredictor: A ConvNet-Based Multi-Label MicroRNA Subcellular Localization Predictor by Incorporating k-Mer Positional Information
title_sort mirlocpredictor: a convnet-based multi-label microrna subcellular localization predictor by incorporating k-mer positional information
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7763197/
https://www.ncbi.nlm.nih.gov/pubmed/33316943
http://dx.doi.org/10.3390/genes11121475
work_keys_str_mv AT asimmuhammadnabeel mirlocpredictoraconvnetbasedmultilabelmicrornasubcellularlocalizationpredictorbyincorporatingkmerpositionalinformation
AT malikmuhammadimran mirlocpredictoraconvnetbasedmultilabelmicrornasubcellularlocalizationpredictorbyincorporatingkmerpositionalinformation
AT zehechristoph mirlocpredictoraconvnetbasedmultilabelmicrornasubcellularlocalizationpredictorbyincorporatingkmerpositionalinformation
AT tryggjohan mirlocpredictoraconvnetbasedmultilabelmicrornasubcellularlocalizationpredictorbyincorporatingkmerpositionalinformation
AT dengelandreas mirlocpredictoraconvnetbasedmultilabelmicrornasubcellularlocalizationpredictorbyincorporatingkmerpositionalinformation
AT ahmedsheraz mirlocpredictoraconvnetbasedmultilabelmicrornasubcellularlocalizationpredictorbyincorporatingkmerpositionalinformation