Cargando…

EL-RMLocNet: An explainable LSTM network for RNA-associated multi-compartment localization prediction

Subcellular localization of Ribonucleic Acid (RNA) molecules provide significant insights into the functionality of RNAs and helps to explore their association with various diseases. Predominantly developed single-compartment localization predictors (SCLPs) lack to demystify RNA association with div...

Descripción completa

Detalles Bibliográficos
Autores principales: Asim, Muhammad Nabeel, Ibrahim, Muhammad Ali, Malik, Muhammad Imran, Zehe, Christoph, Cloarec, Olivier, Trygg, Johan, Dengel, Andreas, Ahmed, Sheraz
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9356161/
https://www.ncbi.nlm.nih.gov/pubmed/35983235
http://dx.doi.org/10.1016/j.csbj.2022.07.031
_version_ 1784763455498092544
author Asim, Muhammad Nabeel
Ibrahim, Muhammad Ali
Malik, Muhammad Imran
Zehe, Christoph
Cloarec, Olivier
Trygg, Johan
Dengel, Andreas
Ahmed, Sheraz
author_facet Asim, Muhammad Nabeel
Ibrahim, Muhammad Ali
Malik, Muhammad Imran
Zehe, Christoph
Cloarec, Olivier
Trygg, Johan
Dengel, Andreas
Ahmed, Sheraz
author_sort Asim, Muhammad Nabeel
collection PubMed
description Subcellular localization of Ribonucleic Acid (RNA) molecules provide significant insights into the functionality of RNAs and helps to explore their association with various diseases. Predominantly developed single-compartment localization predictors (SCLPs) lack to demystify RNA association with diverse biochemical and pathological processes mainly happen through RNA co-localization in multiple compartments. Limited multi-compartment localization predictors (MCLPs) manage to produce decent performance only for target RNA class of particular sub-type. Further, existing computational approaches have limited practical significance and potential to optimize therapeutics due to the poor degree of model explainability. The paper in hand presents an explainable Long Short-Term Memory (LSTM) network “EL-RMLocNet”, predictive performance and interpretability of which are optimized using a novel GeneticSeq2Vec statistical representation learning scheme and attention mechanism for accurate multi-compartment localization prediction of different RNAs solely using raw RNA sequences. GeneticSeq2Vec generates optimized statistical vectors of raw RNA sequences by capturing short and long range relations of nucleotide k-mers. Using sequence vectors generated by GeneticSeq2Vec scheme, Long Short Term Memory layers extract most informative features, weighting of which on the basis of discriminative potential for accurate multi-compartment localization prediction is performed using attention layer. Through reverse engineering, weights of statistical feature space are mapped to nucleotide k-mers patterns to make multi-compartment localization prediction decision making transparent and explainable for different RNA classes and species. Empirical evaluation indicates that EL-RMLocNet outperforms state-of-the-art predictor for subcellular localization prediction of 4 different RNA classes by an average accuracy figure of 8% for Homo Sapiens species and 6% for Mus Musculus species. EL-RMLocNet is freely available as a web server at (https://sds_genetic_analysis.opendfki.de/subcellular_loc/).
format Online
Article
Text
id pubmed-9356161
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-93561612022-08-17 EL-RMLocNet: An explainable LSTM network for RNA-associated multi-compartment localization prediction Asim, Muhammad Nabeel Ibrahim, Muhammad Ali Malik, Muhammad Imran Zehe, Christoph Cloarec, Olivier Trygg, Johan Dengel, Andreas Ahmed, Sheraz Comput Struct Biotechnol J Research Article Subcellular localization of Ribonucleic Acid (RNA) molecules provide significant insights into the functionality of RNAs and helps to explore their association with various diseases. Predominantly developed single-compartment localization predictors (SCLPs) lack to demystify RNA association with diverse biochemical and pathological processes mainly happen through RNA co-localization in multiple compartments. Limited multi-compartment localization predictors (MCLPs) manage to produce decent performance only for target RNA class of particular sub-type. Further, existing computational approaches have limited practical significance and potential to optimize therapeutics due to the poor degree of model explainability. The paper in hand presents an explainable Long Short-Term Memory (LSTM) network “EL-RMLocNet”, predictive performance and interpretability of which are optimized using a novel GeneticSeq2Vec statistical representation learning scheme and attention mechanism for accurate multi-compartment localization prediction of different RNAs solely using raw RNA sequences. GeneticSeq2Vec generates optimized statistical vectors of raw RNA sequences by capturing short and long range relations of nucleotide k-mers. Using sequence vectors generated by GeneticSeq2Vec scheme, Long Short Term Memory layers extract most informative features, weighting of which on the basis of discriminative potential for accurate multi-compartment localization prediction is performed using attention layer. Through reverse engineering, weights of statistical feature space are mapped to nucleotide k-mers patterns to make multi-compartment localization prediction decision making transparent and explainable for different RNA classes and species. Empirical evaluation indicates that EL-RMLocNet outperforms state-of-the-art predictor for subcellular localization prediction of 4 different RNA classes by an average accuracy figure of 8% for Homo Sapiens species and 6% for Mus Musculus species. EL-RMLocNet is freely available as a web server at (https://sds_genetic_analysis.opendfki.de/subcellular_loc/). Research Network of Computational and Structural Biotechnology 2022-07-26 /pmc/articles/PMC9356161/ /pubmed/35983235 http://dx.doi.org/10.1016/j.csbj.2022.07.031 Text en © 2022 The Author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Research Article
Asim, Muhammad Nabeel
Ibrahim, Muhammad Ali
Malik, Muhammad Imran
Zehe, Christoph
Cloarec, Olivier
Trygg, Johan
Dengel, Andreas
Ahmed, Sheraz
EL-RMLocNet: An explainable LSTM network for RNA-associated multi-compartment localization prediction
title EL-RMLocNet: An explainable LSTM network for RNA-associated multi-compartment localization prediction
title_full EL-RMLocNet: An explainable LSTM network for RNA-associated multi-compartment localization prediction
title_fullStr EL-RMLocNet: An explainable LSTM network for RNA-associated multi-compartment localization prediction
title_full_unstemmed EL-RMLocNet: An explainable LSTM network for RNA-associated multi-compartment localization prediction
title_short EL-RMLocNet: An explainable LSTM network for RNA-associated multi-compartment localization prediction
title_sort el-rmlocnet: an explainable lstm network for rna-associated multi-compartment localization prediction
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9356161/
https://www.ncbi.nlm.nih.gov/pubmed/35983235
http://dx.doi.org/10.1016/j.csbj.2022.07.031
work_keys_str_mv AT asimmuhammadnabeel elrmlocnetanexplainablelstmnetworkforrnaassociatedmulticompartmentlocalizationprediction
AT ibrahimmuhammadali elrmlocnetanexplainablelstmnetworkforrnaassociatedmulticompartmentlocalizationprediction
AT malikmuhammadimran elrmlocnetanexplainablelstmnetworkforrnaassociatedmulticompartmentlocalizationprediction
AT zehechristoph elrmlocnetanexplainablelstmnetworkforrnaassociatedmulticompartmentlocalizationprediction
AT cloarecolivier elrmlocnetanexplainablelstmnetworkforrnaassociatedmulticompartmentlocalizationprediction
AT tryggjohan elrmlocnetanexplainablelstmnetworkforrnaassociatedmulticompartmentlocalizationprediction
AT dengelandreas elrmlocnetanexplainablelstmnetworkforrnaassociatedmulticompartmentlocalizationprediction
AT ahmedsheraz elrmlocnetanexplainablelstmnetworkforrnaassociatedmulticompartmentlocalizationprediction