Cargando…
EL-RMLocNet: An explainable LSTM network for RNA-associated multi-compartment localization prediction
Subcellular localization of Ribonucleic Acid (RNA) molecules provide significant insights into the functionality of RNAs and helps to explore their association with various diseases. Predominantly developed single-compartment localization predictors (SCLPs) lack to demystify RNA association with div...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Research Network of Computational and Structural Biotechnology
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9356161/ https://www.ncbi.nlm.nih.gov/pubmed/35983235 http://dx.doi.org/10.1016/j.csbj.2022.07.031 |
_version_ | 1784763455498092544 |
---|---|
author | Asim, Muhammad Nabeel Ibrahim, Muhammad Ali Malik, Muhammad Imran Zehe, Christoph Cloarec, Olivier Trygg, Johan Dengel, Andreas Ahmed, Sheraz |
author_facet | Asim, Muhammad Nabeel Ibrahim, Muhammad Ali Malik, Muhammad Imran Zehe, Christoph Cloarec, Olivier Trygg, Johan Dengel, Andreas Ahmed, Sheraz |
author_sort | Asim, Muhammad Nabeel |
collection | PubMed |
description | Subcellular localization of Ribonucleic Acid (RNA) molecules provide significant insights into the functionality of RNAs and helps to explore their association with various diseases. Predominantly developed single-compartment localization predictors (SCLPs) lack to demystify RNA association with diverse biochemical and pathological processes mainly happen through RNA co-localization in multiple compartments. Limited multi-compartment localization predictors (MCLPs) manage to produce decent performance only for target RNA class of particular sub-type. Further, existing computational approaches have limited practical significance and potential to optimize therapeutics due to the poor degree of model explainability. The paper in hand presents an explainable Long Short-Term Memory (LSTM) network “EL-RMLocNet”, predictive performance and interpretability of which are optimized using a novel GeneticSeq2Vec statistical representation learning scheme and attention mechanism for accurate multi-compartment localization prediction of different RNAs solely using raw RNA sequences. GeneticSeq2Vec generates optimized statistical vectors of raw RNA sequences by capturing short and long range relations of nucleotide k-mers. Using sequence vectors generated by GeneticSeq2Vec scheme, Long Short Term Memory layers extract most informative features, weighting of which on the basis of discriminative potential for accurate multi-compartment localization prediction is performed using attention layer. Through reverse engineering, weights of statistical feature space are mapped to nucleotide k-mers patterns to make multi-compartment localization prediction decision making transparent and explainable for different RNA classes and species. Empirical evaluation indicates that EL-RMLocNet outperforms state-of-the-art predictor for subcellular localization prediction of 4 different RNA classes by an average accuracy figure of 8% for Homo Sapiens species and 6% for Mus Musculus species. EL-RMLocNet is freely available as a web server at (https://sds_genetic_analysis.opendfki.de/subcellular_loc/). |
format | Online Article Text |
id | pubmed-9356161 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Research Network of Computational and Structural Biotechnology |
record_format | MEDLINE/PubMed |
spelling | pubmed-93561612022-08-17 EL-RMLocNet: An explainable LSTM network for RNA-associated multi-compartment localization prediction Asim, Muhammad Nabeel Ibrahim, Muhammad Ali Malik, Muhammad Imran Zehe, Christoph Cloarec, Olivier Trygg, Johan Dengel, Andreas Ahmed, Sheraz Comput Struct Biotechnol J Research Article Subcellular localization of Ribonucleic Acid (RNA) molecules provide significant insights into the functionality of RNAs and helps to explore their association with various diseases. Predominantly developed single-compartment localization predictors (SCLPs) lack to demystify RNA association with diverse biochemical and pathological processes mainly happen through RNA co-localization in multiple compartments. Limited multi-compartment localization predictors (MCLPs) manage to produce decent performance only for target RNA class of particular sub-type. Further, existing computational approaches have limited practical significance and potential to optimize therapeutics due to the poor degree of model explainability. The paper in hand presents an explainable Long Short-Term Memory (LSTM) network “EL-RMLocNet”, predictive performance and interpretability of which are optimized using a novel GeneticSeq2Vec statistical representation learning scheme and attention mechanism for accurate multi-compartment localization prediction of different RNAs solely using raw RNA sequences. GeneticSeq2Vec generates optimized statistical vectors of raw RNA sequences by capturing short and long range relations of nucleotide k-mers. Using sequence vectors generated by GeneticSeq2Vec scheme, Long Short Term Memory layers extract most informative features, weighting of which on the basis of discriminative potential for accurate multi-compartment localization prediction is performed using attention layer. Through reverse engineering, weights of statistical feature space are mapped to nucleotide k-mers patterns to make multi-compartment localization prediction decision making transparent and explainable for different RNA classes and species. Empirical evaluation indicates that EL-RMLocNet outperforms state-of-the-art predictor for subcellular localization prediction of 4 different RNA classes by an average accuracy figure of 8% for Homo Sapiens species and 6% for Mus Musculus species. EL-RMLocNet is freely available as a web server at (https://sds_genetic_analysis.opendfki.de/subcellular_loc/). Research Network of Computational and Structural Biotechnology 2022-07-26 /pmc/articles/PMC9356161/ /pubmed/35983235 http://dx.doi.org/10.1016/j.csbj.2022.07.031 Text en © 2022 The Author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Research Article Asim, Muhammad Nabeel Ibrahim, Muhammad Ali Malik, Muhammad Imran Zehe, Christoph Cloarec, Olivier Trygg, Johan Dengel, Andreas Ahmed, Sheraz EL-RMLocNet: An explainable LSTM network for RNA-associated multi-compartment localization prediction |
title | EL-RMLocNet: An explainable LSTM network for RNA-associated multi-compartment localization prediction |
title_full | EL-RMLocNet: An explainable LSTM network for RNA-associated multi-compartment localization prediction |
title_fullStr | EL-RMLocNet: An explainable LSTM network for RNA-associated multi-compartment localization prediction |
title_full_unstemmed | EL-RMLocNet: An explainable LSTM network for RNA-associated multi-compartment localization prediction |
title_short | EL-RMLocNet: An explainable LSTM network for RNA-associated multi-compartment localization prediction |
title_sort | el-rmlocnet: an explainable lstm network for rna-associated multi-compartment localization prediction |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9356161/ https://www.ncbi.nlm.nih.gov/pubmed/35983235 http://dx.doi.org/10.1016/j.csbj.2022.07.031 |
work_keys_str_mv | AT asimmuhammadnabeel elrmlocnetanexplainablelstmnetworkforrnaassociatedmulticompartmentlocalizationprediction AT ibrahimmuhammadali elrmlocnetanexplainablelstmnetworkforrnaassociatedmulticompartmentlocalizationprediction AT malikmuhammadimran elrmlocnetanexplainablelstmnetworkforrnaassociatedmulticompartmentlocalizationprediction AT zehechristoph elrmlocnetanexplainablelstmnetworkforrnaassociatedmulticompartmentlocalizationprediction AT cloarecolivier elrmlocnetanexplainablelstmnetworkforrnaassociatedmulticompartmentlocalizationprediction AT tryggjohan elrmlocnetanexplainablelstmnetworkforrnaassociatedmulticompartmentlocalizationprediction AT dengelandreas elrmlocnetanexplainablelstmnetworkforrnaassociatedmulticompartmentlocalizationprediction AT ahmedsheraz elrmlocnetanexplainablelstmnetworkforrnaassociatedmulticompartmentlocalizationprediction |