Cargando…
Circ-LocNet: A Computational Framework for Circular RNA Sub-Cellular Localization Prediction
Circular ribonucleic acids (circRNAs) are novel non-coding RNAs that emanate from alternative splicing of precursor mRNA in reversed order across exons. Despite the abundant presence of circRNAs in human genes and their involvement in diverse physiological processes, the functionality of most circRN...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9329987/ https://www.ncbi.nlm.nih.gov/pubmed/35897818 http://dx.doi.org/10.3390/ijms23158221 |
_version_ | 1784758050607857664 |
---|---|
author | Asim, Muhammad Nabeel Ibrahim, Muhammad Ali Imran Malik, Muhammad Dengel, Andreas Ahmed, Sheraz |
author_facet | Asim, Muhammad Nabeel Ibrahim, Muhammad Ali Imran Malik, Muhammad Dengel, Andreas Ahmed, Sheraz |
author_sort | Asim, Muhammad Nabeel |
collection | PubMed |
description | Circular ribonucleic acids (circRNAs) are novel non-coding RNAs that emanate from alternative splicing of precursor mRNA in reversed order across exons. Despite the abundant presence of circRNAs in human genes and their involvement in diverse physiological processes, the functionality of most circRNAs remains a mystery. Like other non-coding RNAs, sub-cellular localization knowledge of circRNAs has the aptitude to demystify the influence of circRNAs on protein synthesis, degradation, destination, their association with different diseases, and potential for drug development. To date, wet experimental approaches are being used to detect sub-cellular locations of circular RNAs. These approaches help to elucidate the role of circRNAs as protein scaffolds, RNA-binding protein (RBP) sponges, micro-RNA (miRNA) sponges, parental gene expression modifiers, alternative splicing regulators, and transcription regulators. To complement wet-lab experiments, considering the progress made by machine learning approaches for the determination of sub-cellular localization of other non-coding RNAs, the paper in hand develops a computational framework, Circ-LocNet, to precisely detect circRNA sub-cellular localization. Circ-LocNet performs comprehensive extrinsic evaluation of 7 residue frequency-based, residue order and frequency-based, and physio-chemical property-based sequence descriptors using the five most widely used machine learning classifiers. Further, it explores the performance impact of K-order sequence descriptor fusion where it ensembles similar as well dissimilar genres of statistical representation learning approaches to reap the combined benefits. Considering the diversity of statistical representation learning schemes, it assesses the performance of second-order, third-order, and going all the way up to seventh-order sequence descriptor fusion. A comprehensive empirical evaluation of Circ-LocNet over a newly developed benchmark dataset using different settings reveals that standalone residue frequency-based sequence descriptors and tree-based classifiers are more suitable to predict sub-cellular localization of circular RNAs. Further, K-order heterogeneous sequence descriptors fusion in combination with tree-based classifiers most accurately predict sub-cellular localization of circular RNAs. We anticipate this study will act as a rich baseline and push the development of robust computational methodologies for the accurate sub-cellular localization determination of novel circRNAs. |
format | Online Article Text |
id | pubmed-9329987 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-93299872022-07-29 Circ-LocNet: A Computational Framework for Circular RNA Sub-Cellular Localization Prediction Asim, Muhammad Nabeel Ibrahim, Muhammad Ali Imran Malik, Muhammad Dengel, Andreas Ahmed, Sheraz Int J Mol Sci Article Circular ribonucleic acids (circRNAs) are novel non-coding RNAs that emanate from alternative splicing of precursor mRNA in reversed order across exons. Despite the abundant presence of circRNAs in human genes and their involvement in diverse physiological processes, the functionality of most circRNAs remains a mystery. Like other non-coding RNAs, sub-cellular localization knowledge of circRNAs has the aptitude to demystify the influence of circRNAs on protein synthesis, degradation, destination, their association with different diseases, and potential for drug development. To date, wet experimental approaches are being used to detect sub-cellular locations of circular RNAs. These approaches help to elucidate the role of circRNAs as protein scaffolds, RNA-binding protein (RBP) sponges, micro-RNA (miRNA) sponges, parental gene expression modifiers, alternative splicing regulators, and transcription regulators. To complement wet-lab experiments, considering the progress made by machine learning approaches for the determination of sub-cellular localization of other non-coding RNAs, the paper in hand develops a computational framework, Circ-LocNet, to precisely detect circRNA sub-cellular localization. Circ-LocNet performs comprehensive extrinsic evaluation of 7 residue frequency-based, residue order and frequency-based, and physio-chemical property-based sequence descriptors using the five most widely used machine learning classifiers. Further, it explores the performance impact of K-order sequence descriptor fusion where it ensembles similar as well dissimilar genres of statistical representation learning approaches to reap the combined benefits. Considering the diversity of statistical representation learning schemes, it assesses the performance of second-order, third-order, and going all the way up to seventh-order sequence descriptor fusion. A comprehensive empirical evaluation of Circ-LocNet over a newly developed benchmark dataset using different settings reveals that standalone residue frequency-based sequence descriptors and tree-based classifiers are more suitable to predict sub-cellular localization of circular RNAs. Further, K-order heterogeneous sequence descriptors fusion in combination with tree-based classifiers most accurately predict sub-cellular localization of circular RNAs. We anticipate this study will act as a rich baseline and push the development of robust computational methodologies for the accurate sub-cellular localization determination of novel circRNAs. MDPI 2022-07-26 /pmc/articles/PMC9329987/ /pubmed/35897818 http://dx.doi.org/10.3390/ijms23158221 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Asim, Muhammad Nabeel Ibrahim, Muhammad Ali Imran Malik, Muhammad Dengel, Andreas Ahmed, Sheraz Circ-LocNet: A Computational Framework for Circular RNA Sub-Cellular Localization Prediction |
title | Circ-LocNet: A Computational Framework for Circular RNA Sub-Cellular Localization Prediction |
title_full | Circ-LocNet: A Computational Framework for Circular RNA Sub-Cellular Localization Prediction |
title_fullStr | Circ-LocNet: A Computational Framework for Circular RNA Sub-Cellular Localization Prediction |
title_full_unstemmed | Circ-LocNet: A Computational Framework for Circular RNA Sub-Cellular Localization Prediction |
title_short | Circ-LocNet: A Computational Framework for Circular RNA Sub-Cellular Localization Prediction |
title_sort | circ-locnet: a computational framework for circular rna sub-cellular localization prediction |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9329987/ https://www.ncbi.nlm.nih.gov/pubmed/35897818 http://dx.doi.org/10.3390/ijms23158221 |
work_keys_str_mv | AT asimmuhammadnabeel circlocnetacomputationalframeworkforcircularrnasubcellularlocalizationprediction AT ibrahimmuhammadali circlocnetacomputationalframeworkforcircularrnasubcellularlocalizationprediction AT imranmalikmuhammad circlocnetacomputationalframeworkforcircularrnasubcellularlocalizationprediction AT dengelandreas circlocnetacomputationalframeworkforcircularrnasubcellularlocalizationprediction AT ahmedsheraz circlocnetacomputationalframeworkforcircularrnasubcellularlocalizationprediction |