Cargando…

aliFreeFoldMulti: alignment-free method to predict secondary structures of multiple RNA homologs

Predicting RNA structure is crucial for understanding RNA’s mechanism of action. Comparative approaches for the prediction of RNA structures can be classified into four main strategies. The three first—align-and-fold, align-then-fold and fold-then-align—exploit multiple sequence alignments to improv...

Descripción completa

Detalles Bibliográficos
Autores principales: Bossanyi, Marc-André, Carpentier, Valentin, Glouzon, Jean-Pierre S, Ouangraoua, Aïda, Anselmetti, Yoann
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7671329/
https://www.ncbi.nlm.nih.gov/pubmed/33575631
http://dx.doi.org/10.1093/nargab/lqaa086
_version_ 1783610909237706752
author Bossanyi, Marc-André
Carpentier, Valentin
Glouzon, Jean-Pierre S
Ouangraoua, Aïda
Anselmetti, Yoann
author_facet Bossanyi, Marc-André
Carpentier, Valentin
Glouzon, Jean-Pierre S
Ouangraoua, Aïda
Anselmetti, Yoann
author_sort Bossanyi, Marc-André
collection PubMed
description Predicting RNA structure is crucial for understanding RNA’s mechanism of action. Comparative approaches for the prediction of RNA structures can be classified into four main strategies. The three first—align-and-fold, align-then-fold and fold-then-align—exploit multiple sequence alignments to improve the accuracy of conserved RNA-structure prediction. Align-and-fold methods perform generally better, but are also typically slower than the other alignment-based methods. The fourth strategy—alignment-free—consists in predicting the conserved RNA structure without relying on sequence alignment. This strategy has the advantage of being the faster, while predicting accurate structures through the use of latent representations of the candidate structures for each sequence. This paper presents aliFreeFoldMulti, an extension of the aliFreeFold algorithm. This algorithm predicts a representative secondary structure of multiple RNA homologs by using a vector representation of their suboptimal structures. aliFreeFoldMulti improves on aliFreeFold by additionally computing the conserved structure for each sequence. aliFreeFoldMulti is assessed by comparing its prediction performance and time efficiency with a set of leading RNA-structure prediction methods. aliFreeFoldMulti has the lowest computing times and the highest maximum accuracy scores. It achieves comparable average structure prediction accuracy as other methods, except TurboFoldII which is the best in terms of average accuracy but with the highest computing times. We present aliFreeFoldMulti as an illustration of the potential of alignment-free approaches to provide fast and accurate RNA-structure prediction methods.
format Online
Article
Text
id pubmed-7671329
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-76713292021-02-10 aliFreeFoldMulti: alignment-free method to predict secondary structures of multiple RNA homologs Bossanyi, Marc-André Carpentier, Valentin Glouzon, Jean-Pierre S Ouangraoua, Aïda Anselmetti, Yoann NAR Genom Bioinform Standard Article Predicting RNA structure is crucial for understanding RNA’s mechanism of action. Comparative approaches for the prediction of RNA structures can be classified into four main strategies. The three first—align-and-fold, align-then-fold and fold-then-align—exploit multiple sequence alignments to improve the accuracy of conserved RNA-structure prediction. Align-and-fold methods perform generally better, but are also typically slower than the other alignment-based methods. The fourth strategy—alignment-free—consists in predicting the conserved RNA structure without relying on sequence alignment. This strategy has the advantage of being the faster, while predicting accurate structures through the use of latent representations of the candidate structures for each sequence. This paper presents aliFreeFoldMulti, an extension of the aliFreeFold algorithm. This algorithm predicts a representative secondary structure of multiple RNA homologs by using a vector representation of their suboptimal structures. aliFreeFoldMulti improves on aliFreeFold by additionally computing the conserved structure for each sequence. aliFreeFoldMulti is assessed by comparing its prediction performance and time efficiency with a set of leading RNA-structure prediction methods. aliFreeFoldMulti has the lowest computing times and the highest maximum accuracy scores. It achieves comparable average structure prediction accuracy as other methods, except TurboFoldII which is the best in terms of average accuracy but with the highest computing times. We present aliFreeFoldMulti as an illustration of the potential of alignment-free approaches to provide fast and accurate RNA-structure prediction methods. Oxford University Press 2020-10-27 /pmc/articles/PMC7671329/ /pubmed/33575631 http://dx.doi.org/10.1093/nargab/lqaa086 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Standard Article
Bossanyi, Marc-André
Carpentier, Valentin
Glouzon, Jean-Pierre S
Ouangraoua, Aïda
Anselmetti, Yoann
aliFreeFoldMulti: alignment-free method to predict secondary structures of multiple RNA homologs
title aliFreeFoldMulti: alignment-free method to predict secondary structures of multiple RNA homologs
title_full aliFreeFoldMulti: alignment-free method to predict secondary structures of multiple RNA homologs
title_fullStr aliFreeFoldMulti: alignment-free method to predict secondary structures of multiple RNA homologs
title_full_unstemmed aliFreeFoldMulti: alignment-free method to predict secondary structures of multiple RNA homologs
title_short aliFreeFoldMulti: alignment-free method to predict secondary structures of multiple RNA homologs
title_sort alifreefoldmulti: alignment-free method to predict secondary structures of multiple rna homologs
topic Standard Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7671329/
https://www.ncbi.nlm.nih.gov/pubmed/33575631
http://dx.doi.org/10.1093/nargab/lqaa086
work_keys_str_mv AT bossanyimarcandre alifreefoldmultialignmentfreemethodtopredictsecondarystructuresofmultiplernahomologs
AT carpentiervalentin alifreefoldmultialignmentfreemethodtopredictsecondarystructuresofmultiplernahomologs
AT glouzonjeanpierres alifreefoldmultialignmentfreemethodtopredictsecondarystructuresofmultiplernahomologs
AT ouangraouaaida alifreefoldmultialignmentfreemethodtopredictsecondarystructuresofmultiplernahomologs
AT anselmettiyoann alifreefoldmultialignmentfreemethodtopredictsecondarystructuresofmultiplernahomologs