Cargando…
aliFreeFoldMulti: alignment-free method to predict secondary structures of multiple RNA homologs
Predicting RNA structure is crucial for understanding RNA’s mechanism of action. Comparative approaches for the prediction of RNA structures can be classified into four main strategies. The three first—align-and-fold, align-then-fold and fold-then-align—exploit multiple sequence alignments to improv...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7671329/ https://www.ncbi.nlm.nih.gov/pubmed/33575631 http://dx.doi.org/10.1093/nargab/lqaa086 |
_version_ | 1783610909237706752 |
---|---|
author | Bossanyi, Marc-André Carpentier, Valentin Glouzon, Jean-Pierre S Ouangraoua, Aïda Anselmetti, Yoann |
author_facet | Bossanyi, Marc-André Carpentier, Valentin Glouzon, Jean-Pierre S Ouangraoua, Aïda Anselmetti, Yoann |
author_sort | Bossanyi, Marc-André |
collection | PubMed |
description | Predicting RNA structure is crucial for understanding RNA’s mechanism of action. Comparative approaches for the prediction of RNA structures can be classified into four main strategies. The three first—align-and-fold, align-then-fold and fold-then-align—exploit multiple sequence alignments to improve the accuracy of conserved RNA-structure prediction. Align-and-fold methods perform generally better, but are also typically slower than the other alignment-based methods. The fourth strategy—alignment-free—consists in predicting the conserved RNA structure without relying on sequence alignment. This strategy has the advantage of being the faster, while predicting accurate structures through the use of latent representations of the candidate structures for each sequence. This paper presents aliFreeFoldMulti, an extension of the aliFreeFold algorithm. This algorithm predicts a representative secondary structure of multiple RNA homologs by using a vector representation of their suboptimal structures. aliFreeFoldMulti improves on aliFreeFold by additionally computing the conserved structure for each sequence. aliFreeFoldMulti is assessed by comparing its prediction performance and time efficiency with a set of leading RNA-structure prediction methods. aliFreeFoldMulti has the lowest computing times and the highest maximum accuracy scores. It achieves comparable average structure prediction accuracy as other methods, except TurboFoldII which is the best in terms of average accuracy but with the highest computing times. We present aliFreeFoldMulti as an illustration of the potential of alignment-free approaches to provide fast and accurate RNA-structure prediction methods. |
format | Online Article Text |
id | pubmed-7671329 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-76713292021-02-10 aliFreeFoldMulti: alignment-free method to predict secondary structures of multiple RNA homologs Bossanyi, Marc-André Carpentier, Valentin Glouzon, Jean-Pierre S Ouangraoua, Aïda Anselmetti, Yoann NAR Genom Bioinform Standard Article Predicting RNA structure is crucial for understanding RNA’s mechanism of action. Comparative approaches for the prediction of RNA structures can be classified into four main strategies. The three first—align-and-fold, align-then-fold and fold-then-align—exploit multiple sequence alignments to improve the accuracy of conserved RNA-structure prediction. Align-and-fold methods perform generally better, but are also typically slower than the other alignment-based methods. The fourth strategy—alignment-free—consists in predicting the conserved RNA structure without relying on sequence alignment. This strategy has the advantage of being the faster, while predicting accurate structures through the use of latent representations of the candidate structures for each sequence. This paper presents aliFreeFoldMulti, an extension of the aliFreeFold algorithm. This algorithm predicts a representative secondary structure of multiple RNA homologs by using a vector representation of their suboptimal structures. aliFreeFoldMulti improves on aliFreeFold by additionally computing the conserved structure for each sequence. aliFreeFoldMulti is assessed by comparing its prediction performance and time efficiency with a set of leading RNA-structure prediction methods. aliFreeFoldMulti has the lowest computing times and the highest maximum accuracy scores. It achieves comparable average structure prediction accuracy as other methods, except TurboFoldII which is the best in terms of average accuracy but with the highest computing times. We present aliFreeFoldMulti as an illustration of the potential of alignment-free approaches to provide fast and accurate RNA-structure prediction methods. Oxford University Press 2020-10-27 /pmc/articles/PMC7671329/ /pubmed/33575631 http://dx.doi.org/10.1093/nargab/lqaa086 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Standard Article Bossanyi, Marc-André Carpentier, Valentin Glouzon, Jean-Pierre S Ouangraoua, Aïda Anselmetti, Yoann aliFreeFoldMulti: alignment-free method to predict secondary structures of multiple RNA homologs |
title | aliFreeFoldMulti: alignment-free method to predict secondary structures of multiple RNA homologs |
title_full | aliFreeFoldMulti: alignment-free method to predict secondary structures of multiple RNA homologs |
title_fullStr | aliFreeFoldMulti: alignment-free method to predict secondary structures of multiple RNA homologs |
title_full_unstemmed | aliFreeFoldMulti: alignment-free method to predict secondary structures of multiple RNA homologs |
title_short | aliFreeFoldMulti: alignment-free method to predict secondary structures of multiple RNA homologs |
title_sort | alifreefoldmulti: alignment-free method to predict secondary structures of multiple rna homologs |
topic | Standard Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7671329/ https://www.ncbi.nlm.nih.gov/pubmed/33575631 http://dx.doi.org/10.1093/nargab/lqaa086 |
work_keys_str_mv | AT bossanyimarcandre alifreefoldmultialignmentfreemethodtopredictsecondarystructuresofmultiplernahomologs AT carpentiervalentin alifreefoldmultialignmentfreemethodtopredictsecondarystructuresofmultiplernahomologs AT glouzonjeanpierres alifreefoldmultialignmentfreemethodtopredictsecondarystructuresofmultiplernahomologs AT ouangraouaaida alifreefoldmultialignmentfreemethodtopredictsecondarystructuresofmultiplernahomologs AT anselmettiyoann alifreefoldmultialignmentfreemethodtopredictsecondarystructuresofmultiplernahomologs |