Cargando…

aliFreeFold: an alignment-free approach to predict secondary structure from homologous RNA sequences

MOTIVATION: Predicting the conserved secondary structure of homologous ribonucleic acid (RNA) sequences is crucial for understanding RNA functions. However, fast and accurate RNA structure prediction is challenging, especially when the number and the divergence of homologous RNA increases. To addres...

Descripción completa

Detalles Bibliográficos
Autores principales: Glouzon, Jean-Pierre Séhi, Ouangraoua, Aïda
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6022685/
https://www.ncbi.nlm.nih.gov/pubmed/29949960
http://dx.doi.org/10.1093/bioinformatics/bty234
_version_ 1783335730870747136
author Glouzon, Jean-Pierre Séhi
Ouangraoua, Aïda
author_facet Glouzon, Jean-Pierre Séhi
Ouangraoua, Aïda
author_sort Glouzon, Jean-Pierre Séhi
collection PubMed
description MOTIVATION: Predicting the conserved secondary structure of homologous ribonucleic acid (RNA) sequences is crucial for understanding RNA functions. However, fast and accurate RNA structure prediction is challenging, especially when the number and the divergence of homologous RNA increases. To address this challenge, we propose aliFreeFold, based on a novel alignment-free approach which computes a representative structure from a set of homologous RNA sequences using sub-optimal secondary structures generated for each sequence. It is based on a vector representation of sub-optimal structures capturing structure conservation signals by weighting structural motifs according to their conservation across the sub-optimal structures. RESULTS: We demonstrate that aliFreeFold provides a good balance between speed and accuracy regarding predictions of representative structures for sets of homologous RNA compared to traditional methods based on sequence and structure alignment. We show that aliFreeFold is capable of uncovering conserved structural features fastly and effectively thanks to its weighting scheme that gives more (resp. less) importance to common (resp. uncommon) structural motifs. The weighting scheme is also shown to be capable of capturing conservation signal as the number of homologous RNA increases. These results demonstrate the ability of aliFreefold to efficiently and accurately provide interesting structural representatives of RNA families. AVAILABILITY AND IMPLEMENTATION: aliFreeFold was implemented in C++. Source code and Linux binary are freely available at https://github.com/UdeS-CoBIUS/aliFreeFold. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-6022685
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-60226852018-07-05 aliFreeFold: an alignment-free approach to predict secondary structure from homologous RNA sequences Glouzon, Jean-Pierre Séhi Ouangraoua, Aïda Bioinformatics Ismb 2018–Intelligent Systems for Molecular Biology Proceedings MOTIVATION: Predicting the conserved secondary structure of homologous ribonucleic acid (RNA) sequences is crucial for understanding RNA functions. However, fast and accurate RNA structure prediction is challenging, especially when the number and the divergence of homologous RNA increases. To address this challenge, we propose aliFreeFold, based on a novel alignment-free approach which computes a representative structure from a set of homologous RNA sequences using sub-optimal secondary structures generated for each sequence. It is based on a vector representation of sub-optimal structures capturing structure conservation signals by weighting structural motifs according to their conservation across the sub-optimal structures. RESULTS: We demonstrate that aliFreeFold provides a good balance between speed and accuracy regarding predictions of representative structures for sets of homologous RNA compared to traditional methods based on sequence and structure alignment. We show that aliFreeFold is capable of uncovering conserved structural features fastly and effectively thanks to its weighting scheme that gives more (resp. less) importance to common (resp. uncommon) structural motifs. The weighting scheme is also shown to be capable of capturing conservation signal as the number of homologous RNA increases. These results demonstrate the ability of aliFreefold to efficiently and accurately provide interesting structural representatives of RNA families. AVAILABILITY AND IMPLEMENTATION: aliFreeFold was implemented in C++. Source code and Linux binary are freely available at https://github.com/UdeS-CoBIUS/aliFreeFold. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2018-07-01 2018-06-27 /pmc/articles/PMC6022685/ /pubmed/29949960 http://dx.doi.org/10.1093/bioinformatics/bty234 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Ismb 2018–Intelligent Systems for Molecular Biology Proceedings
Glouzon, Jean-Pierre Séhi
Ouangraoua, Aïda
aliFreeFold: an alignment-free approach to predict secondary structure from homologous RNA sequences
title aliFreeFold: an alignment-free approach to predict secondary structure from homologous RNA sequences
title_full aliFreeFold: an alignment-free approach to predict secondary structure from homologous RNA sequences
title_fullStr aliFreeFold: an alignment-free approach to predict secondary structure from homologous RNA sequences
title_full_unstemmed aliFreeFold: an alignment-free approach to predict secondary structure from homologous RNA sequences
title_short aliFreeFold: an alignment-free approach to predict secondary structure from homologous RNA sequences
title_sort alifreefold: an alignment-free approach to predict secondary structure from homologous rna sequences
topic Ismb 2018–Intelligent Systems for Molecular Biology Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6022685/
https://www.ncbi.nlm.nih.gov/pubmed/29949960
http://dx.doi.org/10.1093/bioinformatics/bty234
work_keys_str_mv AT glouzonjeanpierresehi alifreefoldanalignmentfreeapproachtopredictsecondarystructurefromhomologousrnasequences
AT ouangraouaaida alifreefoldanalignmentfreeapproachtopredictsecondarystructurefromhomologousrnasequences