Cargando…
aliFreeFold: an alignment-free approach to predict secondary structure from homologous RNA sequences
MOTIVATION: Predicting the conserved secondary structure of homologous ribonucleic acid (RNA) sequences is crucial for understanding RNA functions. However, fast and accurate RNA structure prediction is challenging, especially when the number and the divergence of homologous RNA increases. To addres...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6022685/ https://www.ncbi.nlm.nih.gov/pubmed/29949960 http://dx.doi.org/10.1093/bioinformatics/bty234 |
_version_ | 1783335730870747136 |
---|---|
author | Glouzon, Jean-Pierre Séhi Ouangraoua, Aïda |
author_facet | Glouzon, Jean-Pierre Séhi Ouangraoua, Aïda |
author_sort | Glouzon, Jean-Pierre Séhi |
collection | PubMed |
description | MOTIVATION: Predicting the conserved secondary structure of homologous ribonucleic acid (RNA) sequences is crucial for understanding RNA functions. However, fast and accurate RNA structure prediction is challenging, especially when the number and the divergence of homologous RNA increases. To address this challenge, we propose aliFreeFold, based on a novel alignment-free approach which computes a representative structure from a set of homologous RNA sequences using sub-optimal secondary structures generated for each sequence. It is based on a vector representation of sub-optimal structures capturing structure conservation signals by weighting structural motifs according to their conservation across the sub-optimal structures. RESULTS: We demonstrate that aliFreeFold provides a good balance between speed and accuracy regarding predictions of representative structures for sets of homologous RNA compared to traditional methods based on sequence and structure alignment. We show that aliFreeFold is capable of uncovering conserved structural features fastly and effectively thanks to its weighting scheme that gives more (resp. less) importance to common (resp. uncommon) structural motifs. The weighting scheme is also shown to be capable of capturing conservation signal as the number of homologous RNA increases. These results demonstrate the ability of aliFreefold to efficiently and accurately provide interesting structural representatives of RNA families. AVAILABILITY AND IMPLEMENTATION: aliFreeFold was implemented in C++. Source code and Linux binary are freely available at https://github.com/UdeS-CoBIUS/aliFreeFold. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-6022685 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-60226852018-07-05 aliFreeFold: an alignment-free approach to predict secondary structure from homologous RNA sequences Glouzon, Jean-Pierre Séhi Ouangraoua, Aïda Bioinformatics Ismb 2018–Intelligent Systems for Molecular Biology Proceedings MOTIVATION: Predicting the conserved secondary structure of homologous ribonucleic acid (RNA) sequences is crucial for understanding RNA functions. However, fast and accurate RNA structure prediction is challenging, especially when the number and the divergence of homologous RNA increases. To address this challenge, we propose aliFreeFold, based on a novel alignment-free approach which computes a representative structure from a set of homologous RNA sequences using sub-optimal secondary structures generated for each sequence. It is based on a vector representation of sub-optimal structures capturing structure conservation signals by weighting structural motifs according to their conservation across the sub-optimal structures. RESULTS: We demonstrate that aliFreeFold provides a good balance between speed and accuracy regarding predictions of representative structures for sets of homologous RNA compared to traditional methods based on sequence and structure alignment. We show that aliFreeFold is capable of uncovering conserved structural features fastly and effectively thanks to its weighting scheme that gives more (resp. less) importance to common (resp. uncommon) structural motifs. The weighting scheme is also shown to be capable of capturing conservation signal as the number of homologous RNA increases. These results demonstrate the ability of aliFreefold to efficiently and accurately provide interesting structural representatives of RNA families. AVAILABILITY AND IMPLEMENTATION: aliFreeFold was implemented in C++. Source code and Linux binary are freely available at https://github.com/UdeS-CoBIUS/aliFreeFold. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2018-07-01 2018-06-27 /pmc/articles/PMC6022685/ /pubmed/29949960 http://dx.doi.org/10.1093/bioinformatics/bty234 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Ismb 2018–Intelligent Systems for Molecular Biology Proceedings Glouzon, Jean-Pierre Séhi Ouangraoua, Aïda aliFreeFold: an alignment-free approach to predict secondary structure from homologous RNA sequences |
title | aliFreeFold: an alignment-free approach to predict secondary structure from homologous RNA sequences |
title_full | aliFreeFold: an alignment-free approach to predict secondary structure from homologous RNA sequences |
title_fullStr | aliFreeFold: an alignment-free approach to predict secondary structure from homologous RNA sequences |
title_full_unstemmed | aliFreeFold: an alignment-free approach to predict secondary structure from homologous RNA sequences |
title_short | aliFreeFold: an alignment-free approach to predict secondary structure from homologous RNA sequences |
title_sort | alifreefold: an alignment-free approach to predict secondary structure from homologous rna sequences |
topic | Ismb 2018–Intelligent Systems for Molecular Biology Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6022685/ https://www.ncbi.nlm.nih.gov/pubmed/29949960 http://dx.doi.org/10.1093/bioinformatics/bty234 |
work_keys_str_mv | AT glouzonjeanpierresehi alifreefoldanalignmentfreeapproachtopredictsecondarystructurefromhomologousrnasequences AT ouangraouaaida alifreefoldanalignmentfreeapproachtopredictsecondarystructurefromhomologousrnasequences |