Cargando…

RNAspa: a shortest path approach for comparative prediction of the secondary structure of ncRNA molecules

BACKGROUND: In recent years, RNA molecules that are not translated into proteins (ncRNAs) have drawn a great deal of attention, as they were shown to be involved in many cellular functions. One of the most important computational problems regarding ncRNA is to predict the secondary structure of a mo...

Descripción completa

Detalles Bibliográficos
Autores principales: Horesh, Yair, Doniger, Tirza, Michaeli, Shulamit, Unger, Ron
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2147038/
https://www.ncbi.nlm.nih.gov/pubmed/17908318
http://dx.doi.org/10.1186/1471-2105-8-366
_version_ 1782144349430087680
author Horesh, Yair
Doniger, Tirza
Michaeli, Shulamit
Unger, Ron
author_facet Horesh, Yair
Doniger, Tirza
Michaeli, Shulamit
Unger, Ron
author_sort Horesh, Yair
collection PubMed
description BACKGROUND: In recent years, RNA molecules that are not translated into proteins (ncRNAs) have drawn a great deal of attention, as they were shown to be involved in many cellular functions. One of the most important computational problems regarding ncRNA is to predict the secondary structure of a molecule from its sequence. In particular, we attempted to predict the secondary structure for a set of unaligned ncRNA molecules that are taken from the same family, and thus presumably have a similar structure. RESULTS: We developed the RNAspa program, which comparatively predicts the secondary structure for a set of ncRNA molecules in linear time in the number of molecules. We observed that in a list of several hundred suboptimal minimal free energy (MFE) predictions, as provided by the RNAsubopt program of the Vienna package, it is likely that at least one suggested structure would be similar to the true, correct one. The suboptimal solutions of each molecule are represented as a layer of vertices in a graph. The shortest path in this graph is the basis for structural predictions for the molecule. We also show that RNA secondary structures can be compared very rapidly by a simple string Edit-Distance algorithm with a minimal loss of accuracy. We show that this approach allows us to more deeply explore the suboptimal structure space. CONCLUSION: The algorithm was tested on three datasets which include several ncRNA families taken from the Rfam database. These datasets allowed for comparison of the algorithm with other methods. In these tests, RNAspa performed better than four other programs.
format Text
id pubmed-2147038
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-21470382007-12-19 RNAspa: a shortest path approach for comparative prediction of the secondary structure of ncRNA molecules Horesh, Yair Doniger, Tirza Michaeli, Shulamit Unger, Ron BMC Bioinformatics Methodology Article BACKGROUND: In recent years, RNA molecules that are not translated into proteins (ncRNAs) have drawn a great deal of attention, as they were shown to be involved in many cellular functions. One of the most important computational problems regarding ncRNA is to predict the secondary structure of a molecule from its sequence. In particular, we attempted to predict the secondary structure for a set of unaligned ncRNA molecules that are taken from the same family, and thus presumably have a similar structure. RESULTS: We developed the RNAspa program, which comparatively predicts the secondary structure for a set of ncRNA molecules in linear time in the number of molecules. We observed that in a list of several hundred suboptimal minimal free energy (MFE) predictions, as provided by the RNAsubopt program of the Vienna package, it is likely that at least one suggested structure would be similar to the true, correct one. The suboptimal solutions of each molecule are represented as a layer of vertices in a graph. The shortest path in this graph is the basis for structural predictions for the molecule. We also show that RNA secondary structures can be compared very rapidly by a simple string Edit-Distance algorithm with a minimal loss of accuracy. We show that this approach allows us to more deeply explore the suboptimal structure space. CONCLUSION: The algorithm was tested on three datasets which include several ncRNA families taken from the Rfam database. These datasets allowed for comparison of the algorithm with other methods. In these tests, RNAspa performed better than four other programs. BioMed Central 2007-10-01 /pmc/articles/PMC2147038/ /pubmed/17908318 http://dx.doi.org/10.1186/1471-2105-8-366 Text en Copyright © 2007 Horesh et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Horesh, Yair
Doniger, Tirza
Michaeli, Shulamit
Unger, Ron
RNAspa: a shortest path approach for comparative prediction of the secondary structure of ncRNA molecules
title RNAspa: a shortest path approach for comparative prediction of the secondary structure of ncRNA molecules
title_full RNAspa: a shortest path approach for comparative prediction of the secondary structure of ncRNA molecules
title_fullStr RNAspa: a shortest path approach for comparative prediction of the secondary structure of ncRNA molecules
title_full_unstemmed RNAspa: a shortest path approach for comparative prediction of the secondary structure of ncRNA molecules
title_short RNAspa: a shortest path approach for comparative prediction of the secondary structure of ncRNA molecules
title_sort rnaspa: a shortest path approach for comparative prediction of the secondary structure of ncrna molecules
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2147038/
https://www.ncbi.nlm.nih.gov/pubmed/17908318
http://dx.doi.org/10.1186/1471-2105-8-366
work_keys_str_mv AT horeshyair rnaspaashortestpathapproachforcomparativepredictionofthesecondarystructureofncrnamolecules
AT donigertirza rnaspaashortestpathapproachforcomparativepredictionofthesecondarystructureofncrnamolecules
AT michaelishulamit rnaspaashortestpathapproachforcomparativepredictionofthesecondarystructureofncrnamolecules
AT ungerron rnaspaashortestpathapproachforcomparativepredictionofthesecondarystructureofncrnamolecules