Cargando…

A fast and robust iterative algorithm for prediction of RNA pseudoknotted secondary structures

BACKGROUND: Improving accuracy and efficiency of computational methods that predict pseudoknotted RNA secondary structures is an ongoing challenge. Existing methods based on free energy minimization tend to be very slow and are limited in the types of pseudoknots that they can predict. Incorporating...

Descripción completa

Detalles Bibliográficos
Autores principales: Jabbari, Hosna, Condon, Anne
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4064103/
https://www.ncbi.nlm.nih.gov/pubmed/24884954
http://dx.doi.org/10.1186/1471-2105-15-147
_version_ 1782321895174045696
author Jabbari, Hosna
Condon, Anne
author_facet Jabbari, Hosna
Condon, Anne
author_sort Jabbari, Hosna
collection PubMed
description BACKGROUND: Improving accuracy and efficiency of computational methods that predict pseudoknotted RNA secondary structures is an ongoing challenge. Existing methods based on free energy minimization tend to be very slow and are limited in the types of pseudoknots that they can predict. Incorporating known structural information can improve prediction accuracy; however, there are not many methods for prediction of pseudoknotted structures that can incorporate structural information as input. There is even less understanding of the relative robustness of these methods with respect to partial information. RESULTS: We present a new method, Iterative HFold, for pseudoknotted RNA secondary structure prediction. Iterative HFold takes as input a pseudoknot-free structure, and produces a possibly pseudoknotted structure whose energy is at least as low as that of any (density-2) pseudoknotted structure containing the input structure. Iterative HFold leverages strengths of earlier methods, namely the fast running time of HFold, a method that is based on the hierarchical folding hypothesis, and the energy parameters of HotKnots V2.0. Our experimental evaluation on a large data set shows that Iterative HFold is robust with respect to partial information, with average accuracy on pseudoknotted structures steadily increasing from roughly 54% to 79% as the user provides up to 40% of the input structure. Iterative HFold is much faster than HotKnots V2.0, while having comparable accuracy. Iterative HFold also has significantly better accuracy than IPknot on our HK-PK and IP-pk168 data sets. CONCLUSIONS: Iterative HFold is a robust method for prediction of pseudoknotted RNA secondary structures, whose accuracy with more than 5% information about true pseudoknot-free structures is better than that of IPknot, and with about 35% information about true pseudoknot-free structures compares well with that of HotKnots V2.0 while being significantly faster. Iterative HFold and all data used in this work are freely available at http://www.cs.ubc.ca/~hjabbari/software.php.
format Online
Article
Text
id pubmed-4064103
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-40641032014-06-30 A fast and robust iterative algorithm for prediction of RNA pseudoknotted secondary structures Jabbari, Hosna Condon, Anne BMC Bioinformatics Research Article BACKGROUND: Improving accuracy and efficiency of computational methods that predict pseudoknotted RNA secondary structures is an ongoing challenge. Existing methods based on free energy minimization tend to be very slow and are limited in the types of pseudoknots that they can predict. Incorporating known structural information can improve prediction accuracy; however, there are not many methods for prediction of pseudoknotted structures that can incorporate structural information as input. There is even less understanding of the relative robustness of these methods with respect to partial information. RESULTS: We present a new method, Iterative HFold, for pseudoknotted RNA secondary structure prediction. Iterative HFold takes as input a pseudoknot-free structure, and produces a possibly pseudoknotted structure whose energy is at least as low as that of any (density-2) pseudoknotted structure containing the input structure. Iterative HFold leverages strengths of earlier methods, namely the fast running time of HFold, a method that is based on the hierarchical folding hypothesis, and the energy parameters of HotKnots V2.0. Our experimental evaluation on a large data set shows that Iterative HFold is robust with respect to partial information, with average accuracy on pseudoknotted structures steadily increasing from roughly 54% to 79% as the user provides up to 40% of the input structure. Iterative HFold is much faster than HotKnots V2.0, while having comparable accuracy. Iterative HFold also has significantly better accuracy than IPknot on our HK-PK and IP-pk168 data sets. CONCLUSIONS: Iterative HFold is a robust method for prediction of pseudoknotted RNA secondary structures, whose accuracy with more than 5% information about true pseudoknot-free structures is better than that of IPknot, and with about 35% information about true pseudoknot-free structures compares well with that of HotKnots V2.0 while being significantly faster. Iterative HFold and all data used in this work are freely available at http://www.cs.ubc.ca/~hjabbari/software.php. BioMed Central 2014-05-18 /pmc/articles/PMC4064103/ /pubmed/24884954 http://dx.doi.org/10.1186/1471-2105-15-147 Text en Copyright © 2014 Jabbari and Condon; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Jabbari, Hosna
Condon, Anne
A fast and robust iterative algorithm for prediction of RNA pseudoknotted secondary structures
title A fast and robust iterative algorithm for prediction of RNA pseudoknotted secondary structures
title_full A fast and robust iterative algorithm for prediction of RNA pseudoknotted secondary structures
title_fullStr A fast and robust iterative algorithm for prediction of RNA pseudoknotted secondary structures
title_full_unstemmed A fast and robust iterative algorithm for prediction of RNA pseudoknotted secondary structures
title_short A fast and robust iterative algorithm for prediction of RNA pseudoknotted secondary structures
title_sort fast and robust iterative algorithm for prediction of rna pseudoknotted secondary structures
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4064103/
https://www.ncbi.nlm.nih.gov/pubmed/24884954
http://dx.doi.org/10.1186/1471-2105-15-147
work_keys_str_mv AT jabbarihosna afastandrobustiterativealgorithmforpredictionofrnapseudoknottedsecondarystructures
AT condonanne afastandrobustiterativealgorithmforpredictionofrnapseudoknottedsecondarystructures
AT jabbarihosna fastandrobustiterativealgorithmforpredictionofrnapseudoknottedsecondarystructures
AT condonanne fastandrobustiterativealgorithmforpredictionofrnapseudoknottedsecondarystructures