Cargando…

Evolving stochastic context--free grammars for RNA secondary structure prediction

BACKGROUND: Stochastic Context–Free Grammars (SCFGs) were applied successfully to RNA secondary structure prediction in the early 90s, and used in combination with comparative methods in the late 90s. The set of SCFGs potentially useful for RNA secondary structure prediction is very large, but a few...

Descripción completa

Detalles Bibliográficos
Autores principales: WJ Anderson, James, Tataru, Paula, Staines, Joe, Hein, Jotun, Lyngsø, Rune
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3464655/
https://www.ncbi.nlm.nih.gov/pubmed/22559985
http://dx.doi.org/10.1186/1471-2105-13-78
_version_ 1782245446151831552
author WJ Anderson, James
Tataru, Paula
Staines, Joe
Hein, Jotun
Lyngsø, Rune
author_facet WJ Anderson, James
Tataru, Paula
Staines, Joe
Hein, Jotun
Lyngsø, Rune
author_sort WJ Anderson, James
collection PubMed
description BACKGROUND: Stochastic Context–Free Grammars (SCFGs) were applied successfully to RNA secondary structure prediction in the early 90s, and used in combination with comparative methods in the late 90s. The set of SCFGs potentially useful for RNA secondary structure prediction is very large, but a few intuitively designed grammars have remained dominant. In this paper we investigate two automatic search techniques for effective grammars – exhaustive search for very compact grammars and an evolutionary algorithm to find larger grammars. We also examine whether grammar ambiguity is as problematic to structure prediction as has been previously suggested. RESULTS: These search techniques were applied to predict RNA secondary structure on a maximal data set and revealed new and interesting grammars, though none are dramatically better than classic grammars. In general, results showed that many grammars with quite different structure could have very similar predictive ability. Many ambiguous grammars were found which were at least as effective as the best current unambiguous grammars. CONCLUSIONS: Overall the method of evolving SCFGs for RNA secondary structure prediction proved effective in finding many grammars that had strong predictive accuracy, as good or slightly better than those designed manually. Furthermore, several of the best grammars found were ambiguous, demonstrating that such grammars should not be disregarded.
format Online
Article
Text
id pubmed-3464655
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-34646552012-10-05 Evolving stochastic context--free grammars for RNA secondary structure prediction WJ Anderson, James Tataru, Paula Staines, Joe Hein, Jotun Lyngsø, Rune BMC Bioinformatics Research Article BACKGROUND: Stochastic Context–Free Grammars (SCFGs) were applied successfully to RNA secondary structure prediction in the early 90s, and used in combination with comparative methods in the late 90s. The set of SCFGs potentially useful for RNA secondary structure prediction is very large, but a few intuitively designed grammars have remained dominant. In this paper we investigate two automatic search techniques for effective grammars – exhaustive search for very compact grammars and an evolutionary algorithm to find larger grammars. We also examine whether grammar ambiguity is as problematic to structure prediction as has been previously suggested. RESULTS: These search techniques were applied to predict RNA secondary structure on a maximal data set and revealed new and interesting grammars, though none are dramatically better than classic grammars. In general, results showed that many grammars with quite different structure could have very similar predictive ability. Many ambiguous grammars were found which were at least as effective as the best current unambiguous grammars. CONCLUSIONS: Overall the method of evolving SCFGs for RNA secondary structure prediction proved effective in finding many grammars that had strong predictive accuracy, as good or slightly better than those designed manually. Furthermore, several of the best grammars found were ambiguous, demonstrating that such grammars should not be disregarded. BioMed Central 2012-05-04 /pmc/articles/PMC3464655/ /pubmed/22559985 http://dx.doi.org/10.1186/1471-2105-13-78 Text en Copyright ©2012 Anderson et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
WJ Anderson, James
Tataru, Paula
Staines, Joe
Hein, Jotun
Lyngsø, Rune
Evolving stochastic context--free grammars for RNA secondary structure prediction
title Evolving stochastic context--free grammars for RNA secondary structure prediction
title_full Evolving stochastic context--free grammars for RNA secondary structure prediction
title_fullStr Evolving stochastic context--free grammars for RNA secondary structure prediction
title_full_unstemmed Evolving stochastic context--free grammars for RNA secondary structure prediction
title_short Evolving stochastic context--free grammars for RNA secondary structure prediction
title_sort evolving stochastic context--free grammars for rna secondary structure prediction
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3464655/
https://www.ncbi.nlm.nih.gov/pubmed/22559985
http://dx.doi.org/10.1186/1471-2105-13-78
work_keys_str_mv AT wjandersonjames evolvingstochasticcontextfreegrammarsforrnasecondarystructureprediction
AT tatarupaula evolvingstochasticcontextfreegrammarsforrnasecondarystructureprediction
AT stainesjoe evolvingstochasticcontextfreegrammarsforrnasecondarystructureprediction
AT heinjotun evolvingstochasticcontextfreegrammarsforrnasecondarystructureprediction
AT lyngsørune evolvingstochasticcontextfreegrammarsforrnasecondarystructureprediction