Cargando…

Evaluation of several lightweight stochastic context-free grammars for RNA secondary structure prediction

BACKGROUND: RNA secondary structure prediction methods based on probabilistic modeling can be developed using stochastic context-free grammars (SCFGs). Such methods can readily combine different sources of information that can be expressed probabilistically, such as an evolutionary model of comparat...

Descripción completa

Detalles Bibliográficos
Autores principales: Dowell, Robin D, Eddy, Sean R
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2004
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC442121/
https://www.ncbi.nlm.nih.gov/pubmed/15180907
http://dx.doi.org/10.1186/1471-2105-5-71
_version_ 1782121549293158400
author Dowell, Robin D
Eddy, Sean R
author_facet Dowell, Robin D
Eddy, Sean R
author_sort Dowell, Robin D
collection PubMed
description BACKGROUND: RNA secondary structure prediction methods based on probabilistic modeling can be developed using stochastic context-free grammars (SCFGs). Such methods can readily combine different sources of information that can be expressed probabilistically, such as an evolutionary model of comparative RNA sequence analysis and a biophysical model of structure plausibility. However, the number of free parameters in an integrated model for consensus RNA structure prediction can become untenable if the underlying SCFG design is too complex. Thus a key question is, what small, simple SCFG designs perform best for RNA secondary structure prediction? RESULTS: Nine different small SCFGs were implemented to explore the tradeoffs between model complexity and prediction accuracy. Each model was tested for single sequence structure prediction accuracy on a benchmark set of RNA secondary structures. CONCLUSIONS: Four SCFG designs had prediction accuracies near the performance of current energy minimization programs. One of these designs, introduced by Knudsen and Hein in their PFOLD algorithm, has only 21 free parameters and is significantly simpler than the others.
format Text
id pubmed-442121
institution National Center for Biotechnology Information
language English
publishDate 2004
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-4421212004-07-03 Evaluation of several lightweight stochastic context-free grammars for RNA secondary structure prediction Dowell, Robin D Eddy, Sean R BMC Bioinformatics Research Article BACKGROUND: RNA secondary structure prediction methods based on probabilistic modeling can be developed using stochastic context-free grammars (SCFGs). Such methods can readily combine different sources of information that can be expressed probabilistically, such as an evolutionary model of comparative RNA sequence analysis and a biophysical model of structure plausibility. However, the number of free parameters in an integrated model for consensus RNA structure prediction can become untenable if the underlying SCFG design is too complex. Thus a key question is, what small, simple SCFG designs perform best for RNA secondary structure prediction? RESULTS: Nine different small SCFGs were implemented to explore the tradeoffs between model complexity and prediction accuracy. Each model was tested for single sequence structure prediction accuracy on a benchmark set of RNA secondary structures. CONCLUSIONS: Four SCFG designs had prediction accuracies near the performance of current energy minimization programs. One of these designs, introduced by Knudsen and Hein in their PFOLD algorithm, has only 21 free parameters and is significantly simpler than the others. BioMed Central 2004-06-04 /pmc/articles/PMC442121/ /pubmed/15180907 http://dx.doi.org/10.1186/1471-2105-5-71 Text en Copyright © 2004 Dowell and Eddy; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.
spellingShingle Research Article
Dowell, Robin D
Eddy, Sean R
Evaluation of several lightweight stochastic context-free grammars for RNA secondary structure prediction
title Evaluation of several lightweight stochastic context-free grammars for RNA secondary structure prediction
title_full Evaluation of several lightweight stochastic context-free grammars for RNA secondary structure prediction
title_fullStr Evaluation of several lightweight stochastic context-free grammars for RNA secondary structure prediction
title_full_unstemmed Evaluation of several lightweight stochastic context-free grammars for RNA secondary structure prediction
title_short Evaluation of several lightweight stochastic context-free grammars for RNA secondary structure prediction
title_sort evaluation of several lightweight stochastic context-free grammars for rna secondary structure prediction
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC442121/
https://www.ncbi.nlm.nih.gov/pubmed/15180907
http://dx.doi.org/10.1186/1471-2105-5-71
work_keys_str_mv AT dowellrobind evaluationofseverallightweightstochasticcontextfreegrammarsforrnasecondarystructureprediction
AT eddyseanr evaluationofseverallightweightstochasticcontextfreegrammarsforrnasecondarystructureprediction