Error statistics of hidden Markov model and hidden Boltzmann model results

BACKGROUND: Hidden Markov models and hidden Boltzmann models are employed in computational biology and a variety of other scientific fields for a variety of analyses of sequential data. Whether the associated algorithms are used to compute an actual probability or, more generally, an odds ratio or s...

Bibliographic Details
Main author: Newberg, Lee A
Format: Text
Language: English
Published: BioMed Central 2009
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2722652/
https://www.ncbi.nlm.nih.gov/pubmed/19589158
http://dx.doi.org/10.1186/1471-2105-10-212
_version_ 1782170321396170752
author Newberg, Lee A
author_facet Newberg, Lee A
author_sort Newberg, Lee A
collection PubMed
description BACKGROUND: Hidden Markov models and hidden Boltzmann models are employed in computational biology and a variety of other scientific fields for a variety of analyses of sequential data. Whether the associated algorithms are used to compute an actual probability or, more generally, an odds ratio or some other score, a frequent requirement is that the error statistics of a given score be known. What is the chance that random data would achieve that score or better? What is the chance that a real signal would achieve a given score threshold? RESULTS: Here we present a novel general approach to estimating these false positive and true positive rates that is significantly more efficient than are existing general approaches. We validate the technique via an implementation within the HMMER 3.0 package, which scans DNA or protein sequence databases for patterns of interest, using a profile-HMM. CONCLUSION: The new approach is faster than general naïve sampling approaches, and more general than other current approaches. It provides an efficient mechanism by which to estimate error statistics for hidden Markov model and hidden Boltzmann model results.
format Text
id pubmed-2722652
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-2722652 2009-08-07 Error statistics of hidden Markov model and hidden Boltzmann model results Newberg, Lee A BMC Bioinformatics Methodology Article BACKGROUND: Hidden Markov models and hidden Boltzmann models are employed in computational biology and a variety of other scientific fields for a variety of analyses of sequential data. Whether the associated algorithms are used to compute an actual probability or, more generally, an odds ratio or some other score, a frequent requirement is that the error statistics of a given score be known. What is the chance that random data would achieve that score or better? What is the chance that a real signal would achieve a given score threshold? RESULTS: Here we present a novel general approach to estimating these false positive and true positive rates that is significantly more efficient than are existing general approaches. We validate the technique via an implementation within the HMMER 3.0 package, which scans DNA or protein sequence databases for patterns of interest, using a profile-HMM. CONCLUSION: The new approach is faster than general naïve sampling approaches, and more general than other current approaches. It provides an efficient mechanism by which to estimate error statistics for hidden Markov model and hidden Boltzmann model results. BioMed Central 2009-07-09 /pmc/articles/PMC2722652/ /pubmed/19589158 http://dx.doi.org/10.1186/1471-2105-10-212 Text en Copyright © 2009 Newberg; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Newberg, Lee A
Error statistics of hidden Markov model and hidden Boltzmann model results
title Error statistics of hidden Markov model and hidden Boltzmann model results
title_full Error statistics of hidden Markov model and hidden Boltzmann model results
title_fullStr Error statistics of hidden Markov model and hidden Boltzmann model results
title_full_unstemmed Error statistics of hidden Markov model and hidden Boltzmann model results
title_short Error statistics of hidden Markov model and hidden Boltzmann model results
title_sort error statistics of hidden markov model and hidden boltzmann model results
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2722652/
https://www.ncbi.nlm.nih.gov/pubmed/19589158
http://dx.doi.org/10.1186/1471-2105-10-212
work_keys_str_mv AT newbergleea errorstatisticsofhiddenmarkovmodelandhiddenboltzmannmodelresults