Error statistics of hidden Markov model and hidden Boltzmann model results
BACKGROUND: Hidden Markov models and hidden Boltzmann models are employed in computational biology and a variety of other scientific fields for a variety of analyses of sequential data. Whether the associated algorithms are used to compute an actual probability or, more generally, an odds ratio or s...
Main author: | Newberg, Lee A |
---|---|
Format: | Text |
Language: | English |
Published: | BioMed Central 2009 |
Subjects: | Methodology Article |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2722652/ https://www.ncbi.nlm.nih.gov/pubmed/19589158 http://dx.doi.org/10.1186/1471-2105-10-212 |
_version_ | 1782170321396170752 |
---|---|
author | Newberg, Lee A |
author_facet | Newberg, Lee A |
author_sort | Newberg, Lee A |
collection | PubMed |
description | BACKGROUND: Hidden Markov models and hidden Boltzmann models are employed in computational biology and a variety of other scientific fields for a variety of analyses of sequential data. Whether the associated algorithms are used to compute an actual probability or, more generally, an odds ratio or some other score, a frequent requirement is that the error statistics of a given score be known. What is the chance that random data would achieve that score or better? What is the chance that a real signal would achieve a given score threshold? RESULTS: Here we present a novel general approach to estimating these false positive and true positive rates that is significantly more efficient than are existing general approaches. We validate the technique via an implementation within the HMMER 3.0 package, which scans DNA or protein sequence databases for patterns of interest, using a profile-HMM. CONCLUSION: The new approach is faster than general naïve sampling approaches, and more general than other current approaches. It provides an efficient mechanism by which to estimate error statistics for hidden Markov model and hidden Boltzmann model results. |
format | Text |
id | pubmed-2722652 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-2722652 2009-08-07 Error statistics of hidden Markov model and hidden Boltzmann model results Newberg, Lee A BMC Bioinformatics Methodology Article BACKGROUND: Hidden Markov models and hidden Boltzmann models are employed in computational biology and a variety of other scientific fields for a variety of analyses of sequential data. Whether the associated algorithms are used to compute an actual probability or, more generally, an odds ratio or some other score, a frequent requirement is that the error statistics of a given score be known. What is the chance that random data would achieve that score or better? What is the chance that a real signal would achieve a given score threshold? RESULTS: Here we present a novel general approach to estimating these false positive and true positive rates that is significantly more efficient than are existing general approaches. We validate the technique via an implementation within the HMMER 3.0 package, which scans DNA or protein sequence databases for patterns of interest, using a profile-HMM. CONCLUSION: The new approach is faster than general naïve sampling approaches, and more general than other current approaches. It provides an efficient mechanism by which to estimate error statistics for hidden Markov model and hidden Boltzmann model results. BioMed Central 2009-07-09 /pmc/articles/PMC2722652/ /pubmed/19589158 http://dx.doi.org/10.1186/1471-2105-10-212 Text en Copyright © 2009 Newberg; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Article Newberg, Lee A Error statistics of hidden Markov model and hidden Boltzmann model results |
title | Error statistics of hidden Markov model and hidden Boltzmann model results |
title_sort | error statistics of hidden markov model and hidden boltzmann model results |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2722652/ https://www.ncbi.nlm.nih.gov/pubmed/19589158 http://dx.doi.org/10.1186/1471-2105-10-212 |
work_keys_str_mv | AT newbergleea errorstatisticsofhiddenmarkovmodelandhiddenboltzmannmodelresults |