Cargando…

Measuring Global Credibility with Application to Local Sequence Alignment

Computational biology is replete with high-dimensional (high-D) discrete prediction and inference problems, including sequence alignment, RNA structure prediction, phylogenetic inference, motif finding, prediction of pathways, and model selection problems in statistical genetics. Even though predict...

Descripción completa

Detalles Bibliográficos
Autores principales:	Webb-Robertson, Bobbie-Jo M., McCue, Lee Ann, Lawrence, Charles E.
Formato:	Texto
Lenguaje:	English
Publicado:	Public Library of Science 2008
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2367447/ https://www.ncbi.nlm.nih.gov/pubmed/18464927 http://dx.doi.org/10.1371/journal.pcbi.1000077

_version_	1782154294015819776
author	Webb-Robertson, Bobbie-Jo M. McCue, Lee Ann Lawrence, Charles E.
author_facet	Webb-Robertson, Bobbie-Jo M. McCue, Lee Ann Lawrence, Charles E.
author_sort	Webb-Robertson, Bobbie-Jo M.
collection	PubMed
description	Computational biology is replete with high-dimensional (high-D) discrete prediction and inference problems, including sequence alignment, RNA structure prediction, phylogenetic inference, motif finding, prediction of pathways, and model selection problems in statistical genetics. Even though prediction and inference in these settings are uncertain, little attention has been focused on the development of global measures of uncertainty. Regardless of the procedure employed to produce a prediction, when a procedure delivers a single answer, that answer is a point estimate selected from the solution ensemble, the set of all possible solutions. For high-D discrete space, these ensembles are immense, and thus there is considerable uncertainty. We recommend the use of Bayesian credibility limits to describe this uncertainty, where a (1−α)%, 0≤α≤1, credibility limit is the minimum Hamming distance radius of a hyper-sphere containing (1−α)% of the posterior distribution. Because sequence alignment is arguably the most extensively used procedure in computational biology, we employ it here to make these general concepts more concrete. The maximum similarity estimator (i.e., the alignment that maximizes the likelihood) and the centroid estimator (i.e., the alignment that minimizes the mean Hamming distance from the posterior weighted ensemble of alignments) are used to demonstrate the application of Bayesian credibility limits to alignment estimators. Application of Bayesian credibility limits to the alignment of 20 human/rodent orthologous sequence pairs and 125 orthologous sequence pairs from six Shewanella species shows that credibility limits of the alignments of promoter sequences of these species vary widely, and that centroid alignments dependably have tighter credibility limits than traditional maximum similarity alignments.
format	Text
id	pubmed-2367447
institution	National Center for Biotechnology Information
language	English
publishDate	2008
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-23674472008-05-16 Measuring Global Credibility with Application to Local Sequence Alignment Webb-Robertson, Bobbie-Jo M. McCue, Lee Ann Lawrence, Charles E. PLoS Comput Biol Research Article Computational biology is replete with high-dimensional (high-D) discrete prediction and inference problems, including sequence alignment, RNA structure prediction, phylogenetic inference, motif finding, prediction of pathways, and model selection problems in statistical genetics. Even though prediction and inference in these settings are uncertain, little attention has been focused on the development of global measures of uncertainty. Regardless of the procedure employed to produce a prediction, when a procedure delivers a single answer, that answer is a point estimate selected from the solution ensemble, the set of all possible solutions. For high-D discrete space, these ensembles are immense, and thus there is considerable uncertainty. We recommend the use of Bayesian credibility limits to describe this uncertainty, where a (1−α)%, 0≤α≤1, credibility limit is the minimum Hamming distance radius of a hyper-sphere containing (1−α)% of the posterior distribution. Because sequence alignment is arguably the most extensively used procedure in computational biology, we employ it here to make these general concepts more concrete. The maximum similarity estimator (i.e., the alignment that maximizes the likelihood) and the centroid estimator (i.e., the alignment that minimizes the mean Hamming distance from the posterior weighted ensemble of alignments) are used to demonstrate the application of Bayesian credibility limits to alignment estimators. Application of Bayesian credibility limits to the alignment of 20 human/rodent orthologous sequence pairs and 125 orthologous sequence pairs from six Shewanella species shows that credibility limits of the alignments of promoter sequences of these species vary widely, and that centroid alignments dependably have tighter credibility limits than traditional maximum similarity alignments. Public Library of Science 2008-05-16 /pmc/articles/PMC2367447/ /pubmed/18464927 http://dx.doi.org/10.1371/journal.pcbi.1000077 Text en Webb-Robertson et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle	Research Article Webb-Robertson, Bobbie-Jo M. McCue, Lee Ann Lawrence, Charles E. Measuring Global Credibility with Application to Local Sequence Alignment
title	Measuring Global Credibility with Application to Local Sequence Alignment
title_full	Measuring Global Credibility with Application to Local Sequence Alignment
title_fullStr	Measuring Global Credibility with Application to Local Sequence Alignment
title_full_unstemmed	Measuring Global Credibility with Application to Local Sequence Alignment
title_short	Measuring Global Credibility with Application to Local Sequence Alignment
title_sort	measuring global credibility with application to local sequence alignment
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2367447/ https://www.ncbi.nlm.nih.gov/pubmed/18464927 http://dx.doi.org/10.1371/journal.pcbi.1000077
work_keys_str_mv	AT webbrobertsonbobbiejom measuringglobalcredibilitywithapplicationtolocalsequencealignment AT mccueleeann measuringglobalcredibilitywithapplicationtolocalsequencealignment AT lawrencecharlese measuringglobalcredibilitywithapplicationtolocalsequencealignment

Measuring Global Credibility with Application to Local Sequence Alignment

Ejemplares similares