Cargando…

A two-sample Bayesian t-test for microarray data

BACKGROUND: Determining whether a gene is differentially expressed in two different samples remains an important statistical problem. Prior work in this area has featured the use of t-tests with pooled estimates of the sample variance based on similarly expressed genes. These methods do not display...

Descripción completa

Detalles Bibliográficos
Autores principales: Fox, Richard J, Dimmic, Matthew W
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1431571/
https://www.ncbi.nlm.nih.gov/pubmed/16529652
http://dx.doi.org/10.1186/1471-2105-7-126
_version_ 1782127208055177216
author Fox, Richard J
Dimmic, Matthew W
author_facet Fox, Richard J
Dimmic, Matthew W
author_sort Fox, Richard J
collection PubMed
description BACKGROUND: Determining whether a gene is differentially expressed in two different samples remains an important statistical problem. Prior work in this area has featured the use of t-tests with pooled estimates of the sample variance based on similarly expressed genes. These methods do not display consistent behavior across the entire range of pooling and can be biased when the prior hyperparameters are specified heuristically. RESULTS: A two-sample Bayesian t-test is proposed for use in determining whether a gene is differentially expressed in two different samples. The test method is an extension of earlier work that made use of point estimates for the variance. The method proposed here explicitly calculates in analytic form the marginal distribution for the difference in the mean expression of two samples, obviating the need for point estimates of the variance without recourse to posterior simulation. The prior distribution involves a single hyperparameter that can be calculated in a statistically rigorous manner, making clear the connection between the prior degrees of freedom and prior variance. CONCLUSION: The test is easy to understand and implement and application to both real and simulated data shows that the method has equal or greater power compared to the previous method and demonstrates consistent Type I error rates. The test is generally applicable outside the microarray field to any situation where prior information about the variance is available and is not limited to cases where estimates of the variance are based on many similar observations.
format Text
id pubmed-1431571
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-14315712006-04-21 A two-sample Bayesian t-test for microarray data Fox, Richard J Dimmic, Matthew W BMC Bioinformatics Research Article BACKGROUND: Determining whether a gene is differentially expressed in two different samples remains an important statistical problem. Prior work in this area has featured the use of t-tests with pooled estimates of the sample variance based on similarly expressed genes. These methods do not display consistent behavior across the entire range of pooling and can be biased when the prior hyperparameters are specified heuristically. RESULTS: A two-sample Bayesian t-test is proposed for use in determining whether a gene is differentially expressed in two different samples. The test method is an extension of earlier work that made use of point estimates for the variance. The method proposed here explicitly calculates in analytic form the marginal distribution for the difference in the mean expression of two samples, obviating the need for point estimates of the variance without recourse to posterior simulation. The prior distribution involves a single hyperparameter that can be calculated in a statistically rigorous manner, making clear the connection between the prior degrees of freedom and prior variance. CONCLUSION: The test is easy to understand and implement and application to both real and simulated data shows that the method has equal or greater power compared to the previous method and demonstrates consistent Type I error rates. The test is generally applicable outside the microarray field to any situation where prior information about the variance is available and is not limited to cases where estimates of the variance are based on many similar observations. BioMed Central 2006-03-10 /pmc/articles/PMC1431571/ /pubmed/16529652 http://dx.doi.org/10.1186/1471-2105-7-126 Text en Copyright © 2006 Fox and Dimmic; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Fox, Richard J
Dimmic, Matthew W
A two-sample Bayesian t-test for microarray data
title A two-sample Bayesian t-test for microarray data
title_full A two-sample Bayesian t-test for microarray data
title_fullStr A two-sample Bayesian t-test for microarray data
title_full_unstemmed A two-sample Bayesian t-test for microarray data
title_short A two-sample Bayesian t-test for microarray data
title_sort two-sample bayesian t-test for microarray data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1431571/
https://www.ncbi.nlm.nih.gov/pubmed/16529652
http://dx.doi.org/10.1186/1471-2105-7-126
work_keys_str_mv AT foxrichardj atwosamplebayesianttestformicroarraydata
AT dimmicmattheww atwosamplebayesianttestformicroarraydata
AT foxrichardj twosamplebayesianttestformicroarraydata
AT dimmicmattheww twosamplebayesianttestformicroarraydata