Cargando…

Bayesian meta-analysis models for microarray data: a comparative study

BACKGROUND: With the growing abundance of microarray data, statistical methods are increasingly needed to integrate results across studies. Two common approaches for meta-analysis of microarrays include either combining gene expression measures across studies or combining summaries such as p-values,...

Descripción completa

Detalles Bibliográficos
Autores principales:	Conlon, Erin M, Song, Joon J, Liu, Anna
Formato:	Texto
Lenguaje:	English
Publicado:	BioMed Central 2007
Materias:	Methodology Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1851021/ https://www.ncbi.nlm.nih.gov/pubmed/17343745 http://dx.doi.org/10.1186/1471-2105-8-80

_version_	1782132959287967744
author	Conlon, Erin M Song, Joon J Liu, Anna
author_facet	Conlon, Erin M Song, Joon J Liu, Anna
author_sort	Conlon, Erin M
collection	PubMed
description	BACKGROUND: With the growing abundance of microarray data, statistical methods are increasingly needed to integrate results across studies. Two common approaches for meta-analysis of microarrays include either combining gene expression measures across studies or combining summaries such as p-values, probabilities or ranks. Here, we compare two Bayesian meta-analysis models that are analogous to these methods. RESULTS: Two Bayesian meta-analysis models for microarray data have recently been introduced. The first model combines standardized gene expression measures across studies into an overall mean, accounting for inter-study variability, while the second combines probabilities of differential expression without combining expression values. Both models produce the gene-specific posterior probability of differential expression, which is the basis for inference. Since the standardized expression integration model includes inter-study variability, it may improve accuracy of results versus the probability integration model. However, due to the small number of studies typical in microarray meta-analyses, the variability between studies is challenging to estimate. The probability integration model eliminates the need to model variability between studies, and thus its implementation is more straightforward. We found in simulations of two and five studies that combining probabilities outperformed combining standardized gene expression measures for three comparison values: the percent of true discovered genes in meta-analysis versus individual studies; the percent of true genes omitted in meta-analysis versus separate studies, and the number of true discovered genes for fixed levels of Bayesian false discovery. We identified similar results when pooling two independent studies of Bacillus subtilis. We assumed that each study was produced from the same microarray platform with only two conditions: a treatment and control, and that the data sets were pre-scaled. CONCLUSION: The Bayesian meta-analysis model that combines probabilities across studies does not aggregate gene expression measures, thus an inter-study variability parameter is not included in the model. This results in a simpler modeling approach than aggregating expression measures, which accounts for variability across studies. The probability integration model identified more true discovered genes and fewer true omitted genes than combining expression measures, for our data sets.
format	Text
id	pubmed-1851021
institution	National Center for Biotechnology Information
language	English
publishDate	2007
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-18510212007-04-11 Bayesian meta-analysis models for microarray data: a comparative study Conlon, Erin M Song, Joon J Liu, Anna BMC Bioinformatics Methodology Article BACKGROUND: With the growing abundance of microarray data, statistical methods are increasingly needed to integrate results across studies. Two common approaches for meta-analysis of microarrays include either combining gene expression measures across studies or combining summaries such as p-values, probabilities or ranks. Here, we compare two Bayesian meta-analysis models that are analogous to these methods. RESULTS: Two Bayesian meta-analysis models for microarray data have recently been introduced. The first model combines standardized gene expression measures across studies into an overall mean, accounting for inter-study variability, while the second combines probabilities of differential expression without combining expression values. Both models produce the gene-specific posterior probability of differential expression, which is the basis for inference. Since the standardized expression integration model includes inter-study variability, it may improve accuracy of results versus the probability integration model. However, due to the small number of studies typical in microarray meta-analyses, the variability between studies is challenging to estimate. The probability integration model eliminates the need to model variability between studies, and thus its implementation is more straightforward. We found in simulations of two and five studies that combining probabilities outperformed combining standardized gene expression measures for three comparison values: the percent of true discovered genes in meta-analysis versus individual studies; the percent of true genes omitted in meta-analysis versus separate studies, and the number of true discovered genes for fixed levels of Bayesian false discovery. We identified similar results when pooling two independent studies of Bacillus subtilis. We assumed that each study was produced from the same microarray platform with only two conditions: a treatment and control, and that the data sets were pre-scaled. CONCLUSION: The Bayesian meta-analysis model that combines probabilities across studies does not aggregate gene expression measures, thus an inter-study variability parameter is not included in the model. This results in a simpler modeling approach than aggregating expression measures, which accounts for variability across studies. The probability integration model identified more true discovered genes and fewer true omitted genes than combining expression measures, for our data sets. BioMed Central 2007-03-07 /pmc/articles/PMC1851021/ /pubmed/17343745 http://dx.doi.org/10.1186/1471-2105-8-80 Text en Copyright © 2007 Conlon et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Methodology Article Conlon, Erin M Song, Joon J Liu, Anna Bayesian meta-analysis models for microarray data: a comparative study
title	Bayesian meta-analysis models for microarray data: a comparative study
title_full	Bayesian meta-analysis models for microarray data: a comparative study
title_fullStr	Bayesian meta-analysis models for microarray data: a comparative study
title_full_unstemmed	Bayesian meta-analysis models for microarray data: a comparative study
title_short	Bayesian meta-analysis models for microarray data: a comparative study
title_sort	bayesian meta-analysis models for microarray data: a comparative study
topic	Methodology Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1851021/ https://www.ncbi.nlm.nih.gov/pubmed/17343745 http://dx.doi.org/10.1186/1471-2105-8-80
work_keys_str_mv	AT conlonerinm bayesianmetaanalysismodelsformicroarraydataacomparativestudy AT songjoonj bayesianmetaanalysismodelsformicroarraydataacomparativestudy AT liuanna bayesianmetaanalysismodelsformicroarraydataacomparativestudy

Bayesian meta-analysis models for microarray data: a comparative study

Ejemplares similares