Cargando…

Bayesian meta-analysis models for microarray data: a comparative study

BACKGROUND: With the growing abundance of microarray data, statistical methods are increasingly needed to integrate results across studies. Two common approaches for meta-analysis of microarrays include either combining gene expression measures across studies or combining summaries such as p-values,...

Descripción completa

Detalles Bibliográficos
Autores principales: Conlon, Erin M, Song, Joon J, Liu, Anna
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1851021/
https://www.ncbi.nlm.nih.gov/pubmed/17343745
http://dx.doi.org/10.1186/1471-2105-8-80
_version_ 1782132959287967744
author Conlon, Erin M
Song, Joon J
Liu, Anna
author_facet Conlon, Erin M
Song, Joon J
Liu, Anna
author_sort Conlon, Erin M
collection PubMed
description BACKGROUND: With the growing abundance of microarray data, statistical methods are increasingly needed to integrate results across studies. Two common approaches for meta-analysis of microarrays include either combining gene expression measures across studies or combining summaries such as p-values, probabilities or ranks. Here, we compare two Bayesian meta-analysis models that are analogous to these methods. RESULTS: Two Bayesian meta-analysis models for microarray data have recently been introduced. The first model combines standardized gene expression measures across studies into an overall mean, accounting for inter-study variability, while the second combines probabilities of differential expression without combining expression values. Both models produce the gene-specific posterior probability of differential expression, which is the basis for inference. Since the standardized expression integration model includes inter-study variability, it may improve accuracy of results versus the probability integration model. However, due to the small number of studies typical in microarray meta-analyses, the variability between studies is challenging to estimate. The probability integration model eliminates the need to model variability between studies, and thus its implementation is more straightforward. We found in simulations of two and five studies that combining probabilities outperformed combining standardized gene expression measures for three comparison values: the percent of true discovered genes in meta-analysis versus individual studies; the percent of true genes omitted in meta-analysis versus separate studies, and the number of true discovered genes for fixed levels of Bayesian false discovery. We identified similar results when pooling two independent studies of Bacillus subtilis. We assumed that each study was produced from the same microarray platform with only two conditions: a treatment and control, and that the data sets were pre-scaled. CONCLUSION: The Bayesian meta-analysis model that combines probabilities across studies does not aggregate gene expression measures, thus an inter-study variability parameter is not included in the model. This results in a simpler modeling approach than aggregating expression measures, which accounts for variability across studies. The probability integration model identified more true discovered genes and fewer true omitted genes than combining expression measures, for our data sets.
format Text
id pubmed-1851021
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-18510212007-04-11 Bayesian meta-analysis models for microarray data: a comparative study Conlon, Erin M Song, Joon J Liu, Anna BMC Bioinformatics Methodology Article BACKGROUND: With the growing abundance of microarray data, statistical methods are increasingly needed to integrate results across studies. Two common approaches for meta-analysis of microarrays include either combining gene expression measures across studies or combining summaries such as p-values, probabilities or ranks. Here, we compare two Bayesian meta-analysis models that are analogous to these methods. RESULTS: Two Bayesian meta-analysis models for microarray data have recently been introduced. The first model combines standardized gene expression measures across studies into an overall mean, accounting for inter-study variability, while the second combines probabilities of differential expression without combining expression values. Both models produce the gene-specific posterior probability of differential expression, which is the basis for inference. Since the standardized expression integration model includes inter-study variability, it may improve accuracy of results versus the probability integration model. However, due to the small number of studies typical in microarray meta-analyses, the variability between studies is challenging to estimate. The probability integration model eliminates the need to model variability between studies, and thus its implementation is more straightforward. We found in simulations of two and five studies that combining probabilities outperformed combining standardized gene expression measures for three comparison values: the percent of true discovered genes in meta-analysis versus individual studies; the percent of true genes omitted in meta-analysis versus separate studies, and the number of true discovered genes for fixed levels of Bayesian false discovery. We identified similar results when pooling two independent studies of Bacillus subtilis. We assumed that each study was produced from the same microarray platform with only two conditions: a treatment and control, and that the data sets were pre-scaled. CONCLUSION: The Bayesian meta-analysis model that combines probabilities across studies does not aggregate gene expression measures, thus an inter-study variability parameter is not included in the model. This results in a simpler modeling approach than aggregating expression measures, which accounts for variability across studies. The probability integration model identified more true discovered genes and fewer true omitted genes than combining expression measures, for our data sets. BioMed Central 2007-03-07 /pmc/articles/PMC1851021/ /pubmed/17343745 http://dx.doi.org/10.1186/1471-2105-8-80 Text en Copyright © 2007 Conlon et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Conlon, Erin M
Song, Joon J
Liu, Anna
Bayesian meta-analysis models for microarray data: a comparative study
title Bayesian meta-analysis models for microarray data: a comparative study
title_full Bayesian meta-analysis models for microarray data: a comparative study
title_fullStr Bayesian meta-analysis models for microarray data: a comparative study
title_full_unstemmed Bayesian meta-analysis models for microarray data: a comparative study
title_short Bayesian meta-analysis models for microarray data: a comparative study
title_sort bayesian meta-analysis models for microarray data: a comparative study
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1851021/
https://www.ncbi.nlm.nih.gov/pubmed/17343745
http://dx.doi.org/10.1186/1471-2105-8-80
work_keys_str_mv AT conlonerinm bayesianmetaanalysismodelsformicroarraydataacomparativestudy
AT songjoonj bayesianmetaanalysismodelsformicroarraydataacomparativestudy
AT liuanna bayesianmetaanalysismodelsformicroarraydataacomparativestudy