Cargando…

A powerful Bayesian meta-analysis method to integrate multiple gene set enrichment studies

Motivation: Much research effort has been devoted to the identification of enriched gene sets for microarray experiments. However, identified gene sets are often found to be inconsistent among independent studies. This is probably owing to the noisy data of microarray experiments coupled with small...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Min, Zang, Miao, Wang, Xinlei, Xiao, Guanghua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3605602/
https://www.ncbi.nlm.nih.gov/pubmed/23418184
http://dx.doi.org/10.1093/bioinformatics/btt068
_version_ 1782263921646764032
author Chen, Min
Zang, Miao
Wang, Xinlei
Xiao, Guanghua
author_facet Chen, Min
Zang, Miao
Wang, Xinlei
Xiao, Guanghua
author_sort Chen, Min
collection PubMed
description Motivation: Much research effort has been devoted to the identification of enriched gene sets for microarray experiments. However, identified gene sets are often found to be inconsistent among independent studies. This is probably owing to the noisy data of microarray experiments coupled with small sample sizes of individual studies. Therefore, combining information from multiple studies is likely to improve the detection of truly enriched gene classes. As more and more data become available, it calls for statistical methods to integrate information from multiple studies, also known as meta-analysis, to improve the power of identifying enriched gene sets. Results: We propose a Bayesian model that provides a coherent framework for joint modeling of both gene set information and gene expression data from multiple studies, to improve the detection of enriched gene sets by leveraging information from different sources available. One distinct feature of our method is that it directly models the gene expression data, instead of using summary statistics, when synthesizing studies. Besides, the proposed model is flexible and offers an appropriate treatment of between-study heterogeneities that frequently arise in the meta-analysis of microarray experiments. We show that under our Bayesian model, the full posterior conditionals all have known distributions, which greatly facilitates the MCMC computation. Simulation results show that the proposed method can improve the power of gene set enrichment meta-analysis, as opposed to existing methods developed by Shen and Tseng (2010, Bioinformatics, 26, 1316–1323), and it is not sensitive to mild or moderate deviations from the distributional assumption for gene expression data. We illustrate the proposed method through an application of combining eight lung cancer datasets for gene set enrichment analysis, which demonstrates the usefulness of the method. Availability: http://qbrc.swmed.edu/software/ Contact: Min.Chen@UTSouthwestern.edu Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-3605602
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-36056022013-03-22 A powerful Bayesian meta-analysis method to integrate multiple gene set enrichment studies Chen, Min Zang, Miao Wang, Xinlei Xiao, Guanghua Bioinformatics Original Papers Motivation: Much research effort has been devoted to the identification of enriched gene sets for microarray experiments. However, identified gene sets are often found to be inconsistent among independent studies. This is probably owing to the noisy data of microarray experiments coupled with small sample sizes of individual studies. Therefore, combining information from multiple studies is likely to improve the detection of truly enriched gene classes. As more and more data become available, it calls for statistical methods to integrate information from multiple studies, also known as meta-analysis, to improve the power of identifying enriched gene sets. Results: We propose a Bayesian model that provides a coherent framework for joint modeling of both gene set information and gene expression data from multiple studies, to improve the detection of enriched gene sets by leveraging information from different sources available. One distinct feature of our method is that it directly models the gene expression data, instead of using summary statistics, when synthesizing studies. Besides, the proposed model is flexible and offers an appropriate treatment of between-study heterogeneities that frequently arise in the meta-analysis of microarray experiments. We show that under our Bayesian model, the full posterior conditionals all have known distributions, which greatly facilitates the MCMC computation. Simulation results show that the proposed method can improve the power of gene set enrichment meta-analysis, as opposed to existing methods developed by Shen and Tseng (2010, Bioinformatics, 26, 1316–1323), and it is not sensitive to mild or moderate deviations from the distributional assumption for gene expression data. We illustrate the proposed method through an application of combining eight lung cancer datasets for gene set enrichment analysis, which demonstrates the usefulness of the method. Availability: http://qbrc.swmed.edu/software/ Contact: Min.Chen@UTSouthwestern.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2013-04-01 2013-02-15 /pmc/articles/PMC3605602/ /pubmed/23418184 http://dx.doi.org/10.1093/bioinformatics/btt068 Text en © The Author 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Chen, Min
Zang, Miao
Wang, Xinlei
Xiao, Guanghua
A powerful Bayesian meta-analysis method to integrate multiple gene set enrichment studies
title A powerful Bayesian meta-analysis method to integrate multiple gene set enrichment studies
title_full A powerful Bayesian meta-analysis method to integrate multiple gene set enrichment studies
title_fullStr A powerful Bayesian meta-analysis method to integrate multiple gene set enrichment studies
title_full_unstemmed A powerful Bayesian meta-analysis method to integrate multiple gene set enrichment studies
title_short A powerful Bayesian meta-analysis method to integrate multiple gene set enrichment studies
title_sort powerful bayesian meta-analysis method to integrate multiple gene set enrichment studies
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3605602/
https://www.ncbi.nlm.nih.gov/pubmed/23418184
http://dx.doi.org/10.1093/bioinformatics/btt068
work_keys_str_mv AT chenmin apowerfulbayesianmetaanalysismethodtointegratemultiplegenesetenrichmentstudies
AT zangmiao apowerfulbayesianmetaanalysismethodtointegratemultiplegenesetenrichmentstudies
AT wangxinlei apowerfulbayesianmetaanalysismethodtointegratemultiplegenesetenrichmentstudies
AT xiaoguanghua apowerfulbayesianmetaanalysismethodtointegratemultiplegenesetenrichmentstudies
AT chenmin powerfulbayesianmetaanalysismethodtointegratemultiplegenesetenrichmentstudies
AT zangmiao powerfulbayesianmetaanalysismethodtointegratemultiplegenesetenrichmentstudies
AT wangxinlei powerfulbayesianmetaanalysismethodtointegratemultiplegenesetenrichmentstudies
AT xiaoguanghua powerfulbayesianmetaanalysismethodtointegratemultiplegenesetenrichmentstudies