Cargando…

Integrative Bayesian variable selection with gene-based informative priors for genome-wide association studies

BACKGROUND: Genome-wide Association Studies (GWAS) are typically designed to identify phenotype-associated single nucleotide polymorphisms (SNPs) individually using univariate analysis methods. Though providing valuable insights into genetic risks of common diseases, the genetic variants identified...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Xiaoshuai, Xue, Fuzhong, Liu, Hong, Zhu, Dianwen, Peng, Bin, Wiemels, Joseph L, Yang, Xiaowei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4275962/
https://www.ncbi.nlm.nih.gov/pubmed/25491445
http://dx.doi.org/10.1186/s12863-014-0130-7
_version_ 1782350202234994688
author Zhang, Xiaoshuai
Xue, Fuzhong
Liu, Hong
Zhu, Dianwen
Peng, Bin
Wiemels, Joseph L
Yang, Xiaowei
author_facet Zhang, Xiaoshuai
Xue, Fuzhong
Liu, Hong
Zhu, Dianwen
Peng, Bin
Wiemels, Joseph L
Yang, Xiaowei
author_sort Zhang, Xiaoshuai
collection PubMed
description BACKGROUND: Genome-wide Association Studies (GWAS) are typically designed to identify phenotype-associated single nucleotide polymorphisms (SNPs) individually using univariate analysis methods. Though providing valuable insights into genetic risks of common diseases, the genetic variants identified by GWAS generally account for only a small proportion of the total heritability for complex diseases. To solve this “missing heritability” problem, we implemented a strategy called integrative Bayesian Variable Selection (iBVS), which is based on a hierarchical model that incorporates an informative prior by considering the gene interrelationship as a network. It was applied here to both simulated and real data sets. RESULTS: Simulation studies indicated that the iBVS method was advantageous in its performance with highest AUC in both variable selection and outcome prediction, when compared to Stepwise and LASSO based strategies. In an analysis of a leprosy case–control study, iBVS selected 94 SNPs as predictors, while LASSO selected 100 SNPs. The Stepwise regression yielded a more parsimonious model with only 3 SNPs. The prediction results demonstrated that the iBVS method had comparable performance with that of LASSO, but better than Stepwise strategies. CONCLUSIONS: The proposed iBVS strategy is a novel and valid method for Genome-wide Association Studies, with the additional advantage in that it produces more interpretable posterior probabilities for each variable unlike LASSO and other penalized regression methods. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12863-014-0130-7) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4275962
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42759622015-01-13 Integrative Bayesian variable selection with gene-based informative priors for genome-wide association studies Zhang, Xiaoshuai Xue, Fuzhong Liu, Hong Zhu, Dianwen Peng, Bin Wiemels, Joseph L Yang, Xiaowei BMC Genet Research Article BACKGROUND: Genome-wide Association Studies (GWAS) are typically designed to identify phenotype-associated single nucleotide polymorphisms (SNPs) individually using univariate analysis methods. Though providing valuable insights into genetic risks of common diseases, the genetic variants identified by GWAS generally account for only a small proportion of the total heritability for complex diseases. To solve this “missing heritability” problem, we implemented a strategy called integrative Bayesian Variable Selection (iBVS), which is based on a hierarchical model that incorporates an informative prior by considering the gene interrelationship as a network. It was applied here to both simulated and real data sets. RESULTS: Simulation studies indicated that the iBVS method was advantageous in its performance with highest AUC in both variable selection and outcome prediction, when compared to Stepwise and LASSO based strategies. In an analysis of a leprosy case–control study, iBVS selected 94 SNPs as predictors, while LASSO selected 100 SNPs. The Stepwise regression yielded a more parsimonious model with only 3 SNPs. The prediction results demonstrated that the iBVS method had comparable performance with that of LASSO, but better than Stepwise strategies. CONCLUSIONS: The proposed iBVS strategy is a novel and valid method for Genome-wide Association Studies, with the additional advantage in that it produces more interpretable posterior probabilities for each variable unlike LASSO and other penalized regression methods. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12863-014-0130-7) contains supplementary material, which is available to authorized users. BioMed Central 2014-12-10 /pmc/articles/PMC4275962/ /pubmed/25491445 http://dx.doi.org/10.1186/s12863-014-0130-7 Text en © Zhang et al.; licensee BioMed Central. 2014 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Zhang, Xiaoshuai
Xue, Fuzhong
Liu, Hong
Zhu, Dianwen
Peng, Bin
Wiemels, Joseph L
Yang, Xiaowei
Integrative Bayesian variable selection with gene-based informative priors for genome-wide association studies
title Integrative Bayesian variable selection with gene-based informative priors for genome-wide association studies
title_full Integrative Bayesian variable selection with gene-based informative priors for genome-wide association studies
title_fullStr Integrative Bayesian variable selection with gene-based informative priors for genome-wide association studies
title_full_unstemmed Integrative Bayesian variable selection with gene-based informative priors for genome-wide association studies
title_short Integrative Bayesian variable selection with gene-based informative priors for genome-wide association studies
title_sort integrative bayesian variable selection with gene-based informative priors for genome-wide association studies
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4275962/
https://www.ncbi.nlm.nih.gov/pubmed/25491445
http://dx.doi.org/10.1186/s12863-014-0130-7
work_keys_str_mv AT zhangxiaoshuai integrativebayesianvariableselectionwithgenebasedinformativepriorsforgenomewideassociationstudies
AT xuefuzhong integrativebayesianvariableselectionwithgenebasedinformativepriorsforgenomewideassociationstudies
AT liuhong integrativebayesianvariableselectionwithgenebasedinformativepriorsforgenomewideassociationstudies
AT zhudianwen integrativebayesianvariableselectionwithgenebasedinformativepriorsforgenomewideassociationstudies
AT pengbin integrativebayesianvariableselectionwithgenebasedinformativepriorsforgenomewideassociationstudies
AT wiemelsjosephl integrativebayesianvariableselectionwithgenebasedinformativepriorsforgenomewideassociationstudies
AT yangxiaowei integrativebayesianvariableselectionwithgenebasedinformativepriorsforgenomewideassociationstudies