Cargando…
Integrative Bayesian variable selection with gene-based informative priors for genome-wide association studies
BACKGROUND: Genome-wide Association Studies (GWAS) are typically designed to identify phenotype-associated single nucleotide polymorphisms (SNPs) individually using univariate analysis methods. Though providing valuable insights into genetic risks of common diseases, the genetic variants identified...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4275962/ https://www.ncbi.nlm.nih.gov/pubmed/25491445 http://dx.doi.org/10.1186/s12863-014-0130-7 |
_version_ | 1782350202234994688 |
---|---|
author | Zhang, Xiaoshuai Xue, Fuzhong Liu, Hong Zhu, Dianwen Peng, Bin Wiemels, Joseph L Yang, Xiaowei |
author_facet | Zhang, Xiaoshuai Xue, Fuzhong Liu, Hong Zhu, Dianwen Peng, Bin Wiemels, Joseph L Yang, Xiaowei |
author_sort | Zhang, Xiaoshuai |
collection | PubMed |
description | BACKGROUND: Genome-wide Association Studies (GWAS) are typically designed to identify phenotype-associated single nucleotide polymorphisms (SNPs) individually using univariate analysis methods. Though providing valuable insights into genetic risks of common diseases, the genetic variants identified by GWAS generally account for only a small proportion of the total heritability for complex diseases. To solve this “missing heritability” problem, we implemented a strategy called integrative Bayesian Variable Selection (iBVS), which is based on a hierarchical model that incorporates an informative prior by considering the gene interrelationship as a network. It was applied here to both simulated and real data sets. RESULTS: Simulation studies indicated that the iBVS method was advantageous in its performance with highest AUC in both variable selection and outcome prediction, when compared to Stepwise and LASSO based strategies. In an analysis of a leprosy case–control study, iBVS selected 94 SNPs as predictors, while LASSO selected 100 SNPs. The Stepwise regression yielded a more parsimonious model with only 3 SNPs. The prediction results demonstrated that the iBVS method had comparable performance with that of LASSO, but better than Stepwise strategies. CONCLUSIONS: The proposed iBVS strategy is a novel and valid method for Genome-wide Association Studies, with the additional advantage in that it produces more interpretable posterior probabilities for each variable unlike LASSO and other penalized regression methods. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12863-014-0130-7) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4275962 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-42759622015-01-13 Integrative Bayesian variable selection with gene-based informative priors for genome-wide association studies Zhang, Xiaoshuai Xue, Fuzhong Liu, Hong Zhu, Dianwen Peng, Bin Wiemels, Joseph L Yang, Xiaowei BMC Genet Research Article BACKGROUND: Genome-wide Association Studies (GWAS) are typically designed to identify phenotype-associated single nucleotide polymorphisms (SNPs) individually using univariate analysis methods. Though providing valuable insights into genetic risks of common diseases, the genetic variants identified by GWAS generally account for only a small proportion of the total heritability for complex diseases. To solve this “missing heritability” problem, we implemented a strategy called integrative Bayesian Variable Selection (iBVS), which is based on a hierarchical model that incorporates an informative prior by considering the gene interrelationship as a network. It was applied here to both simulated and real data sets. RESULTS: Simulation studies indicated that the iBVS method was advantageous in its performance with highest AUC in both variable selection and outcome prediction, when compared to Stepwise and LASSO based strategies. In an analysis of a leprosy case–control study, iBVS selected 94 SNPs as predictors, while LASSO selected 100 SNPs. The Stepwise regression yielded a more parsimonious model with only 3 SNPs. The prediction results demonstrated that the iBVS method had comparable performance with that of LASSO, but better than Stepwise strategies. CONCLUSIONS: The proposed iBVS strategy is a novel and valid method for Genome-wide Association Studies, with the additional advantage in that it produces more interpretable posterior probabilities for each variable unlike LASSO and other penalized regression methods. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12863-014-0130-7) contains supplementary material, which is available to authorized users. BioMed Central 2014-12-10 /pmc/articles/PMC4275962/ /pubmed/25491445 http://dx.doi.org/10.1186/s12863-014-0130-7 Text en © Zhang et al.; licensee BioMed Central. 2014 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Zhang, Xiaoshuai Xue, Fuzhong Liu, Hong Zhu, Dianwen Peng, Bin Wiemels, Joseph L Yang, Xiaowei Integrative Bayesian variable selection with gene-based informative priors for genome-wide association studies |
title | Integrative Bayesian variable selection with gene-based informative priors for genome-wide association studies |
title_full | Integrative Bayesian variable selection with gene-based informative priors for genome-wide association studies |
title_fullStr | Integrative Bayesian variable selection with gene-based informative priors for genome-wide association studies |
title_full_unstemmed | Integrative Bayesian variable selection with gene-based informative priors for genome-wide association studies |
title_short | Integrative Bayesian variable selection with gene-based informative priors for genome-wide association studies |
title_sort | integrative bayesian variable selection with gene-based informative priors for genome-wide association studies |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4275962/ https://www.ncbi.nlm.nih.gov/pubmed/25491445 http://dx.doi.org/10.1186/s12863-014-0130-7 |
work_keys_str_mv | AT zhangxiaoshuai integrativebayesianvariableselectionwithgenebasedinformativepriorsforgenomewideassociationstudies AT xuefuzhong integrativebayesianvariableselectionwithgenebasedinformativepriorsforgenomewideassociationstudies AT liuhong integrativebayesianvariableselectionwithgenebasedinformativepriorsforgenomewideassociationstudies AT zhudianwen integrativebayesianvariableselectionwithgenebasedinformativepriorsforgenomewideassociationstudies AT pengbin integrativebayesianvariableselectionwithgenebasedinformativepriorsforgenomewideassociationstudies AT wiemelsjosephl integrativebayesianvariableselectionwithgenebasedinformativepriorsforgenomewideassociationstudies AT yangxiaowei integrativebayesianvariableselectionwithgenebasedinformativepriorsforgenomewideassociationstudies |