Cargando…

A Bayesian Method for Evaluating and Discovering Disease Loci Associations

BACKGROUND: A genome-wide association study (GWAS) typically involves examining representative SNPs in individuals from some population. A GWAS data set can concern a million SNPs and may soon concern billions. Researchers investigate the association of each SNP individually with a disease, and it i...

Descripción completa

Detalles Bibliográficos
Autores principales: Jiang, Xia, Barmada, M. Michael, Cooper, Gregory F., Becich, Michael J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3154195/
https://www.ncbi.nlm.nih.gov/pubmed/21853025
http://dx.doi.org/10.1371/journal.pone.0022075
_version_ 1782209992964702208
author Jiang, Xia
Barmada, M. Michael
Cooper, Gregory F.
Becich, Michael J.
author_facet Jiang, Xia
Barmada, M. Michael
Cooper, Gregory F.
Becich, Michael J.
author_sort Jiang, Xia
collection PubMed
description BACKGROUND: A genome-wide association study (GWAS) typically involves examining representative SNPs in individuals from some population. A GWAS data set can concern a million SNPs and may soon concern billions. Researchers investigate the association of each SNP individually with a disease, and it is becoming increasingly commonplace to also analyze multi-SNP associations. Techniques for handling so many hypotheses include the Bonferroni correction and recently developed Bayesian methods. These methods can encounter problems. Most importantly, they are not applicable to a complex multi-locus hypothesis which has several competing hypotheses rather than only a null hypothesis. A method that computes the posterior probability of complex hypotheses is a pressing need. METHODOLOGY/FINDINGS: We introduce the Bayesian network posterior probability (BNPP) method which addresses the difficulties. The method represents the relationship between a disease and SNPs using a directed acyclic graph (DAG) model, and computes the likelihood of such models using a Bayesian network scoring criterion. The posterior probability of a hypothesis is computed based on the likelihoods of all competing hypotheses. The BNPP can not only be used to evaluate a hypothesis that has previously been discovered or suspected, but also to discover new disease loci associations. The results of experiments using simulated and real data sets are presented. Our results concerning simulated data sets indicate that the BNPP exhibits both better evaluation and discovery performance than does a p-value based method. For the real data sets, previous findings in the literature are confirmed and additional findings are found. CONCLUSIONS/SIGNIFICANCE: We conclude that the BNPP resolves a pressing problem by providing a way to compute the posterior probability of complex multi-locus hypotheses. A researcher can use the BNPP to determine the expected utility of investigating a hypothesis further. Furthermore, we conclude that the BNPP is a promising method for discovering disease loci associations.
format Online
Article
Text
id pubmed-3154195
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-31541952011-08-18 A Bayesian Method for Evaluating and Discovering Disease Loci Associations Jiang, Xia Barmada, M. Michael Cooper, Gregory F. Becich, Michael J. PLoS One Research Article BACKGROUND: A genome-wide association study (GWAS) typically involves examining representative SNPs in individuals from some population. A GWAS data set can concern a million SNPs and may soon concern billions. Researchers investigate the association of each SNP individually with a disease, and it is becoming increasingly commonplace to also analyze multi-SNP associations. Techniques for handling so many hypotheses include the Bonferroni correction and recently developed Bayesian methods. These methods can encounter problems. Most importantly, they are not applicable to a complex multi-locus hypothesis which has several competing hypotheses rather than only a null hypothesis. A method that computes the posterior probability of complex hypotheses is a pressing need. METHODOLOGY/FINDINGS: We introduce the Bayesian network posterior probability (BNPP) method which addresses the difficulties. The method represents the relationship between a disease and SNPs using a directed acyclic graph (DAG) model, and computes the likelihood of such models using a Bayesian network scoring criterion. The posterior probability of a hypothesis is computed based on the likelihoods of all competing hypotheses. The BNPP can not only be used to evaluate a hypothesis that has previously been discovered or suspected, but also to discover new disease loci associations. The results of experiments using simulated and real data sets are presented. Our results concerning simulated data sets indicate that the BNPP exhibits both better evaluation and discovery performance than does a p-value based method. For the real data sets, previous findings in the literature are confirmed and additional findings are found. CONCLUSIONS/SIGNIFICANCE: We conclude that the BNPP resolves a pressing problem by providing a way to compute the posterior probability of complex multi-locus hypotheses. A researcher can use the BNPP to determine the expected utility of investigating a hypothesis further. Furthermore, we conclude that the BNPP is a promising method for discovering disease loci associations. Public Library of Science 2011-08-10 /pmc/articles/PMC3154195/ /pubmed/21853025 http://dx.doi.org/10.1371/journal.pone.0022075 Text en Jiang et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Jiang, Xia
Barmada, M. Michael
Cooper, Gregory F.
Becich, Michael J.
A Bayesian Method for Evaluating and Discovering Disease Loci Associations
title A Bayesian Method for Evaluating and Discovering Disease Loci Associations
title_full A Bayesian Method for Evaluating and Discovering Disease Loci Associations
title_fullStr A Bayesian Method for Evaluating and Discovering Disease Loci Associations
title_full_unstemmed A Bayesian Method for Evaluating and Discovering Disease Loci Associations
title_short A Bayesian Method for Evaluating and Discovering Disease Loci Associations
title_sort bayesian method for evaluating and discovering disease loci associations
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3154195/
https://www.ncbi.nlm.nih.gov/pubmed/21853025
http://dx.doi.org/10.1371/journal.pone.0022075
work_keys_str_mv AT jiangxia abayesianmethodforevaluatinganddiscoveringdiseaselociassociations
AT barmadammichael abayesianmethodforevaluatinganddiscoveringdiseaselociassociations
AT coopergregoryf abayesianmethodforevaluatinganddiscoveringdiseaselociassociations
AT becichmichaelj abayesianmethodforevaluatinganddiscoveringdiseaselociassociations
AT jiangxia bayesianmethodforevaluatinganddiscoveringdiseaselociassociations
AT barmadammichael bayesianmethodforevaluatinganddiscoveringdiseaselociassociations
AT coopergregoryf bayesianmethodforevaluatinganddiscoveringdiseaselociassociations
AT becichmichaelj bayesianmethodforevaluatinganddiscoveringdiseaselociassociations