Cargando…

Simultaneous Analysis of All SNPs in Genome-Wide and Re-Sequencing Association Studies

Testing one SNP at a time does not fully realise the potential of genome-wide association studies to identify multiple causal variants, which is a plausible scenario for many complex diseases. We show that simultaneous analysis of the entire set of SNPs from a genome-wide study to identify the subse...

Descripción completa

Detalles Bibliográficos
Autores principales: Hoggart, Clive J., Whittaker, John C., De Iorio, Maria, Balding, David J.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2464715/
https://www.ncbi.nlm.nih.gov/pubmed/18654633
http://dx.doi.org/10.1371/journal.pgen.1000130
_version_ 1782157432873549824
author Hoggart, Clive J.
Whittaker, John C.
De Iorio, Maria
Balding, David J.
author_facet Hoggart, Clive J.
Whittaker, John C.
De Iorio, Maria
Balding, David J.
author_sort Hoggart, Clive J.
collection PubMed
description Testing one SNP at a time does not fully realise the potential of genome-wide association studies to identify multiple causal variants, which is a plausible scenario for many complex diseases. We show that simultaneous analysis of the entire set of SNPs from a genome-wide study to identify the subset that best predicts disease outcome is now feasible, thanks to developments in stochastic search methods. We used a Bayesian-inspired penalised maximum likelihood approach in which every SNP can be considered for additive, dominant, and recessive contributions to disease risk. Posterior mode estimates were obtained for regression coefficients that were each assigned a prior with a sharp mode at zero. A non-zero coefficient estimate was interpreted as corresponding to a significant SNP. We investigated two prior distributions and show that the normal-exponential-gamma prior leads to improved SNP selection in comparison with single-SNP tests. We also derived an explicit approximation for type-I error that avoids the need to use permutation procedures. As well as genome-wide analyses, our method is well-suited to fine mapping with very dense SNP sets obtained from re-sequencing and/or imputation. It can accommodate quantitative as well as case-control phenotypes, covariate adjustment, and can be extended to search for interactions. Here, we demonstrate the power and empirical type-I error of our approach using simulated case-control data sets of up to 500 K SNPs, a real genome-wide data set of 300 K SNPs, and a sequence-based dataset, each of which can be analysed in a few hours on a desktop workstation.
format Text
id pubmed-2464715
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-24647152008-07-25 Simultaneous Analysis of All SNPs in Genome-Wide and Re-Sequencing Association Studies Hoggart, Clive J. Whittaker, John C. De Iorio, Maria Balding, David J. PLoS Genet Research Article Testing one SNP at a time does not fully realise the potential of genome-wide association studies to identify multiple causal variants, which is a plausible scenario for many complex diseases. We show that simultaneous analysis of the entire set of SNPs from a genome-wide study to identify the subset that best predicts disease outcome is now feasible, thanks to developments in stochastic search methods. We used a Bayesian-inspired penalised maximum likelihood approach in which every SNP can be considered for additive, dominant, and recessive contributions to disease risk. Posterior mode estimates were obtained for regression coefficients that were each assigned a prior with a sharp mode at zero. A non-zero coefficient estimate was interpreted as corresponding to a significant SNP. We investigated two prior distributions and show that the normal-exponential-gamma prior leads to improved SNP selection in comparison with single-SNP tests. We also derived an explicit approximation for type-I error that avoids the need to use permutation procedures. As well as genome-wide analyses, our method is well-suited to fine mapping with very dense SNP sets obtained from re-sequencing and/or imputation. It can accommodate quantitative as well as case-control phenotypes, covariate adjustment, and can be extended to search for interactions. Here, we demonstrate the power and empirical type-I error of our approach using simulated case-control data sets of up to 500 K SNPs, a real genome-wide data set of 300 K SNPs, and a sequence-based dataset, each of which can be analysed in a few hours on a desktop workstation. Public Library of Science 2008-07-25 /pmc/articles/PMC2464715/ /pubmed/18654633 http://dx.doi.org/10.1371/journal.pgen.1000130 Text en Hoggart et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Hoggart, Clive J.
Whittaker, John C.
De Iorio, Maria
Balding, David J.
Simultaneous Analysis of All SNPs in Genome-Wide and Re-Sequencing Association Studies
title Simultaneous Analysis of All SNPs in Genome-Wide and Re-Sequencing Association Studies
title_full Simultaneous Analysis of All SNPs in Genome-Wide and Re-Sequencing Association Studies
title_fullStr Simultaneous Analysis of All SNPs in Genome-Wide and Re-Sequencing Association Studies
title_full_unstemmed Simultaneous Analysis of All SNPs in Genome-Wide and Re-Sequencing Association Studies
title_short Simultaneous Analysis of All SNPs in Genome-Wide and Re-Sequencing Association Studies
title_sort simultaneous analysis of all snps in genome-wide and re-sequencing association studies
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2464715/
https://www.ncbi.nlm.nih.gov/pubmed/18654633
http://dx.doi.org/10.1371/journal.pgen.1000130
work_keys_str_mv AT hoggartclivej simultaneousanalysisofallsnpsingenomewideandresequencingassociationstudies
AT whittakerjohnc simultaneousanalysisofallsnpsingenomewideandresequencingassociationstudies
AT deioriomaria simultaneousanalysisofallsnpsingenomewideandresequencingassociationstudies
AT baldingdavidj simultaneousanalysisofallsnpsingenomewideandresequencingassociationstudies