Cargando…

Optimizing the Power of Genome-Wide Association Studies by Using Publicly Available Reference Samples to Expand the Control Group

Genome-wide association (GWA) studies have proved extremely successful in identifying novel genetic loci contributing effects to complex human diseases. In doing so, they have highlighted the fact that many potential loci of modest effect remain undetected, partly due to the need for samples consist...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhuang, Joanna J, Zondervan, Krina, Nyberg, Fredrik, Harbron, Chris, Jawaid, Ansar, Cardon, Lon R, Barratt, Bryan J, Morris, Andrew P
Formato: Texto
Lenguaje:English
Publicado: Wiley Subscription Services, Inc., A Wiley Company 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2962805/
https://www.ncbi.nlm.nih.gov/pubmed/20088020
http://dx.doi.org/10.1002/gepi.20482
_version_ 1782189214402609152
author Zhuang, Joanna J
Zondervan, Krina
Nyberg, Fredrik
Harbron, Chris
Jawaid, Ansar
Cardon, Lon R
Barratt, Bryan J
Morris, Andrew P
author_facet Zhuang, Joanna J
Zondervan, Krina
Nyberg, Fredrik
Harbron, Chris
Jawaid, Ansar
Cardon, Lon R
Barratt, Bryan J
Morris, Andrew P
author_sort Zhuang, Joanna J
collection PubMed
description Genome-wide association (GWA) studies have proved extremely successful in identifying novel genetic loci contributing effects to complex human diseases. In doing so, they have highlighted the fact that many potential loci of modest effect remain undetected, partly due to the need for samples consisting of many thousands of individuals. Large-scale international initiatives, such as the Wellcome Trust Case Control Consortium, the Genetic Association Information Network, and the database of genetic and phenotypic information, aim to facilitate discovery of modest-effect genes by making genome-wide data publicly available, allowing information to be combined for the purpose of pooled analysis. In principle, disease or control samples from these studies could be used to increase the power of any GWA study via judicious use as “genetically matched controls” for other traits. Here, we present the biological motivation for the problem and the theoretical potential for expanding the control group with publicly available disease or reference samples. We demonstrate that a naïve application of this strategy can greatly inflate the false-positive error rate in the presence of population structure. As a remedy, we make use of genome-wide data and model selection techniques to identify “axes” of genetic variation which are associated with disease. These axes are then included as covariates in association analysis to correct for population structure, which can result in increases in power over standard analysis of genetic information from the samples in the original GWA study. Genet. Epidemiol. 34: 319–326, 2010. © 2010 Wiley-Liss, Inc.
format Text
id pubmed-2962805
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Wiley Subscription Services, Inc., A Wiley Company
record_format MEDLINE/PubMed
spelling pubmed-29628052010-11-02 Optimizing the Power of Genome-Wide Association Studies by Using Publicly Available Reference Samples to Expand the Control Group Zhuang, Joanna J Zondervan, Krina Nyberg, Fredrik Harbron, Chris Jawaid, Ansar Cardon, Lon R Barratt, Bryan J Morris, Andrew P Genet Epidemiol Original Article Genome-wide association (GWA) studies have proved extremely successful in identifying novel genetic loci contributing effects to complex human diseases. In doing so, they have highlighted the fact that many potential loci of modest effect remain undetected, partly due to the need for samples consisting of many thousands of individuals. Large-scale international initiatives, such as the Wellcome Trust Case Control Consortium, the Genetic Association Information Network, and the database of genetic and phenotypic information, aim to facilitate discovery of modest-effect genes by making genome-wide data publicly available, allowing information to be combined for the purpose of pooled analysis. In principle, disease or control samples from these studies could be used to increase the power of any GWA study via judicious use as “genetically matched controls” for other traits. Here, we present the biological motivation for the problem and the theoretical potential for expanding the control group with publicly available disease or reference samples. We demonstrate that a naïve application of this strategy can greatly inflate the false-positive error rate in the presence of population structure. As a remedy, we make use of genome-wide data and model selection techniques to identify “axes” of genetic variation which are associated with disease. These axes are then included as covariates in association analysis to correct for population structure, which can result in increases in power over standard analysis of genetic information from the samples in the original GWA study. Genet. Epidemiol. 34: 319–326, 2010. © 2010 Wiley-Liss, Inc. Wiley Subscription Services, Inc., A Wiley Company 2010-05 2010-01-20 /pmc/articles/PMC2962805/ /pubmed/20088020 http://dx.doi.org/10.1002/gepi.20482 Text en Copyright © 2010 Wiley-Liss, Inc., A Wiley Company http://creativecommons.org/licenses/by/2.5/ Re-use of this article is permitted in accordance with the Creative Commons Deed, Attribution 2.5, which does not permit commercial exploitation.
spellingShingle Original Article
Zhuang, Joanna J
Zondervan, Krina
Nyberg, Fredrik
Harbron, Chris
Jawaid, Ansar
Cardon, Lon R
Barratt, Bryan J
Morris, Andrew P
Optimizing the Power of Genome-Wide Association Studies by Using Publicly Available Reference Samples to Expand the Control Group
title Optimizing the Power of Genome-Wide Association Studies by Using Publicly Available Reference Samples to Expand the Control Group
title_full Optimizing the Power of Genome-Wide Association Studies by Using Publicly Available Reference Samples to Expand the Control Group
title_fullStr Optimizing the Power of Genome-Wide Association Studies by Using Publicly Available Reference Samples to Expand the Control Group
title_full_unstemmed Optimizing the Power of Genome-Wide Association Studies by Using Publicly Available Reference Samples to Expand the Control Group
title_short Optimizing the Power of Genome-Wide Association Studies by Using Publicly Available Reference Samples to Expand the Control Group
title_sort optimizing the power of genome-wide association studies by using publicly available reference samples to expand the control group
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2962805/
https://www.ncbi.nlm.nih.gov/pubmed/20088020
http://dx.doi.org/10.1002/gepi.20482
work_keys_str_mv AT zhuangjoannaj optimizingthepowerofgenomewideassociationstudiesbyusingpubliclyavailablereferencesamplestoexpandthecontrolgroup
AT zondervankrina optimizingthepowerofgenomewideassociationstudiesbyusingpubliclyavailablereferencesamplestoexpandthecontrolgroup
AT nybergfredrik optimizingthepowerofgenomewideassociationstudiesbyusingpubliclyavailablereferencesamplestoexpandthecontrolgroup
AT harbronchris optimizingthepowerofgenomewideassociationstudiesbyusingpubliclyavailablereferencesamplestoexpandthecontrolgroup
AT jawaidansar optimizingthepowerofgenomewideassociationstudiesbyusingpubliclyavailablereferencesamplestoexpandthecontrolgroup
AT cardonlonr optimizingthepowerofgenomewideassociationstudiesbyusingpubliclyavailablereferencesamplestoexpandthecontrolgroup
AT barrattbryanj optimizingthepowerofgenomewideassociationstudiesbyusingpubliclyavailablereferencesamplestoexpandthecontrolgroup
AT morrisandrewp optimizingthepowerofgenomewideassociationstudiesbyusingpubliclyavailablereferencesamplestoexpandthecontrolgroup