Cargando…

Statistical correction of the Winner’s Curse explains replication variability in quantitative trait genome-wide association studies

Genome-wide association studies (GWAS) have identified hundreds of SNPs responsible for variation in human quantitative traits. However, genome-wide-significant associations often fail to replicate across independent cohorts, in apparent inconsistency with their apparent strong effects in discovery...

Descripción completa

Detalles Bibliográficos
Autores principales: Palmer, Cameron, Pe’er, Itsik
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5536394/
https://www.ncbi.nlm.nih.gov/pubmed/28715421
http://dx.doi.org/10.1371/journal.pgen.1006916
_version_ 1783254012067315712
author Palmer, Cameron
Pe’er, Itsik
author_facet Palmer, Cameron
Pe’er, Itsik
author_sort Palmer, Cameron
collection PubMed
description Genome-wide association studies (GWAS) have identified hundreds of SNPs responsible for variation in human quantitative traits. However, genome-wide-significant associations often fail to replicate across independent cohorts, in apparent inconsistency with their apparent strong effects in discovery cohorts. This limited success of replication raises pervasive questions about the utility of the GWAS field. We identify all 332 studies of quantitative traits from the NHGRI-EBI GWAS Database with attempted replication. We find that the majority of studies provide insufficient data to evaluate replication rates. The remaining papers replicate significantly worse than expected (p < 10(−14)), even when adjusting for regression-to-the-mean of effect size between discovery- and replication-cohorts termed the Winner’s Curse (p < 10(−16)). We show this is due in part to misreporting replication cohort-size as a maximum number, rather than per-locus one. In 39 studies accurately reporting per-locus cohort-size for attempted replication of 707 loci in samples with similar ancestry, replication rate matched expectation (predicted 458, observed 457, p = 0.94). In contrast, ancestry differences between replication and discovery (13 studies, 385 loci) cause the most highly-powered decile of loci to replicate worse than expected, due to difference in linkage disequilibrium.
format Online
Article
Text
id pubmed-5536394
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-55363942017-08-07 Statistical correction of the Winner’s Curse explains replication variability in quantitative trait genome-wide association studies Palmer, Cameron Pe’er, Itsik PLoS Genet Research Article Genome-wide association studies (GWAS) have identified hundreds of SNPs responsible for variation in human quantitative traits. However, genome-wide-significant associations often fail to replicate across independent cohorts, in apparent inconsistency with their apparent strong effects in discovery cohorts. This limited success of replication raises pervasive questions about the utility of the GWAS field. We identify all 332 studies of quantitative traits from the NHGRI-EBI GWAS Database with attempted replication. We find that the majority of studies provide insufficient data to evaluate replication rates. The remaining papers replicate significantly worse than expected (p < 10(−14)), even when adjusting for regression-to-the-mean of effect size between discovery- and replication-cohorts termed the Winner’s Curse (p < 10(−16)). We show this is due in part to misreporting replication cohort-size as a maximum number, rather than per-locus one. In 39 studies accurately reporting per-locus cohort-size for attempted replication of 707 loci in samples with similar ancestry, replication rate matched expectation (predicted 458, observed 457, p = 0.94). In contrast, ancestry differences between replication and discovery (13 studies, 385 loci) cause the most highly-powered decile of loci to replicate worse than expected, due to difference in linkage disequilibrium. Public Library of Science 2017-07-17 /pmc/articles/PMC5536394/ /pubmed/28715421 http://dx.doi.org/10.1371/journal.pgen.1006916 Text en © 2017 Palmer, Pe’er http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Palmer, Cameron
Pe’er, Itsik
Statistical correction of the Winner’s Curse explains replication variability in quantitative trait genome-wide association studies
title Statistical correction of the Winner’s Curse explains replication variability in quantitative trait genome-wide association studies
title_full Statistical correction of the Winner’s Curse explains replication variability in quantitative trait genome-wide association studies
title_fullStr Statistical correction of the Winner’s Curse explains replication variability in quantitative trait genome-wide association studies
title_full_unstemmed Statistical correction of the Winner’s Curse explains replication variability in quantitative trait genome-wide association studies
title_short Statistical correction of the Winner’s Curse explains replication variability in quantitative trait genome-wide association studies
title_sort statistical correction of the winner’s curse explains replication variability in quantitative trait genome-wide association studies
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5536394/
https://www.ncbi.nlm.nih.gov/pubmed/28715421
http://dx.doi.org/10.1371/journal.pgen.1006916
work_keys_str_mv AT palmercameron statisticalcorrectionofthewinnerscurseexplainsreplicationvariabilityinquantitativetraitgenomewideassociationstudies
AT peeritsik statisticalcorrectionofthewinnerscurseexplainsreplicationvariabilityinquantitativetraitgenomewideassociationstudies