Cargando…

Power, false discovery rate and Winner’s Curse in eQTL studies

Investigation of the genetic architecture of gene expression traits has aided interpretation of disease and trait-associated genetic variants; however, key aspects of expression quantitative trait loci (eQTL) study design and analysis remain understudied. We used extensive, empirically driven simula...

Descripción completa

Detalles Bibliográficos
Autores principales:	Huang, Qin Qin, Ritchie, Scott C, Brozynska, Marta, Inouye, Michael
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Oxford University Press 2018
Materias:	Methods Online
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6294523/ https://www.ncbi.nlm.nih.gov/pubmed/30189032 http://dx.doi.org/10.1093/nar/gky780

_version_	1783380747609964544
author	Huang, Qin Qin Ritchie, Scott C Brozynska, Marta Inouye, Michael
author_facet	Huang, Qin Qin Ritchie, Scott C Brozynska, Marta Inouye, Michael
author_sort	Huang, Qin Qin
collection	PubMed
description	Investigation of the genetic architecture of gene expression traits has aided interpretation of disease and trait-associated genetic variants; however, key aspects of expression quantitative trait loci (eQTL) study design and analysis remain understudied. We used extensive, empirically driven simulations to explore eQTL study design and the performance of various analysis strategies. Across multiple testing correction methods, false discoveries of genes with eQTLs (eGenes) were substantially inflated when false discovery rate (FDR) control was applied to all tests and only appropriately controlled using hierarchical procedures. All multiple testing correction procedures had low power and inflated FDR for eGenes whose causal SNPs had small allele frequencies using small sample sizes (e.g. frequency <10% in 100 samples), indicating that even moderately low frequency eQTL SNPs (eSNPs) in these studies are enriched for false discoveries. In scenarios with ≥80% power, the top eSNP was the true simulated eSNP 90% of the time, but substantially less frequently for very common eSNPs (minor allele frequencies >25%). Overestimation of eQTL effect sizes, so-called ‘Winner’s Curse’, was common in low and moderate power settings. To address this, we developed a bootstrap method (BootstrapQTL) that led to more accurate effect size estimation. These insights provide a foundation for future eQTL studies, especially those with sampling constraints and subtly different conditions.
format	Online Article Text
id	pubmed-6294523
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-62945232018-12-21 Power, false discovery rate and Winner’s Curse in eQTL studies Huang, Qin Qin Ritchie, Scott C Brozynska, Marta Inouye, Michael Nucleic Acids Res Methods Online Investigation of the genetic architecture of gene expression traits has aided interpretation of disease and trait-associated genetic variants; however, key aspects of expression quantitative trait loci (eQTL) study design and analysis remain understudied. We used extensive, empirically driven simulations to explore eQTL study design and the performance of various analysis strategies. Across multiple testing correction methods, false discoveries of genes with eQTLs (eGenes) were substantially inflated when false discovery rate (FDR) control was applied to all tests and only appropriately controlled using hierarchical procedures. All multiple testing correction procedures had low power and inflated FDR for eGenes whose causal SNPs had small allele frequencies using small sample sizes (e.g. frequency <10% in 100 samples), indicating that even moderately low frequency eQTL SNPs (eSNPs) in these studies are enriched for false discoveries. In scenarios with ≥80% power, the top eSNP was the true simulated eSNP 90% of the time, but substantially less frequently for very common eSNPs (minor allele frequencies >25%). Overestimation of eQTL effect sizes, so-called ‘Winner’s Curse’, was common in low and moderate power settings. To address this, we developed a bootstrap method (BootstrapQTL) that led to more accurate effect size estimation. These insights provide a foundation for future eQTL studies, especially those with sampling constraints and subtly different conditions. Oxford University Press 2018-12-14 2018-09-05 /pmc/articles/PMC6294523/ /pubmed/30189032 http://dx.doi.org/10.1093/nar/gky780 Text en © The Author(s) 2018. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Methods Online Huang, Qin Qin Ritchie, Scott C Brozynska, Marta Inouye, Michael Power, false discovery rate and Winner’s Curse in eQTL studies
title	Power, false discovery rate and Winner’s Curse in eQTL studies
title_full	Power, false discovery rate and Winner’s Curse in eQTL studies
title_fullStr	Power, false discovery rate and Winner’s Curse in eQTL studies
title_full_unstemmed	Power, false discovery rate and Winner’s Curse in eQTL studies
title_short	Power, false discovery rate and Winner’s Curse in eQTL studies
title_sort	power, false discovery rate and winner’s curse in eqtl studies
topic	Methods Online
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6294523/ https://www.ncbi.nlm.nih.gov/pubmed/30189032 http://dx.doi.org/10.1093/nar/gky780
work_keys_str_mv	AT huangqinqin powerfalsediscoveryrateandwinnerscurseineqtlstudies AT ritchiescottc powerfalsediscoveryrateandwinnerscurseineqtlstudies AT brozynskamarta powerfalsediscoveryrateandwinnerscurseineqtlstudies AT inouyemichael powerfalsediscoveryrateandwinnerscurseineqtlstudies

Power, false discovery rate and Winner’s Curse in eQTL studies

Ejemplares similares