Cargando…

A simple yet accurate correction for winner's curse can predict signals discovered in much larger genome scans

Motivation: For genetic studies, statistically significant variants explain far less trait variance than ‘sub-threshold’ association signals. To dimension follow-up studies, researchers need to accurately estimate ‘true’ effect sizes at each SNP, e.g. the true mean of odds ratios (ORs)/regression co...

Descripción completa

Detalles Bibliográficos
Autores principales: Bigdeli, T. Bernard, Lee, Donghyung, Webb, Bradley Todd, Riley, Brien P., Vladimirov, Vladimir I., Fanous, Ayman H., Kendler, Kenneth S., Bacanu, Silviu-Alin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5013908/
https://www.ncbi.nlm.nih.gov/pubmed/27187203
http://dx.doi.org/10.1093/bioinformatics/btw303
_version_ 1782452236835618816
author Bigdeli, T. Bernard
Lee, Donghyung
Webb, Bradley Todd
Riley, Brien P.
Vladimirov, Vladimir I.
Fanous, Ayman H.
Kendler, Kenneth S.
Bacanu, Silviu-Alin
author_facet Bigdeli, T. Bernard
Lee, Donghyung
Webb, Bradley Todd
Riley, Brien P.
Vladimirov, Vladimir I.
Fanous, Ayman H.
Kendler, Kenneth S.
Bacanu, Silviu-Alin
author_sort Bigdeli, T. Bernard
collection PubMed
description Motivation: For genetic studies, statistically significant variants explain far less trait variance than ‘sub-threshold’ association signals. To dimension follow-up studies, researchers need to accurately estimate ‘true’ effect sizes at each SNP, e.g. the true mean of odds ratios (ORs)/regression coefficients (RRs) or Z-score noncentralities. Naïve estimates of effect sizes incur winner’s curse biases, which are reduced only by laborious winner’s curse adjustments (WCAs). Given that Z-scores estimates can be theoretically translated on other scales, we propose a simple method to compute WCA for Z-scores, i.e. their true means/noncentralities. Results:WCA of Z-scores shrinks these towards zero while, on P-value scale, multiple testing adjustment (MTA) shrinks P-values toward one, which corresponds to the zero Z-score value. Thus, WCA on Z-scores scale is a proxy for MTA on P-value scale. Therefore, to estimate Z-score noncentralities for all SNPs in genome scans, we propose FDR Inverse Quantile Transformation (FIQT). It (i) performs the simpler MTA of P-values using FDR and (ii) obtains noncentralities by back-transforming MTA P-values on Z-score scale. When compared to competitors, realistic simulations suggest that FIQT is more (i) accurate and (ii) computationally efficient by orders of magnitude. Practical application of FIQT to Psychiatric Genetic Consortium schizophrenia cohort predicts a non-trivial fraction of sub-threshold signals which become significant in much larger supersamples. Conclusions: FIQT is a simple, yet accurate, WCA method for Z-scores (and ORs/RRs, via simple transformations). Availability and Implementation: A 10 lines R function implementation is available at https://github.com/bacanusa/FIQT. Contact: sabacanu@vcu.edu Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-5013908
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-50139082016-09-12 A simple yet accurate correction for winner's curse can predict signals discovered in much larger genome scans Bigdeli, T. Bernard Lee, Donghyung Webb, Bradley Todd Riley, Brien P. Vladimirov, Vladimir I. Fanous, Ayman H. Kendler, Kenneth S. Bacanu, Silviu-Alin Bioinformatics Original Papers Motivation: For genetic studies, statistically significant variants explain far less trait variance than ‘sub-threshold’ association signals. To dimension follow-up studies, researchers need to accurately estimate ‘true’ effect sizes at each SNP, e.g. the true mean of odds ratios (ORs)/regression coefficients (RRs) or Z-score noncentralities. Naïve estimates of effect sizes incur winner’s curse biases, which are reduced only by laborious winner’s curse adjustments (WCAs). Given that Z-scores estimates can be theoretically translated on other scales, we propose a simple method to compute WCA for Z-scores, i.e. their true means/noncentralities. Results:WCA of Z-scores shrinks these towards zero while, on P-value scale, multiple testing adjustment (MTA) shrinks P-values toward one, which corresponds to the zero Z-score value. Thus, WCA on Z-scores scale is a proxy for MTA on P-value scale. Therefore, to estimate Z-score noncentralities for all SNPs in genome scans, we propose FDR Inverse Quantile Transformation (FIQT). It (i) performs the simpler MTA of P-values using FDR and (ii) obtains noncentralities by back-transforming MTA P-values on Z-score scale. When compared to competitors, realistic simulations suggest that FIQT is more (i) accurate and (ii) computationally efficient by orders of magnitude. Practical application of FIQT to Psychiatric Genetic Consortium schizophrenia cohort predicts a non-trivial fraction of sub-threshold signals which become significant in much larger supersamples. Conclusions: FIQT is a simple, yet accurate, WCA method for Z-scores (and ORs/RRs, via simple transformations). Availability and Implementation: A 10 lines R function implementation is available at https://github.com/bacanusa/FIQT. Contact: sabacanu@vcu.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2016-09-01 2016-05-13 /pmc/articles/PMC5013908/ /pubmed/27187203 http://dx.doi.org/10.1093/bioinformatics/btw303 Text en © The Author 2016. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Bigdeli, T. Bernard
Lee, Donghyung
Webb, Bradley Todd
Riley, Brien P.
Vladimirov, Vladimir I.
Fanous, Ayman H.
Kendler, Kenneth S.
Bacanu, Silviu-Alin
A simple yet accurate correction for winner's curse can predict signals discovered in much larger genome scans
title A simple yet accurate correction for winner's curse can predict signals discovered in much larger genome scans
title_full A simple yet accurate correction for winner's curse can predict signals discovered in much larger genome scans
title_fullStr A simple yet accurate correction for winner's curse can predict signals discovered in much larger genome scans
title_full_unstemmed A simple yet accurate correction for winner's curse can predict signals discovered in much larger genome scans
title_short A simple yet accurate correction for winner's curse can predict signals discovered in much larger genome scans
title_sort simple yet accurate correction for winner's curse can predict signals discovered in much larger genome scans
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5013908/
https://www.ncbi.nlm.nih.gov/pubmed/27187203
http://dx.doi.org/10.1093/bioinformatics/btw303
work_keys_str_mv AT bigdelitbernard asimpleyetaccuratecorrectionforwinnerscursecanpredictsignalsdiscoveredinmuchlargergenomescans
AT leedonghyung asimpleyetaccuratecorrectionforwinnerscursecanpredictsignalsdiscoveredinmuchlargergenomescans
AT webbbradleytodd asimpleyetaccuratecorrectionforwinnerscursecanpredictsignalsdiscoveredinmuchlargergenomescans
AT rileybrienp asimpleyetaccuratecorrectionforwinnerscursecanpredictsignalsdiscoveredinmuchlargergenomescans
AT vladimirovvladimiri asimpleyetaccuratecorrectionforwinnerscursecanpredictsignalsdiscoveredinmuchlargergenomescans
AT fanousaymanh asimpleyetaccuratecorrectionforwinnerscursecanpredictsignalsdiscoveredinmuchlargergenomescans
AT kendlerkenneths asimpleyetaccuratecorrectionforwinnerscursecanpredictsignalsdiscoveredinmuchlargergenomescans
AT bacanusilviualin asimpleyetaccuratecorrectionforwinnerscursecanpredictsignalsdiscoveredinmuchlargergenomescans
AT bigdelitbernard simpleyetaccuratecorrectionforwinnerscursecanpredictsignalsdiscoveredinmuchlargergenomescans
AT leedonghyung simpleyetaccuratecorrectionforwinnerscursecanpredictsignalsdiscoveredinmuchlargergenomescans
AT webbbradleytodd simpleyetaccuratecorrectionforwinnerscursecanpredictsignalsdiscoveredinmuchlargergenomescans
AT rileybrienp simpleyetaccuratecorrectionforwinnerscursecanpredictsignalsdiscoveredinmuchlargergenomescans
AT vladimirovvladimiri simpleyetaccuratecorrectionforwinnerscursecanpredictsignalsdiscoveredinmuchlargergenomescans
AT fanousaymanh simpleyetaccuratecorrectionforwinnerscursecanpredictsignalsdiscoveredinmuchlargergenomescans
AT kendlerkenneths simpleyetaccuratecorrectionforwinnerscursecanpredictsignalsdiscoveredinmuchlargergenomescans
AT bacanusilviualin simpleyetaccuratecorrectionforwinnerscursecanpredictsignalsdiscoveredinmuchlargergenomescans