Cargando…

Illustrating, Quantifying, and Correcting for Bias in Post-hoc Analysis of Gene-Based Rare Variant Tests of Association

To date, gene-based rare variant testing approaches have focused on aggregating information across sets of variants to maximize statistical power in identifying genes showing significant association with diseases. Beyond identifying genes that are associated with diseases, the identification of caus...

Descripción completa

Detalles Bibliográficos
Autores principales: Grinde, Kelsey E., Arbet, Jaron, Green, Alden, O'Connell, Michael, Valcarcel, Alessandra, Westra, Jason, Tintle, Nathan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5603735/
https://www.ncbi.nlm.nih.gov/pubmed/28959274
http://dx.doi.org/10.3389/fgene.2017.00117
_version_ 1783264759749017600
author Grinde, Kelsey E.
Arbet, Jaron
Green, Alden
O'Connell, Michael
Valcarcel, Alessandra
Westra, Jason
Tintle, Nathan
author_facet Grinde, Kelsey E.
Arbet, Jaron
Green, Alden
O'Connell, Michael
Valcarcel, Alessandra
Westra, Jason
Tintle, Nathan
author_sort Grinde, Kelsey E.
collection PubMed
description To date, gene-based rare variant testing approaches have focused on aggregating information across sets of variants to maximize statistical power in identifying genes showing significant association with diseases. Beyond identifying genes that are associated with diseases, the identification of causal variant(s) in those genes and estimation of their effect is crucial for planning replication studies and characterizing the genetic architecture of the locus. However, we illustrate that straightforward single-marker association statistics can suffer from substantial bias introduced by conditioning on gene-based test significance, due to the phenomenon often referred to as “winner's curse.” We illustrate the ramifications of this bias on variant effect size estimation and variant prioritization/ranking approaches, outline parameters of genetic architecture that affect this bias, and propose a bootstrap resampling method to correct for this bias. We find that our correction method significantly reduces the bias due to winner's curse (average two-fold decrease in bias, p < 2.2 × 10(−6)) and, consequently, substantially improves mean squared error and variant prioritization/ranking. The method is particularly helpful in adjustment for winner's curse effects when the initial gene-based test has low power and for relatively more common, non-causal variants. Adjustment for winner's curse is recommended for all post-hoc estimation and ranking of variants after a gene-based test. Further work is necessary to continue seeking ways to reduce bias and improve inference in post-hoc analysis of gene-based tests under a wide variety of genetic architectures.
format Online
Article
Text
id pubmed-5603735
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-56037352017-09-28 Illustrating, Quantifying, and Correcting for Bias in Post-hoc Analysis of Gene-Based Rare Variant Tests of Association Grinde, Kelsey E. Arbet, Jaron Green, Alden O'Connell, Michael Valcarcel, Alessandra Westra, Jason Tintle, Nathan Front Genet Genetics To date, gene-based rare variant testing approaches have focused on aggregating information across sets of variants to maximize statistical power in identifying genes showing significant association with diseases. Beyond identifying genes that are associated with diseases, the identification of causal variant(s) in those genes and estimation of their effect is crucial for planning replication studies and characterizing the genetic architecture of the locus. However, we illustrate that straightforward single-marker association statistics can suffer from substantial bias introduced by conditioning on gene-based test significance, due to the phenomenon often referred to as “winner's curse.” We illustrate the ramifications of this bias on variant effect size estimation and variant prioritization/ranking approaches, outline parameters of genetic architecture that affect this bias, and propose a bootstrap resampling method to correct for this bias. We find that our correction method significantly reduces the bias due to winner's curse (average two-fold decrease in bias, p < 2.2 × 10(−6)) and, consequently, substantially improves mean squared error and variant prioritization/ranking. The method is particularly helpful in adjustment for winner's curse effects when the initial gene-based test has low power and for relatively more common, non-causal variants. Adjustment for winner's curse is recommended for all post-hoc estimation and ranking of variants after a gene-based test. Further work is necessary to continue seeking ways to reduce bias and improve inference in post-hoc analysis of gene-based tests under a wide variety of genetic architectures. Frontiers Media S.A. 2017-09-14 /pmc/articles/PMC5603735/ /pubmed/28959274 http://dx.doi.org/10.3389/fgene.2017.00117 Text en Copyright © 2017 Grinde, Arbet, Green, O'Connell, Valcarcel, Westra and Tintle. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Grinde, Kelsey E.
Arbet, Jaron
Green, Alden
O'Connell, Michael
Valcarcel, Alessandra
Westra, Jason
Tintle, Nathan
Illustrating, Quantifying, and Correcting for Bias in Post-hoc Analysis of Gene-Based Rare Variant Tests of Association
title Illustrating, Quantifying, and Correcting for Bias in Post-hoc Analysis of Gene-Based Rare Variant Tests of Association
title_full Illustrating, Quantifying, and Correcting for Bias in Post-hoc Analysis of Gene-Based Rare Variant Tests of Association
title_fullStr Illustrating, Quantifying, and Correcting for Bias in Post-hoc Analysis of Gene-Based Rare Variant Tests of Association
title_full_unstemmed Illustrating, Quantifying, and Correcting for Bias in Post-hoc Analysis of Gene-Based Rare Variant Tests of Association
title_short Illustrating, Quantifying, and Correcting for Bias in Post-hoc Analysis of Gene-Based Rare Variant Tests of Association
title_sort illustrating, quantifying, and correcting for bias in post-hoc analysis of gene-based rare variant tests of association
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5603735/
https://www.ncbi.nlm.nih.gov/pubmed/28959274
http://dx.doi.org/10.3389/fgene.2017.00117
work_keys_str_mv AT grindekelseye illustratingquantifyingandcorrectingforbiasinposthocanalysisofgenebasedrarevarianttestsofassociation
AT arbetjaron illustratingquantifyingandcorrectingforbiasinposthocanalysisofgenebasedrarevarianttestsofassociation
AT greenalden illustratingquantifyingandcorrectingforbiasinposthocanalysisofgenebasedrarevarianttestsofassociation
AT oconnellmichael illustratingquantifyingandcorrectingforbiasinposthocanalysisofgenebasedrarevarianttestsofassociation
AT valcarcelalessandra illustratingquantifyingandcorrectingforbiasinposthocanalysisofgenebasedrarevarianttestsofassociation
AT westrajason illustratingquantifyingandcorrectingforbiasinposthocanalysisofgenebasedrarevarianttestsofassociation
AT tintlenathan illustratingquantifyingandcorrectingforbiasinposthocanalysisofgenebasedrarevarianttestsofassociation