Cargando…
Illustrating, Quantifying, and Correcting for Bias in Post-hoc Analysis of Gene-Based Rare Variant Tests of Association
To date, gene-based rare variant testing approaches have focused on aggregating information across sets of variants to maximize statistical power in identifying genes showing significant association with diseases. Beyond identifying genes that are associated with diseases, the identification of caus...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5603735/ https://www.ncbi.nlm.nih.gov/pubmed/28959274 http://dx.doi.org/10.3389/fgene.2017.00117 |
_version_ | 1783264759749017600 |
---|---|
author | Grinde, Kelsey E. Arbet, Jaron Green, Alden O'Connell, Michael Valcarcel, Alessandra Westra, Jason Tintle, Nathan |
author_facet | Grinde, Kelsey E. Arbet, Jaron Green, Alden O'Connell, Michael Valcarcel, Alessandra Westra, Jason Tintle, Nathan |
author_sort | Grinde, Kelsey E. |
collection | PubMed |
description | To date, gene-based rare variant testing approaches have focused on aggregating information across sets of variants to maximize statistical power in identifying genes showing significant association with diseases. Beyond identifying genes that are associated with diseases, the identification of causal variant(s) in those genes and estimation of their effect is crucial for planning replication studies and characterizing the genetic architecture of the locus. However, we illustrate that straightforward single-marker association statistics can suffer from substantial bias introduced by conditioning on gene-based test significance, due to the phenomenon often referred to as “winner's curse.” We illustrate the ramifications of this bias on variant effect size estimation and variant prioritization/ranking approaches, outline parameters of genetic architecture that affect this bias, and propose a bootstrap resampling method to correct for this bias. We find that our correction method significantly reduces the bias due to winner's curse (average two-fold decrease in bias, p < 2.2 × 10(−6)) and, consequently, substantially improves mean squared error and variant prioritization/ranking. The method is particularly helpful in adjustment for winner's curse effects when the initial gene-based test has low power and for relatively more common, non-causal variants. Adjustment for winner's curse is recommended for all post-hoc estimation and ranking of variants after a gene-based test. Further work is necessary to continue seeking ways to reduce bias and improve inference in post-hoc analysis of gene-based tests under a wide variety of genetic architectures. |
format | Online Article Text |
id | pubmed-5603735 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-56037352017-09-28 Illustrating, Quantifying, and Correcting for Bias in Post-hoc Analysis of Gene-Based Rare Variant Tests of Association Grinde, Kelsey E. Arbet, Jaron Green, Alden O'Connell, Michael Valcarcel, Alessandra Westra, Jason Tintle, Nathan Front Genet Genetics To date, gene-based rare variant testing approaches have focused on aggregating information across sets of variants to maximize statistical power in identifying genes showing significant association with diseases. Beyond identifying genes that are associated with diseases, the identification of causal variant(s) in those genes and estimation of their effect is crucial for planning replication studies and characterizing the genetic architecture of the locus. However, we illustrate that straightforward single-marker association statistics can suffer from substantial bias introduced by conditioning on gene-based test significance, due to the phenomenon often referred to as “winner's curse.” We illustrate the ramifications of this bias on variant effect size estimation and variant prioritization/ranking approaches, outline parameters of genetic architecture that affect this bias, and propose a bootstrap resampling method to correct for this bias. We find that our correction method significantly reduces the bias due to winner's curse (average two-fold decrease in bias, p < 2.2 × 10(−6)) and, consequently, substantially improves mean squared error and variant prioritization/ranking. The method is particularly helpful in adjustment for winner's curse effects when the initial gene-based test has low power and for relatively more common, non-causal variants. Adjustment for winner's curse is recommended for all post-hoc estimation and ranking of variants after a gene-based test. Further work is necessary to continue seeking ways to reduce bias and improve inference in post-hoc analysis of gene-based tests under a wide variety of genetic architectures. Frontiers Media S.A. 2017-09-14 /pmc/articles/PMC5603735/ /pubmed/28959274 http://dx.doi.org/10.3389/fgene.2017.00117 Text en Copyright © 2017 Grinde, Arbet, Green, O'Connell, Valcarcel, Westra and Tintle. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Grinde, Kelsey E. Arbet, Jaron Green, Alden O'Connell, Michael Valcarcel, Alessandra Westra, Jason Tintle, Nathan Illustrating, Quantifying, and Correcting for Bias in Post-hoc Analysis of Gene-Based Rare Variant Tests of Association |
title | Illustrating, Quantifying, and Correcting for Bias in Post-hoc Analysis of Gene-Based Rare Variant Tests of Association |
title_full | Illustrating, Quantifying, and Correcting for Bias in Post-hoc Analysis of Gene-Based Rare Variant Tests of Association |
title_fullStr | Illustrating, Quantifying, and Correcting for Bias in Post-hoc Analysis of Gene-Based Rare Variant Tests of Association |
title_full_unstemmed | Illustrating, Quantifying, and Correcting for Bias in Post-hoc Analysis of Gene-Based Rare Variant Tests of Association |
title_short | Illustrating, Quantifying, and Correcting for Bias in Post-hoc Analysis of Gene-Based Rare Variant Tests of Association |
title_sort | illustrating, quantifying, and correcting for bias in post-hoc analysis of gene-based rare variant tests of association |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5603735/ https://www.ncbi.nlm.nih.gov/pubmed/28959274 http://dx.doi.org/10.3389/fgene.2017.00117 |
work_keys_str_mv | AT grindekelseye illustratingquantifyingandcorrectingforbiasinposthocanalysisofgenebasedrarevarianttestsofassociation AT arbetjaron illustratingquantifyingandcorrectingforbiasinposthocanalysisofgenebasedrarevarianttestsofassociation AT greenalden illustratingquantifyingandcorrectingforbiasinposthocanalysisofgenebasedrarevarianttestsofassociation AT oconnellmichael illustratingquantifyingandcorrectingforbiasinposthocanalysisofgenebasedrarevarianttestsofassociation AT valcarcelalessandra illustratingquantifyingandcorrectingforbiasinposthocanalysisofgenebasedrarevarianttestsofassociation AT westrajason illustratingquantifyingandcorrectingforbiasinposthocanalysisofgenebasedrarevarianttestsofassociation AT tintlenathan illustratingquantifyingandcorrectingforbiasinposthocanalysisofgenebasedrarevarianttestsofassociation |