Cargando…
Evaluating and implementing block jackknife resampling Mendelian randomization to mitigate bias induced by overlapping samples
Participant overlap can induce overfitting bias into Mendelian randomization (MR) and polygenic risk score (PRS) studies. Here, we evaluated a block jackknife resampling framework for genome-wide association studies (GWAS) and PRS construction to mitigate overfitting bias in MR analyses and implemen...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9840213/ https://www.ncbi.nlm.nih.gov/pubmed/35932451 http://dx.doi.org/10.1093/hmg/ddac186 |
_version_ | 1784869594436993024 |
---|---|
author | Fang, Si Hemani, Gibran Richardson, Tom G Gaunt, Tom R Davey Smith, George |
author_facet | Fang, Si Hemani, Gibran Richardson, Tom G Gaunt, Tom R Davey Smith, George |
author_sort | Fang, Si |
collection | PubMed |
description | Participant overlap can induce overfitting bias into Mendelian randomization (MR) and polygenic risk score (PRS) studies. Here, we evaluated a block jackknife resampling framework for genome-wide association studies (GWAS) and PRS construction to mitigate overfitting bias in MR analyses and implemented this study design in a causal inference setting using data from the UK Biobank. We simulated PRS and MR under three scenarios: (1) using weighted SNP estimates from an external GWAS, (2) using weighted SNP estimates from an overlapping GWAS sample and (3) using a block jackknife resampling framework. Based on a P-value threshold to derive genetic instruments for MR studies (P < 5 × 10(−8)) and a 10% variance in the exposure explained by all SNPs, block-jackknifing PRS did not suffer from overfitting bias (mean R(2) = 0.034) compared with the externally weighted PRS (mean R(2) = 0.040). In contrast, genetic instruments derived from overlapping samples explained a higher variance (mean R(2) = 0.048) compared with the externally derived score. Overfitting became considerably more severe when using a more liberal P-value threshold to construct PRS (e.g. P < 0.05, overlapping sample PRS mean R(2) = 0.103, externally weighted PRS mean R(2) = 0.086), whereas estimates using jackknife score remained robust to overfitting (mean R(2) = 0.084). Using block jackknife resampling MR in an applied analysis, we examined the effects of body mass index on circulating biomarkers which provided comparable estimates to an externally weighted instrument, whereas the overfitted scores typically provided narrower confidence intervals. Furthermore, we extended this framework into sex-stratified, multivariate and bidirectional settings to investigate the effect of childhood body size on adult testosterone levels. |
format | Online Article Text |
id | pubmed-9840213 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-98402132023-01-17 Evaluating and implementing block jackknife resampling Mendelian randomization to mitigate bias induced by overlapping samples Fang, Si Hemani, Gibran Richardson, Tom G Gaunt, Tom R Davey Smith, George Hum Mol Genet Original Article Participant overlap can induce overfitting bias into Mendelian randomization (MR) and polygenic risk score (PRS) studies. Here, we evaluated a block jackknife resampling framework for genome-wide association studies (GWAS) and PRS construction to mitigate overfitting bias in MR analyses and implemented this study design in a causal inference setting using data from the UK Biobank. We simulated PRS and MR under three scenarios: (1) using weighted SNP estimates from an external GWAS, (2) using weighted SNP estimates from an overlapping GWAS sample and (3) using a block jackknife resampling framework. Based on a P-value threshold to derive genetic instruments for MR studies (P < 5 × 10(−8)) and a 10% variance in the exposure explained by all SNPs, block-jackknifing PRS did not suffer from overfitting bias (mean R(2) = 0.034) compared with the externally weighted PRS (mean R(2) = 0.040). In contrast, genetic instruments derived from overlapping samples explained a higher variance (mean R(2) = 0.048) compared with the externally derived score. Overfitting became considerably more severe when using a more liberal P-value threshold to construct PRS (e.g. P < 0.05, overlapping sample PRS mean R(2) = 0.103, externally weighted PRS mean R(2) = 0.086), whereas estimates using jackknife score remained robust to overfitting (mean R(2) = 0.084). Using block jackknife resampling MR in an applied analysis, we examined the effects of body mass index on circulating biomarkers which provided comparable estimates to an externally weighted instrument, whereas the overfitted scores typically provided narrower confidence intervals. Furthermore, we extended this framework into sex-stratified, multivariate and bidirectional settings to investigate the effect of childhood body size on adult testosterone levels. Oxford University Press 2022-08-06 /pmc/articles/PMC9840213/ /pubmed/35932451 http://dx.doi.org/10.1093/hmg/ddac186 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Article Fang, Si Hemani, Gibran Richardson, Tom G Gaunt, Tom R Davey Smith, George Evaluating and implementing block jackknife resampling Mendelian randomization to mitigate bias induced by overlapping samples |
title | Evaluating and implementing block jackknife resampling Mendelian randomization to mitigate bias induced by overlapping samples |
title_full | Evaluating and implementing block jackknife resampling Mendelian randomization to mitigate bias induced by overlapping samples |
title_fullStr | Evaluating and implementing block jackknife resampling Mendelian randomization to mitigate bias induced by overlapping samples |
title_full_unstemmed | Evaluating and implementing block jackknife resampling Mendelian randomization to mitigate bias induced by overlapping samples |
title_short | Evaluating and implementing block jackknife resampling Mendelian randomization to mitigate bias induced by overlapping samples |
title_sort | evaluating and implementing block jackknife resampling mendelian randomization to mitigate bias induced by overlapping samples |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9840213/ https://www.ncbi.nlm.nih.gov/pubmed/35932451 http://dx.doi.org/10.1093/hmg/ddac186 |
work_keys_str_mv | AT fangsi evaluatingandimplementingblockjackkniferesamplingmendelianrandomizationtomitigatebiasinducedbyoverlappingsamples AT hemanigibran evaluatingandimplementingblockjackkniferesamplingmendelianrandomizationtomitigatebiasinducedbyoverlappingsamples AT richardsontomg evaluatingandimplementingblockjackkniferesamplingmendelianrandomizationtomitigatebiasinducedbyoverlappingsamples AT gaunttomr evaluatingandimplementingblockjackkniferesamplingmendelianrandomizationtomitigatebiasinducedbyoverlappingsamples AT daveysmithgeorge evaluatingandimplementingblockjackkniferesamplingmendelianrandomizationtomitigatebiasinducedbyoverlappingsamples |