Cargando…

Evaluating and implementing block jackknife resampling Mendelian randomization to mitigate bias induced by overlapping samples

Participant overlap can induce overfitting bias into Mendelian randomization (MR) and polygenic risk score (PRS) studies. Here, we evaluated a block jackknife resampling framework for genome-wide association studies (GWAS) and PRS construction to mitigate overfitting bias in MR analyses and implemen...

Descripción completa

Detalles Bibliográficos
Autores principales: Fang, Si, Hemani, Gibran, Richardson, Tom G, Gaunt, Tom R, Davey Smith, George
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9840213/
https://www.ncbi.nlm.nih.gov/pubmed/35932451
http://dx.doi.org/10.1093/hmg/ddac186
_version_ 1784869594436993024
author Fang, Si
Hemani, Gibran
Richardson, Tom G
Gaunt, Tom R
Davey Smith, George
author_facet Fang, Si
Hemani, Gibran
Richardson, Tom G
Gaunt, Tom R
Davey Smith, George
author_sort Fang, Si
collection PubMed
description Participant overlap can induce overfitting bias into Mendelian randomization (MR) and polygenic risk score (PRS) studies. Here, we evaluated a block jackknife resampling framework for genome-wide association studies (GWAS) and PRS construction to mitigate overfitting bias in MR analyses and implemented this study design in a causal inference setting using data from the UK Biobank. We simulated PRS and MR under three scenarios: (1) using weighted SNP estimates from an external GWAS, (2) using weighted SNP estimates from an overlapping GWAS sample and (3) using a block jackknife resampling framework. Based on a P-value threshold to derive genetic instruments for MR studies (P < 5 × 10(−8)) and a 10% variance in the exposure explained by all SNPs, block-jackknifing PRS did not suffer from overfitting bias (mean R(2) = 0.034) compared with the externally weighted PRS (mean R(2) = 0.040). In contrast, genetic instruments derived from overlapping samples explained a higher variance (mean R(2) = 0.048) compared with the externally derived score. Overfitting became considerably more severe when using a more liberal P-value threshold to construct PRS (e.g. P < 0.05, overlapping sample PRS mean R(2) = 0.103, externally weighted PRS mean R(2) = 0.086), whereas estimates using jackknife score remained robust to overfitting (mean R(2) = 0.084). Using block jackknife resampling MR in an applied analysis, we examined the effects of body mass index on circulating biomarkers which provided comparable estimates to an externally weighted instrument, whereas the overfitted scores typically provided narrower confidence intervals. Furthermore, we extended this framework into sex-stratified, multivariate and bidirectional settings to investigate the effect of childhood body size on adult testosterone levels.
format Online
Article
Text
id pubmed-9840213
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-98402132023-01-17 Evaluating and implementing block jackknife resampling Mendelian randomization to mitigate bias induced by overlapping samples Fang, Si Hemani, Gibran Richardson, Tom G Gaunt, Tom R Davey Smith, George Hum Mol Genet Original Article Participant overlap can induce overfitting bias into Mendelian randomization (MR) and polygenic risk score (PRS) studies. Here, we evaluated a block jackknife resampling framework for genome-wide association studies (GWAS) and PRS construction to mitigate overfitting bias in MR analyses and implemented this study design in a causal inference setting using data from the UK Biobank. We simulated PRS and MR under three scenarios: (1) using weighted SNP estimates from an external GWAS, (2) using weighted SNP estimates from an overlapping GWAS sample and (3) using a block jackknife resampling framework. Based on a P-value threshold to derive genetic instruments for MR studies (P < 5 × 10(−8)) and a 10% variance in the exposure explained by all SNPs, block-jackknifing PRS did not suffer from overfitting bias (mean R(2) = 0.034) compared with the externally weighted PRS (mean R(2) = 0.040). In contrast, genetic instruments derived from overlapping samples explained a higher variance (mean R(2) = 0.048) compared with the externally derived score. Overfitting became considerably more severe when using a more liberal P-value threshold to construct PRS (e.g. P < 0.05, overlapping sample PRS mean R(2) = 0.103, externally weighted PRS mean R(2) = 0.086), whereas estimates using jackknife score remained robust to overfitting (mean R(2) = 0.084). Using block jackknife resampling MR in an applied analysis, we examined the effects of body mass index on circulating biomarkers which provided comparable estimates to an externally weighted instrument, whereas the overfitted scores typically provided narrower confidence intervals. Furthermore, we extended this framework into sex-stratified, multivariate and bidirectional settings to investigate the effect of childhood body size on adult testosterone levels. Oxford University Press 2022-08-06 /pmc/articles/PMC9840213/ /pubmed/35932451 http://dx.doi.org/10.1093/hmg/ddac186 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Fang, Si
Hemani, Gibran
Richardson, Tom G
Gaunt, Tom R
Davey Smith, George
Evaluating and implementing block jackknife resampling Mendelian randomization to mitigate bias induced by overlapping samples
title Evaluating and implementing block jackknife resampling Mendelian randomization to mitigate bias induced by overlapping samples
title_full Evaluating and implementing block jackknife resampling Mendelian randomization to mitigate bias induced by overlapping samples
title_fullStr Evaluating and implementing block jackknife resampling Mendelian randomization to mitigate bias induced by overlapping samples
title_full_unstemmed Evaluating and implementing block jackknife resampling Mendelian randomization to mitigate bias induced by overlapping samples
title_short Evaluating and implementing block jackknife resampling Mendelian randomization to mitigate bias induced by overlapping samples
title_sort evaluating and implementing block jackknife resampling mendelian randomization to mitigate bias induced by overlapping samples
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9840213/
https://www.ncbi.nlm.nih.gov/pubmed/35932451
http://dx.doi.org/10.1093/hmg/ddac186
work_keys_str_mv AT fangsi evaluatingandimplementingblockjackkniferesamplingmendelianrandomizationtomitigatebiasinducedbyoverlappingsamples
AT hemanigibran evaluatingandimplementingblockjackkniferesamplingmendelianrandomizationtomitigatebiasinducedbyoverlappingsamples
AT richardsontomg evaluatingandimplementingblockjackkniferesamplingmendelianrandomizationtomitigatebiasinducedbyoverlappingsamples
AT gaunttomr evaluatingandimplementingblockjackkniferesamplingmendelianrandomizationtomitigatebiasinducedbyoverlappingsamples
AT daveysmithgeorge evaluatingandimplementingblockjackkniferesamplingmendelianrandomizationtomitigatebiasinducedbyoverlappingsamples