Cargando…

Bias due to participant overlap in two‐sample Mendelian randomization

Mendelian randomization analyses are often performed using summarized data. The causal estimate from a one‐sample analysis (in which data are taken from a single data source) with weak instrumental variables is biased in the direction of the observational association between the risk factor and outc...

Descripción completa

Detalles Bibliográficos
Autores principales: Burgess, Stephen, Davies, Neil M., Thompson, Simon G.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5082560/
https://www.ncbi.nlm.nih.gov/pubmed/27625185
http://dx.doi.org/10.1002/gepi.21998
_version_ 1782463081686761472
author Burgess, Stephen
Davies, Neil M.
Thompson, Simon G.
author_facet Burgess, Stephen
Davies, Neil M.
Thompson, Simon G.
author_sort Burgess, Stephen
collection PubMed
description Mendelian randomization analyses are often performed using summarized data. The causal estimate from a one‐sample analysis (in which data are taken from a single data source) with weak instrumental variables is biased in the direction of the observational association between the risk factor and outcome, whereas the estimate from a two‐sample analysis (in which data on the risk factor and outcome are taken from non‐overlapping datasets) is less biased and any bias is in the direction of the null. When using genetic consortia that have partially overlapping sets of participants, the direction and extent of bias are uncertain. In this paper, we perform simulation studies to investigate the magnitude of bias and Type 1 error rate inflation arising from sample overlap. We consider both a continuous outcome and a case‐control setting with a binary outcome. For a continuous outcome, bias due to sample overlap is a linear function of the proportion of overlap between the samples. So, in the case of a null causal effect, if the relative bias of the one‐sample instrumental variable estimate is 10% (corresponding to an F parameter of 10), then the relative bias with 50% sample overlap is 5%, and with 30% sample overlap is 3%. In a case‐control setting, if risk factor measurements are only included for the control participants, unbiased estimates are obtained even in a one‐sample setting. However, if risk factor data on both control and case participants are used, then bias is similar with a binary outcome as with a continuous outcome. Consortia releasing publicly available data on the associations of genetic variants with continuous risk factors should provide estimates that exclude case participants from case‐control samples.
format Online
Article
Text
id pubmed-5082560
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-50825602016-11-09 Bias due to participant overlap in two‐sample Mendelian randomization Burgess, Stephen Davies, Neil M. Thompson, Simon G. Genet Epidemiol Research Articles Mendelian randomization analyses are often performed using summarized data. The causal estimate from a one‐sample analysis (in which data are taken from a single data source) with weak instrumental variables is biased in the direction of the observational association between the risk factor and outcome, whereas the estimate from a two‐sample analysis (in which data on the risk factor and outcome are taken from non‐overlapping datasets) is less biased and any bias is in the direction of the null. When using genetic consortia that have partially overlapping sets of participants, the direction and extent of bias are uncertain. In this paper, we perform simulation studies to investigate the magnitude of bias and Type 1 error rate inflation arising from sample overlap. We consider both a continuous outcome and a case‐control setting with a binary outcome. For a continuous outcome, bias due to sample overlap is a linear function of the proportion of overlap between the samples. So, in the case of a null causal effect, if the relative bias of the one‐sample instrumental variable estimate is 10% (corresponding to an F parameter of 10), then the relative bias with 50% sample overlap is 5%, and with 30% sample overlap is 3%. In a case‐control setting, if risk factor measurements are only included for the control participants, unbiased estimates are obtained even in a one‐sample setting. However, if risk factor data on both control and case participants are used, then bias is similar with a binary outcome as with a continuous outcome. Consortia releasing publicly available data on the associations of genetic variants with continuous risk factors should provide estimates that exclude case participants from case‐control samples. John Wiley and Sons Inc. 2016-09-14 2016-11 /pmc/articles/PMC5082560/ /pubmed/27625185 http://dx.doi.org/10.1002/gepi.21998 Text en © 2016 The Authors Genetic Epidemiology Published by Wiley Periodicals, Inc. This is an open access article under the terms of the Creative Commons Attribution (http://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Articles
Burgess, Stephen
Davies, Neil M.
Thompson, Simon G.
Bias due to participant overlap in two‐sample Mendelian randomization
title Bias due to participant overlap in two‐sample Mendelian randomization
title_full Bias due to participant overlap in two‐sample Mendelian randomization
title_fullStr Bias due to participant overlap in two‐sample Mendelian randomization
title_full_unstemmed Bias due to participant overlap in two‐sample Mendelian randomization
title_short Bias due to participant overlap in two‐sample Mendelian randomization
title_sort bias due to participant overlap in two‐sample mendelian randomization
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5082560/
https://www.ncbi.nlm.nih.gov/pubmed/27625185
http://dx.doi.org/10.1002/gepi.21998
work_keys_str_mv AT burgessstephen biasduetoparticipantoverlapintwosamplemendelianrandomization
AT daviesneilm biasduetoparticipantoverlapintwosamplemendelianrandomization
AT thompsonsimong biasduetoparticipantoverlapintwosamplemendelianrandomization