Cargando…
Assessing the suitability of summary data for two-sample Mendelian randomization analyses using MR-Egger regression: the role of the [Formula: see text] statistic
Background: MR-Egger regression has recently been proposed as a method for Mendelian randomization (MR) analyses incorporating summary data estimates of causal effect from multiple individual variants, which is robust to invalid instruments. It can be used to test for directional pleiotropy and prov...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5446088/ https://www.ncbi.nlm.nih.gov/pubmed/27616674 http://dx.doi.org/10.1093/ije/dyw220 |
_version_ | 1783239006401593344 |
---|---|
author | Bowden, Jack Del Greco M, Fabiola Minelli, Cosetta Davey Smith, George Sheehan, Nuala A Thompson, John R |
author_facet | Bowden, Jack Del Greco M, Fabiola Minelli, Cosetta Davey Smith, George Sheehan, Nuala A Thompson, John R |
author_sort | Bowden, Jack |
collection | PubMed |
description | Background: MR-Egger regression has recently been proposed as a method for Mendelian randomization (MR) analyses incorporating summary data estimates of causal effect from multiple individual variants, which is robust to invalid instruments. It can be used to test for directional pleiotropy and provides an estimate of the causal effect adjusted for its presence. MR-Egger regression provides a useful additional sensitivity analysis to the standard inverse variance weighted (IVW) approach that assumes all variants are valid instruments. Both methods use weights that consider the single nucleotide polymorphism (SNP)-exposure associations to be known, rather than estimated. We call this the `NO Measurement Error' (NOME) assumption. Causal effect estimates from the IVW approach exhibit weak instrument bias whenever the genetic variants utilized violate the NOME assumption, which can be reliably measured using the F-statistic. The effect of NOME violation on MR-Egger regression has yet to be studied. Methods: An adaptation of the [Formula: see text] statistic from the field of meta-analysis is proposed to quantify the strength of NOME violation for MR-Egger. It lies between 0 and 1, and indicates the expected relative bias (or dilution) of the MR-Egger causal estimate in the two-sample MR context. We call it [Formula: see text]. The method of simulation extrapolation is also explored to counteract the dilution. Their joint utility is evaluated using simulated data and applied to a real MR example. Results: In simulated two-sample MR analyses we show that, when a causal effect exists, the MR-Egger estimate of causal effect is biased towards the null when NOME is violated, and the stronger the violation (as indicated by lower values of [Formula: see text]), the stronger the dilution. When additionally all genetic variants are valid instruments, the type I error rate of the MR-Egger test for pleiotropy is inflated and the causal effect underestimated. Simulation extrapolation is shown to substantially mitigate these adverse effects. We demonstrate our proposed approach for a two-sample summary data MR analysis to estimate the causal effect of low-density lipoprotein on heart disease risk. A high value of [Formula: see text] close to 1 indicates that dilution does not materially affect the standard MR-Egger analyses for these data. Conclusions: Care must be taken to assess the NOME assumption via the [Formula: see text] statistic before implementing standard MR-Egger regression in the two-sample summary data context. If [Formula: see text] is sufficiently low (less than 90%), inferences from the method should be interpreted with caution and adjustment methods considered. |
format | Online Article Text |
id | pubmed-5446088 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-54460882017-05-26 Assessing the suitability of summary data for two-sample Mendelian randomization analyses using MR-Egger regression: the role of the [Formula: see text] statistic Bowden, Jack Del Greco M, Fabiola Minelli, Cosetta Davey Smith, George Sheehan, Nuala A Thompson, John R Int J Epidemiol Mendelian Randomisation and Instrumental Variable Analysis Background: MR-Egger regression has recently been proposed as a method for Mendelian randomization (MR) analyses incorporating summary data estimates of causal effect from multiple individual variants, which is robust to invalid instruments. It can be used to test for directional pleiotropy and provides an estimate of the causal effect adjusted for its presence. MR-Egger regression provides a useful additional sensitivity analysis to the standard inverse variance weighted (IVW) approach that assumes all variants are valid instruments. Both methods use weights that consider the single nucleotide polymorphism (SNP)-exposure associations to be known, rather than estimated. We call this the `NO Measurement Error' (NOME) assumption. Causal effect estimates from the IVW approach exhibit weak instrument bias whenever the genetic variants utilized violate the NOME assumption, which can be reliably measured using the F-statistic. The effect of NOME violation on MR-Egger regression has yet to be studied. Methods: An adaptation of the [Formula: see text] statistic from the field of meta-analysis is proposed to quantify the strength of NOME violation for MR-Egger. It lies between 0 and 1, and indicates the expected relative bias (or dilution) of the MR-Egger causal estimate in the two-sample MR context. We call it [Formula: see text]. The method of simulation extrapolation is also explored to counteract the dilution. Their joint utility is evaluated using simulated data and applied to a real MR example. Results: In simulated two-sample MR analyses we show that, when a causal effect exists, the MR-Egger estimate of causal effect is biased towards the null when NOME is violated, and the stronger the violation (as indicated by lower values of [Formula: see text]), the stronger the dilution. When additionally all genetic variants are valid instruments, the type I error rate of the MR-Egger test for pleiotropy is inflated and the causal effect underestimated. Simulation extrapolation is shown to substantially mitigate these adverse effects. We demonstrate our proposed approach for a two-sample summary data MR analysis to estimate the causal effect of low-density lipoprotein on heart disease risk. A high value of [Formula: see text] close to 1 indicates that dilution does not materially affect the standard MR-Egger analyses for these data. Conclusions: Care must be taken to assess the NOME assumption via the [Formula: see text] statistic before implementing standard MR-Egger regression in the two-sample summary data context. If [Formula: see text] is sufficiently low (less than 90%), inferences from the method should be interpreted with caution and adjustment methods considered. Oxford University Press 2016-12 2016-09-11 /pmc/articles/PMC5446088/ /pubmed/27616674 http://dx.doi.org/10.1093/ije/dyw220 Text en © The Author 2016. Published by Oxford University Press on behalf of the International Epidemiological Association. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Mendelian Randomisation and Instrumental Variable Analysis Bowden, Jack Del Greco M, Fabiola Minelli, Cosetta Davey Smith, George Sheehan, Nuala A Thompson, John R Assessing the suitability of summary data for two-sample Mendelian randomization analyses using MR-Egger regression: the role of the [Formula: see text] statistic |
title | Assessing the suitability of summary data for two-sample Mendelian randomization analyses using MR-Egger regression: the role of the [Formula: see text] statistic |
title_full | Assessing the suitability of summary data for two-sample Mendelian randomization analyses using MR-Egger regression: the role of the [Formula: see text] statistic |
title_fullStr | Assessing the suitability of summary data for two-sample Mendelian randomization analyses using MR-Egger regression: the role of the [Formula: see text] statistic |
title_full_unstemmed | Assessing the suitability of summary data for two-sample Mendelian randomization analyses using MR-Egger regression: the role of the [Formula: see text] statistic |
title_short | Assessing the suitability of summary data for two-sample Mendelian randomization analyses using MR-Egger regression: the role of the [Formula: see text] statistic |
title_sort | assessing the suitability of summary data for two-sample mendelian randomization analyses using mr-egger regression: the role of the [formula: see text] statistic |
topic | Mendelian Randomisation and Instrumental Variable Analysis |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5446088/ https://www.ncbi.nlm.nih.gov/pubmed/27616674 http://dx.doi.org/10.1093/ije/dyw220 |
work_keys_str_mv | AT bowdenjack assessingthesuitabilityofsummarydatafortwosamplemendelianrandomizationanalysesusingmreggerregressiontheroleoftheformulaseetextstatistic AT delgrecomfabiola assessingthesuitabilityofsummarydatafortwosamplemendelianrandomizationanalysesusingmreggerregressiontheroleoftheformulaseetextstatistic AT minellicosetta assessingthesuitabilityofsummarydatafortwosamplemendelianrandomizationanalysesusingmreggerregressiontheroleoftheformulaseetextstatistic AT daveysmithgeorge assessingthesuitabilityofsummarydatafortwosamplemendelianrandomizationanalysesusingmreggerregressiontheroleoftheformulaseetextstatistic AT sheehannualaa assessingthesuitabilityofsummarydatafortwosamplemendelianrandomizationanalysesusingmreggerregressiontheroleoftheformulaseetextstatistic AT thompsonjohnr assessingthesuitabilityofsummarydatafortwosamplemendelianrandomizationanalysesusingmreggerregressiontheroleoftheformulaseetextstatistic |