Cargando…

Mendelian Randomization Analysis With Multiple Genetic Variants Using Summarized Data

Genome-wide association studies, which typically report regression coefficients summarizing the associations of many genetic variants with various traits, are potentially a powerful source of data for Mendelian randomization investigations. We demonstrate how such coefficients from multiple variants...

Descripción completa

Detalles Bibliográficos
Autores principales: Burgess, Stephen, Butterworth, Adam, Thompson, Simon G
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BlackWell Publishing Ltd 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4377079/
https://www.ncbi.nlm.nih.gov/pubmed/24114802
http://dx.doi.org/10.1002/gepi.21758
_version_ 1782363847729872896
author Burgess, Stephen
Butterworth, Adam
Thompson, Simon G
author_facet Burgess, Stephen
Butterworth, Adam
Thompson, Simon G
author_sort Burgess, Stephen
collection PubMed
description Genome-wide association studies, which typically report regression coefficients summarizing the associations of many genetic variants with various traits, are potentially a powerful source of data for Mendelian randomization investigations. We demonstrate how such coefficients from multiple variants can be combined in a Mendelian randomization analysis to estimate the causal effect of a risk factor on an outcome. The bias and efficiency of estimates based on summarized data are compared to those based on individual-level data in simulation studies. We investigate the impact of gene–gene interactions, linkage disequilibrium, and ‘weak instruments’ on these estimates. Both an inverse-variance weighted average of variant-specific associations and a likelihood-based approach for summarized data give similar estimates and precision to the two-stage least squares method for individual-level data, even when there are gene–gene interactions. However, these summarized data methods overstate precision when variants are in linkage disequilibrium. If the P-value in a linear regression of the risk factor for each variant is less than [Image: see text], then weak instrument bias will be small. We use these methods to estimate the causal association of low-density lipoprotein cholesterol (LDL-C) on coronary artery disease using published data on five genetic variants. A 30% reduction in LDL-C is estimated to reduce coronary artery disease risk by 67% (95% CI: 54% to 76%). We conclude that Mendelian randomization investigations using summarized data from uncorrelated variants are similarly efficient to those using individual-level data, although the necessary assumptions cannot be so fully assessed.
format Online
Article
Text
id pubmed-4377079
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BlackWell Publishing Ltd
record_format MEDLINE/PubMed
spelling pubmed-43770792015-03-30 Mendelian Randomization Analysis With Multiple Genetic Variants Using Summarized Data Burgess, Stephen Butterworth, Adam Thompson, Simon G Genet Epidemiol Research Articles Genome-wide association studies, which typically report regression coefficients summarizing the associations of many genetic variants with various traits, are potentially a powerful source of data for Mendelian randomization investigations. We demonstrate how such coefficients from multiple variants can be combined in a Mendelian randomization analysis to estimate the causal effect of a risk factor on an outcome. The bias and efficiency of estimates based on summarized data are compared to those based on individual-level data in simulation studies. We investigate the impact of gene–gene interactions, linkage disequilibrium, and ‘weak instruments’ on these estimates. Both an inverse-variance weighted average of variant-specific associations and a likelihood-based approach for summarized data give similar estimates and precision to the two-stage least squares method for individual-level data, even when there are gene–gene interactions. However, these summarized data methods overstate precision when variants are in linkage disequilibrium. If the P-value in a linear regression of the risk factor for each variant is less than [Image: see text], then weak instrument bias will be small. We use these methods to estimate the causal association of low-density lipoprotein cholesterol (LDL-C) on coronary artery disease using published data on five genetic variants. A 30% reduction in LDL-C is estimated to reduce coronary artery disease risk by 67% (95% CI: 54% to 76%). We conclude that Mendelian randomization investigations using summarized data from uncorrelated variants are similarly efficient to those using individual-level data, although the necessary assumptions cannot be so fully assessed. BlackWell Publishing Ltd 2013-11 2013-09-20 /pmc/articles/PMC4377079/ /pubmed/24114802 http://dx.doi.org/10.1002/gepi.21758 Text en © 2013 The Authors. *Genetic Epidemiology published by Wiley Periodicals, Inc. http://creativecommons.org/licenses/by/4.0/ This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Articles
Burgess, Stephen
Butterworth, Adam
Thompson, Simon G
Mendelian Randomization Analysis With Multiple Genetic Variants Using Summarized Data
title Mendelian Randomization Analysis With Multiple Genetic Variants Using Summarized Data
title_full Mendelian Randomization Analysis With Multiple Genetic Variants Using Summarized Data
title_fullStr Mendelian Randomization Analysis With Multiple Genetic Variants Using Summarized Data
title_full_unstemmed Mendelian Randomization Analysis With Multiple Genetic Variants Using Summarized Data
title_short Mendelian Randomization Analysis With Multiple Genetic Variants Using Summarized Data
title_sort mendelian randomization analysis with multiple genetic variants using summarized data
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4377079/
https://www.ncbi.nlm.nih.gov/pubmed/24114802
http://dx.doi.org/10.1002/gepi.21758
work_keys_str_mv AT burgessstephen mendelianrandomizationanalysiswithmultiplegeneticvariantsusingsummarizeddata
AT butterworthadam mendelianrandomizationanalysiswithmultiplegeneticvariantsusingsummarizeddata
AT thompsonsimong mendelianrandomizationanalysiswithmultiplegeneticvariantsusingsummarizeddata