Cargando…

Genetic instrumental variable regression: Explaining socioeconomic and health outcomes in nonexperimental data

Identifying causal effects in nonexperimental data is an enduring challenge. One proposed solution that recently gained popularity is the idea to use genes as instrumental variables [i.e., Mendelian randomization (MR)]. However, this approach is problematic because many variables of interest are gen...

Descripción completa

Detalles Bibliográficos
Autores principales: DiPrete, Thomas A., Burik, Casper A. P., Koellinger, Philipp D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5984483/
https://www.ncbi.nlm.nih.gov/pubmed/29686100
http://dx.doi.org/10.1073/pnas.1707388115
_version_ 1783328627649150976
author DiPrete, Thomas A.
Burik, Casper A. P.
Koellinger, Philipp D.
author_facet DiPrete, Thomas A.
Burik, Casper A. P.
Koellinger, Philipp D.
author_sort DiPrete, Thomas A.
collection PubMed
description Identifying causal effects in nonexperimental data is an enduring challenge. One proposed solution that recently gained popularity is the idea to use genes as instrumental variables [i.e., Mendelian randomization (MR)]. However, this approach is problematic because many variables of interest are genetically correlated, which implies the possibility that many genes could affect both the exposure and the outcome directly or via unobserved confounding factors. Thus, pleiotropic effects of genes are themselves a source of bias in nonexperimental data that would also undermine the ability of MR to correct for endogeneity bias from nongenetic sources. Here, we propose an alternative approach, genetic instrumental variable (GIV) regression, that provides estimates for the effect of an exposure on an outcome in the presence of pleiotropy. As a valuable byproduct, GIV regression also provides accurate estimates of the chip heritability of the outcome variable. GIV regression uses polygenic scores (PGSs) for the outcome of interest which can be constructed from genome-wide association study (GWAS) results. By splitting the GWAS sample for the outcome into nonoverlapping subsamples, we obtain multiple indicators of the outcome PGSs that can be used as instruments for each other and, in combination with other methods such as sibling fixed effects, can address endogeneity bias from both pleiotropy and the environment. In two empirical applications, we demonstrate that our approach produces reasonable estimates of the chip heritability of educational attainment (EA) and show that standard regression and MR provide upwardly biased estimates of the effect of body height on EA.
format Online
Article
Text
id pubmed-5984483
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher National Academy of Sciences
record_format MEDLINE/PubMed
spelling pubmed-59844832018-06-07 Genetic instrumental variable regression: Explaining socioeconomic and health outcomes in nonexperimental data DiPrete, Thomas A. Burik, Casper A. P. Koellinger, Philipp D. Proc Natl Acad Sci U S A PNAS Plus Identifying causal effects in nonexperimental data is an enduring challenge. One proposed solution that recently gained popularity is the idea to use genes as instrumental variables [i.e., Mendelian randomization (MR)]. However, this approach is problematic because many variables of interest are genetically correlated, which implies the possibility that many genes could affect both the exposure and the outcome directly or via unobserved confounding factors. Thus, pleiotropic effects of genes are themselves a source of bias in nonexperimental data that would also undermine the ability of MR to correct for endogeneity bias from nongenetic sources. Here, we propose an alternative approach, genetic instrumental variable (GIV) regression, that provides estimates for the effect of an exposure on an outcome in the presence of pleiotropy. As a valuable byproduct, GIV regression also provides accurate estimates of the chip heritability of the outcome variable. GIV regression uses polygenic scores (PGSs) for the outcome of interest which can be constructed from genome-wide association study (GWAS) results. By splitting the GWAS sample for the outcome into nonoverlapping subsamples, we obtain multiple indicators of the outcome PGSs that can be used as instruments for each other and, in combination with other methods such as sibling fixed effects, can address endogeneity bias from both pleiotropy and the environment. In two empirical applications, we demonstrate that our approach produces reasonable estimates of the chip heritability of educational attainment (EA) and show that standard regression and MR provide upwardly biased estimates of the effect of body height on EA. National Academy of Sciences 2018-05-29 2018-04-23 /pmc/articles/PMC5984483/ /pubmed/29686100 http://dx.doi.org/10.1073/pnas.1707388115 Text en Copyright © 2018 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by-nc-nd/4.0/ This open access article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND) (https://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle PNAS Plus
DiPrete, Thomas A.
Burik, Casper A. P.
Koellinger, Philipp D.
Genetic instrumental variable regression: Explaining socioeconomic and health outcomes in nonexperimental data
title Genetic instrumental variable regression: Explaining socioeconomic and health outcomes in nonexperimental data
title_full Genetic instrumental variable regression: Explaining socioeconomic and health outcomes in nonexperimental data
title_fullStr Genetic instrumental variable regression: Explaining socioeconomic and health outcomes in nonexperimental data
title_full_unstemmed Genetic instrumental variable regression: Explaining socioeconomic and health outcomes in nonexperimental data
title_short Genetic instrumental variable regression: Explaining socioeconomic and health outcomes in nonexperimental data
title_sort genetic instrumental variable regression: explaining socioeconomic and health outcomes in nonexperimental data
topic PNAS Plus
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5984483/
https://www.ncbi.nlm.nih.gov/pubmed/29686100
http://dx.doi.org/10.1073/pnas.1707388115
work_keys_str_mv AT dipretethomasa geneticinstrumentalvariableregressionexplainingsocioeconomicandhealthoutcomesinnonexperimentaldata
AT burikcasperap geneticinstrumentalvariableregressionexplainingsocioeconomicandhealthoutcomesinnonexperimentaldata
AT koellingerphilippd geneticinstrumentalvariableregressionexplainingsocioeconomicandhealthoutcomesinnonexperimentaldata