Cargando…

Inferring Gene-by-Environment Interactions with a Bayesian Whole-Genome Regression Model

The contribution of gene-by-environment (GxE) interactions for many human traits and diseases is poorly characterized. We propose a Bayesian whole-genome regression model for joint modeling of main genetic effects and GxE interactions in large-scale datasets, such as the UK Biobank, where many envir...

Descripción completa

Detalles Bibliográficos
Autores principales: Kerin, Matthew, Marchini, Jonathan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7536582/
https://www.ncbi.nlm.nih.gov/pubmed/32888427
http://dx.doi.org/10.1016/j.ajhg.2020.08.009
_version_ 1783590599411105792
author Kerin, Matthew
Marchini, Jonathan
author_facet Kerin, Matthew
Marchini, Jonathan
author_sort Kerin, Matthew
collection PubMed
description The contribution of gene-by-environment (GxE) interactions for many human traits and diseases is poorly characterized. We propose a Bayesian whole-genome regression model for joint modeling of main genetic effects and GxE interactions in large-scale datasets, such as the UK Biobank, where many environmental variables have been measured. The method is called LEMMA (Linear Environment Mixed Model Analysis) and estimates a linear combination of environmental variables, called an environmental score (ES), that interacts with genetic markers throughout the genome. The ES provides a readily interpretable way to examine the combined effect of many environmental variables. The ES can be used both to estimate the proportion of phenotypic variance attributable to GxE effects and to test for GxE effects at genetic variants across the genome. GxE effects can induce heteroskedasticity in quantitative traits, and LEMMA accounts for this by using robust standard error estimates when testing for GxE effects. When applied to body mass index, systolic blood pressure, diastolic blood pressure, and pulse pressure in the UK Biobank, we estimate that [Formula: see text] , [Formula: see text] , [Formula: see text] , and [Formula: see text] , respectively, of phenotypic variance is explained by GxE interactions and that low-frequency variants explain most of this variance. We also identify three loci that interact with the estimated environmental scores ([Formula: see text]).
format Online
Article
Text
id pubmed-7536582
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-75365822021-04-01 Inferring Gene-by-Environment Interactions with a Bayesian Whole-Genome Regression Model Kerin, Matthew Marchini, Jonathan Am J Hum Genet Article The contribution of gene-by-environment (GxE) interactions for many human traits and diseases is poorly characterized. We propose a Bayesian whole-genome regression model for joint modeling of main genetic effects and GxE interactions in large-scale datasets, such as the UK Biobank, where many environmental variables have been measured. The method is called LEMMA (Linear Environment Mixed Model Analysis) and estimates a linear combination of environmental variables, called an environmental score (ES), that interacts with genetic markers throughout the genome. The ES provides a readily interpretable way to examine the combined effect of many environmental variables. The ES can be used both to estimate the proportion of phenotypic variance attributable to GxE effects and to test for GxE effects at genetic variants across the genome. GxE effects can induce heteroskedasticity in quantitative traits, and LEMMA accounts for this by using robust standard error estimates when testing for GxE effects. When applied to body mass index, systolic blood pressure, diastolic blood pressure, and pulse pressure in the UK Biobank, we estimate that [Formula: see text] , [Formula: see text] , [Formula: see text] , and [Formula: see text] , respectively, of phenotypic variance is explained by GxE interactions and that low-frequency variants explain most of this variance. We also identify three loci that interact with the estimated environmental scores ([Formula: see text]). Elsevier 2020-10-01 2020-09-03 /pmc/articles/PMC7536582/ /pubmed/32888427 http://dx.doi.org/10.1016/j.ajhg.2020.08.009 Text en © 2020 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Kerin, Matthew
Marchini, Jonathan
Inferring Gene-by-Environment Interactions with a Bayesian Whole-Genome Regression Model
title Inferring Gene-by-Environment Interactions with a Bayesian Whole-Genome Regression Model
title_full Inferring Gene-by-Environment Interactions with a Bayesian Whole-Genome Regression Model
title_fullStr Inferring Gene-by-Environment Interactions with a Bayesian Whole-Genome Regression Model
title_full_unstemmed Inferring Gene-by-Environment Interactions with a Bayesian Whole-Genome Regression Model
title_short Inferring Gene-by-Environment Interactions with a Bayesian Whole-Genome Regression Model
title_sort inferring gene-by-environment interactions with a bayesian whole-genome regression model
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7536582/
https://www.ncbi.nlm.nih.gov/pubmed/32888427
http://dx.doi.org/10.1016/j.ajhg.2020.08.009
work_keys_str_mv AT kerinmatthew inferringgenebyenvironmentinteractionswithabayesianwholegenomeregressionmodel
AT marchinijonathan inferringgenebyenvironmentinteractionswithabayesianwholegenomeregressionmodel