Cargando…

Efficient gene–environment interaction testing through bootstrap aggregating

Gene–environment (GxE) interactions are an important and sophisticated component in the manifestation of complex phenotypes. Simple univariate tests lack statistical power due to the need for multiple testing adjustment and not incorporating potential interplay between several genetic loci. Approach...

Descripción completa

Detalles Bibliográficos
Autores principales: Lau, Michael, Kress, Sara, Schikowski, Tamara, Schwender, Holger
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9845231/
https://www.ncbi.nlm.nih.gov/pubmed/36650248
http://dx.doi.org/10.1038/s41598-023-28172-4
_version_ 1784870845851631616
author Lau, Michael
Kress, Sara
Schikowski, Tamara
Schwender, Holger
author_facet Lau, Michael
Kress, Sara
Schikowski, Tamara
Schwender, Holger
author_sort Lau, Michael
collection PubMed
description Gene–environment (GxE) interactions are an important and sophisticated component in the manifestation of complex phenotypes. Simple univariate tests lack statistical power due to the need for multiple testing adjustment and not incorporating potential interplay between several genetic loci. Approaches based on internally constructed genetic risk scores (GRS) require the partitioning of the available sample into training and testing data sets, thus, lowering the effective sample size for testing the GxE interaction itself. To overcome these issues, we propose a statistical test that employs bagging (bootstrap aggregating) in the GRS construction step and utilizes its out-of-bag prediction mechanism. This approach has the key advantage that the full available data set can be used for both constructing the GRS and testing the GxE interaction. To also incorporate interactions between genetic loci, we, furthermore, investigate if using random forests as the GRS construction method in GxE interaction testing further increases the statistical power. In a simulation study, we show that both novel procedures lead to a higher statistical power for detecting GxE interactions, while still controlling the type I error. The random-forests-based test outperforms a bagging-based test that uses the elastic net as its base learner in most scenarios. An application of the testing procedures to a real data set from a German cohort study suggests that there might be a GxE interaction involving exposure to air pollution regarding rheumatoid arthritis.
format Online
Article
Text
id pubmed-9845231
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-98452312023-01-19 Efficient gene–environment interaction testing through bootstrap aggregating Lau, Michael Kress, Sara Schikowski, Tamara Schwender, Holger Sci Rep Article Gene–environment (GxE) interactions are an important and sophisticated component in the manifestation of complex phenotypes. Simple univariate tests lack statistical power due to the need for multiple testing adjustment and not incorporating potential interplay between several genetic loci. Approaches based on internally constructed genetic risk scores (GRS) require the partitioning of the available sample into training and testing data sets, thus, lowering the effective sample size for testing the GxE interaction itself. To overcome these issues, we propose a statistical test that employs bagging (bootstrap aggregating) in the GRS construction step and utilizes its out-of-bag prediction mechanism. This approach has the key advantage that the full available data set can be used for both constructing the GRS and testing the GxE interaction. To also incorporate interactions between genetic loci, we, furthermore, investigate if using random forests as the GRS construction method in GxE interaction testing further increases the statistical power. In a simulation study, we show that both novel procedures lead to a higher statistical power for detecting GxE interactions, while still controlling the type I error. The random-forests-based test outperforms a bagging-based test that uses the elastic net as its base learner in most scenarios. An application of the testing procedures to a real data set from a German cohort study suggests that there might be a GxE interaction involving exposure to air pollution regarding rheumatoid arthritis. Nature Publishing Group UK 2023-01-17 /pmc/articles/PMC9845231/ /pubmed/36650248 http://dx.doi.org/10.1038/s41598-023-28172-4 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Lau, Michael
Kress, Sara
Schikowski, Tamara
Schwender, Holger
Efficient gene–environment interaction testing through bootstrap aggregating
title Efficient gene–environment interaction testing through bootstrap aggregating
title_full Efficient gene–environment interaction testing through bootstrap aggregating
title_fullStr Efficient gene–environment interaction testing through bootstrap aggregating
title_full_unstemmed Efficient gene–environment interaction testing through bootstrap aggregating
title_short Efficient gene–environment interaction testing through bootstrap aggregating
title_sort efficient gene–environment interaction testing through bootstrap aggregating
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9845231/
https://www.ncbi.nlm.nih.gov/pubmed/36650248
http://dx.doi.org/10.1038/s41598-023-28172-4
work_keys_str_mv AT laumichael efficientgeneenvironmentinteractiontestingthroughbootstrapaggregating
AT kresssara efficientgeneenvironmentinteractiontestingthroughbootstrapaggregating
AT schikowskitamara efficientgeneenvironmentinteractiontestingthroughbootstrapaggregating
AT schwenderholger efficientgeneenvironmentinteractiontestingthroughbootstrapaggregating