Cargando…
Efficient gene–environment interaction testing through bootstrap aggregating
Gene–environment (GxE) interactions are an important and sophisticated component in the manifestation of complex phenotypes. Simple univariate tests lack statistical power due to the need for multiple testing adjustment and not incorporating potential interplay between several genetic loci. Approach...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9845231/ https://www.ncbi.nlm.nih.gov/pubmed/36650248 http://dx.doi.org/10.1038/s41598-023-28172-4 |
_version_ | 1784870845851631616 |
---|---|
author | Lau, Michael Kress, Sara Schikowski, Tamara Schwender, Holger |
author_facet | Lau, Michael Kress, Sara Schikowski, Tamara Schwender, Holger |
author_sort | Lau, Michael |
collection | PubMed |
description | Gene–environment (GxE) interactions are an important and sophisticated component in the manifestation of complex phenotypes. Simple univariate tests lack statistical power due to the need for multiple testing adjustment and not incorporating potential interplay between several genetic loci. Approaches based on internally constructed genetic risk scores (GRS) require the partitioning of the available sample into training and testing data sets, thus, lowering the effective sample size for testing the GxE interaction itself. To overcome these issues, we propose a statistical test that employs bagging (bootstrap aggregating) in the GRS construction step and utilizes its out-of-bag prediction mechanism. This approach has the key advantage that the full available data set can be used for both constructing the GRS and testing the GxE interaction. To also incorporate interactions between genetic loci, we, furthermore, investigate if using random forests as the GRS construction method in GxE interaction testing further increases the statistical power. In a simulation study, we show that both novel procedures lead to a higher statistical power for detecting GxE interactions, while still controlling the type I error. The random-forests-based test outperforms a bagging-based test that uses the elastic net as its base learner in most scenarios. An application of the testing procedures to a real data set from a German cohort study suggests that there might be a GxE interaction involving exposure to air pollution regarding rheumatoid arthritis. |
format | Online Article Text |
id | pubmed-9845231 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-98452312023-01-19 Efficient gene–environment interaction testing through bootstrap aggregating Lau, Michael Kress, Sara Schikowski, Tamara Schwender, Holger Sci Rep Article Gene–environment (GxE) interactions are an important and sophisticated component in the manifestation of complex phenotypes. Simple univariate tests lack statistical power due to the need for multiple testing adjustment and not incorporating potential interplay between several genetic loci. Approaches based on internally constructed genetic risk scores (GRS) require the partitioning of the available sample into training and testing data sets, thus, lowering the effective sample size for testing the GxE interaction itself. To overcome these issues, we propose a statistical test that employs bagging (bootstrap aggregating) in the GRS construction step and utilizes its out-of-bag prediction mechanism. This approach has the key advantage that the full available data set can be used for both constructing the GRS and testing the GxE interaction. To also incorporate interactions between genetic loci, we, furthermore, investigate if using random forests as the GRS construction method in GxE interaction testing further increases the statistical power. In a simulation study, we show that both novel procedures lead to a higher statistical power for detecting GxE interactions, while still controlling the type I error. The random-forests-based test outperforms a bagging-based test that uses the elastic net as its base learner in most scenarios. An application of the testing procedures to a real data set from a German cohort study suggests that there might be a GxE interaction involving exposure to air pollution regarding rheumatoid arthritis. Nature Publishing Group UK 2023-01-17 /pmc/articles/PMC9845231/ /pubmed/36650248 http://dx.doi.org/10.1038/s41598-023-28172-4 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Lau, Michael Kress, Sara Schikowski, Tamara Schwender, Holger Efficient gene–environment interaction testing through bootstrap aggregating |
title | Efficient gene–environment interaction testing through bootstrap aggregating |
title_full | Efficient gene–environment interaction testing through bootstrap aggregating |
title_fullStr | Efficient gene–environment interaction testing through bootstrap aggregating |
title_full_unstemmed | Efficient gene–environment interaction testing through bootstrap aggregating |
title_short | Efficient gene–environment interaction testing through bootstrap aggregating |
title_sort | efficient gene–environment interaction testing through bootstrap aggregating |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9845231/ https://www.ncbi.nlm.nih.gov/pubmed/36650248 http://dx.doi.org/10.1038/s41598-023-28172-4 |
work_keys_str_mv | AT laumichael efficientgeneenvironmentinteractiontestingthroughbootstrapaggregating AT kresssara efficientgeneenvironmentinteractiontestingthroughbootstrapaggregating AT schikowskitamara efficientgeneenvironmentinteractiontestingthroughbootstrapaggregating AT schwenderholger efficientgeneenvironmentinteractiontestingthroughbootstrapaggregating |