Cargando…

The identification of complex interactions in epidemiology and toxicology: a simulation study of boosted regression trees

BACKGROUND: There is a need to evaluate complex interaction effects on human health, such as those induced by mixtures of environmental contaminants. The usual approach is to formulate an additive statistical model and check for departures using product terms between the variables of interest. In th...

Descripción completa

Detalles Bibliográficos
Autores principales: Lampa, Erik, Lind, Lars, Lind, P Monica, Bornefalk-Hermansson, Anna
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4120739/
https://www.ncbi.nlm.nih.gov/pubmed/24993424
http://dx.doi.org/10.1186/1476-069X-13-57
_version_ 1782329139494125568
author Lampa, Erik
Lind, Lars
Lind, P Monica
Bornefalk-Hermansson, Anna
author_facet Lampa, Erik
Lind, Lars
Lind, P Monica
Bornefalk-Hermansson, Anna
author_sort Lampa, Erik
collection PubMed
description BACKGROUND: There is a need to evaluate complex interaction effects on human health, such as those induced by mixtures of environmental contaminants. The usual approach is to formulate an additive statistical model and check for departures using product terms between the variables of interest. In this paper, we present an approach to search for interaction effects among several variables using boosted regression trees. METHODS: We simulate a continuous outcome from real data on 27 environmental contaminants, some of which are correlated, and test the method’s ability to uncover the simulated interactions. The simulated outcome contains one four-way interaction, one non-linear effect and one interaction between a continuous variable and a binary variable. Four scenarios reflecting different strengths of association are simulated. We illustrate the method using real data. RESULTS: The method succeeded in identifying the true interactions in all scenarios except where the association was weakest. Some spurious interactions were also found, however. The method was also capable to identify interactions in the real data set. CONCLUSIONS: We conclude that boosted regression trees can be used to uncover complex interaction effects in epidemiological studies.
format Online
Article
Text
id pubmed-4120739
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-41207392014-08-06 The identification of complex interactions in epidemiology and toxicology: a simulation study of boosted regression trees Lampa, Erik Lind, Lars Lind, P Monica Bornefalk-Hermansson, Anna Environ Health Research BACKGROUND: There is a need to evaluate complex interaction effects on human health, such as those induced by mixtures of environmental contaminants. The usual approach is to formulate an additive statistical model and check for departures using product terms between the variables of interest. In this paper, we present an approach to search for interaction effects among several variables using boosted regression trees. METHODS: We simulate a continuous outcome from real data on 27 environmental contaminants, some of which are correlated, and test the method’s ability to uncover the simulated interactions. The simulated outcome contains one four-way interaction, one non-linear effect and one interaction between a continuous variable and a binary variable. Four scenarios reflecting different strengths of association are simulated. We illustrate the method using real data. RESULTS: The method succeeded in identifying the true interactions in all scenarios except where the association was weakest. Some spurious interactions were also found, however. The method was also capable to identify interactions in the real data set. CONCLUSIONS: We conclude that boosted regression trees can be used to uncover complex interaction effects in epidemiological studies. BioMed Central 2014-07-04 /pmc/articles/PMC4120739/ /pubmed/24993424 http://dx.doi.org/10.1186/1476-069X-13-57 Text en Copyright © 2014 Lampa et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Lampa, Erik
Lind, Lars
Lind, P Monica
Bornefalk-Hermansson, Anna
The identification of complex interactions in epidemiology and toxicology: a simulation study of boosted regression trees
title The identification of complex interactions in epidemiology and toxicology: a simulation study of boosted regression trees
title_full The identification of complex interactions in epidemiology and toxicology: a simulation study of boosted regression trees
title_fullStr The identification of complex interactions in epidemiology and toxicology: a simulation study of boosted regression trees
title_full_unstemmed The identification of complex interactions in epidemiology and toxicology: a simulation study of boosted regression trees
title_short The identification of complex interactions in epidemiology and toxicology: a simulation study of boosted regression trees
title_sort identification of complex interactions in epidemiology and toxicology: a simulation study of boosted regression trees
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4120739/
https://www.ncbi.nlm.nih.gov/pubmed/24993424
http://dx.doi.org/10.1186/1476-069X-13-57
work_keys_str_mv AT lampaerik theidentificationofcomplexinteractionsinepidemiologyandtoxicologyasimulationstudyofboostedregressiontrees
AT lindlars theidentificationofcomplexinteractionsinepidemiologyandtoxicologyasimulationstudyofboostedregressiontrees
AT lindpmonica theidentificationofcomplexinteractionsinepidemiologyandtoxicologyasimulationstudyofboostedregressiontrees
AT bornefalkhermanssonanna theidentificationofcomplexinteractionsinepidemiologyandtoxicologyasimulationstudyofboostedregressiontrees
AT lampaerik identificationofcomplexinteractionsinepidemiologyandtoxicologyasimulationstudyofboostedregressiontrees
AT lindlars identificationofcomplexinteractionsinepidemiologyandtoxicologyasimulationstudyofboostedregressiontrees
AT lindpmonica identificationofcomplexinteractionsinepidemiologyandtoxicologyasimulationstudyofboostedregressiontrees
AT bornefalkhermanssonanna identificationofcomplexinteractionsinepidemiologyandtoxicologyasimulationstudyofboostedregressiontrees