Cargando…

Statistical software for analyzing the health effects of multiple concurrent exposures via Bayesian kernel machine regression

BACKGROUND: Estimating the health effects of multi-pollutant mixtures is of increasing interest in environmental epidemiology. Recently, a new approach for estimating the health effects of mixtures, Bayesian kernel machine regression (BKMR), has been developed. This method estimates the multivariabl...

Descripción completa

Detalles Bibliográficos
Autores principales: Bobb, Jennifer F., Claus Henn, Birgit, Valeri, Linda, Coull, Brent A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6102907/
https://www.ncbi.nlm.nih.gov/pubmed/30126431
http://dx.doi.org/10.1186/s12940-018-0413-y
_version_ 1783349265853054976
author Bobb, Jennifer F.
Claus Henn, Birgit
Valeri, Linda
Coull, Brent A.
author_facet Bobb, Jennifer F.
Claus Henn, Birgit
Valeri, Linda
Coull, Brent A.
author_sort Bobb, Jennifer F.
collection PubMed
description BACKGROUND: Estimating the health effects of multi-pollutant mixtures is of increasing interest in environmental epidemiology. Recently, a new approach for estimating the health effects of mixtures, Bayesian kernel machine regression (BKMR), has been developed. This method estimates the multivariable exposure-response function in a flexible and parsimonious way, conducts variable selection on the (potentially high-dimensional) vector of exposures, and allows for a grouped variable selection approach that can accommodate highly correlated exposures. However, the application of this novel method has been limited by a lack of available software, the need to derive interpretable output in a computationally efficient manner, and the inability to apply the method to non-continuous outcome variables. METHODS: This paper addresses these limitations by (i) introducing an open-source software package in the R programming language, the bkmr R package, (ii) demonstrating methods for visualizing high-dimensional exposure-response functions, and for estimating scientifically relevant summaries, (iii) illustrating a probit regression implementation of BKMR for binary outcomes, and (iv) describing a fast version of BKMR that utilizes a Gaussian predictive process approach. All of the methods are illustrated using fully reproducible examples with the provided R code. RESULTS: Applying the methods to a continuous outcome example illustrated the ability of the BKMR implementation to estimate the health effects of multi-pollutant mixtures in the context of a highly nonlinear, biologically-based dose-response function, and to estimate overall, single-exposure, and interactive health effects. The Gaussian predictive process method led to a substantial reduction in the runtime, without a major decrease in accuracy. In the setting of a larger number of exposures and a dichotomous outcome, the probit BKMR implementation was able to correctly identify the variables included in the exposure-response function and yielded interpretable quantities on the scale of a latent continuous outcome or on the scale of the outcome probability. CONCLUSIONS: This newly developed software, integrated suite of tools, and extended methodology makes BKMR accessible for use across a broad range of epidemiological applications in which multiple risk factors have complex effects on health.
format Online
Article
Text
id pubmed-6102907
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-61029072018-08-30 Statistical software for analyzing the health effects of multiple concurrent exposures via Bayesian kernel machine regression Bobb, Jennifer F. Claus Henn, Birgit Valeri, Linda Coull, Brent A. Environ Health Methodology BACKGROUND: Estimating the health effects of multi-pollutant mixtures is of increasing interest in environmental epidemiology. Recently, a new approach for estimating the health effects of mixtures, Bayesian kernel machine regression (BKMR), has been developed. This method estimates the multivariable exposure-response function in a flexible and parsimonious way, conducts variable selection on the (potentially high-dimensional) vector of exposures, and allows for a grouped variable selection approach that can accommodate highly correlated exposures. However, the application of this novel method has been limited by a lack of available software, the need to derive interpretable output in a computationally efficient manner, and the inability to apply the method to non-continuous outcome variables. METHODS: This paper addresses these limitations by (i) introducing an open-source software package in the R programming language, the bkmr R package, (ii) demonstrating methods for visualizing high-dimensional exposure-response functions, and for estimating scientifically relevant summaries, (iii) illustrating a probit regression implementation of BKMR for binary outcomes, and (iv) describing a fast version of BKMR that utilizes a Gaussian predictive process approach. All of the methods are illustrated using fully reproducible examples with the provided R code. RESULTS: Applying the methods to a continuous outcome example illustrated the ability of the BKMR implementation to estimate the health effects of multi-pollutant mixtures in the context of a highly nonlinear, biologically-based dose-response function, and to estimate overall, single-exposure, and interactive health effects. The Gaussian predictive process method led to a substantial reduction in the runtime, without a major decrease in accuracy. In the setting of a larger number of exposures and a dichotomous outcome, the probit BKMR implementation was able to correctly identify the variables included in the exposure-response function and yielded interpretable quantities on the scale of a latent continuous outcome or on the scale of the outcome probability. CONCLUSIONS: This newly developed software, integrated suite of tools, and extended methodology makes BKMR accessible for use across a broad range of epidemiological applications in which multiple risk factors have complex effects on health. BioMed Central 2018-08-20 /pmc/articles/PMC6102907/ /pubmed/30126431 http://dx.doi.org/10.1186/s12940-018-0413-y Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology
Bobb, Jennifer F.
Claus Henn, Birgit
Valeri, Linda
Coull, Brent A.
Statistical software for analyzing the health effects of multiple concurrent exposures via Bayesian kernel machine regression
title Statistical software for analyzing the health effects of multiple concurrent exposures via Bayesian kernel machine regression
title_full Statistical software for analyzing the health effects of multiple concurrent exposures via Bayesian kernel machine regression
title_fullStr Statistical software for analyzing the health effects of multiple concurrent exposures via Bayesian kernel machine regression
title_full_unstemmed Statistical software for analyzing the health effects of multiple concurrent exposures via Bayesian kernel machine regression
title_short Statistical software for analyzing the health effects of multiple concurrent exposures via Bayesian kernel machine regression
title_sort statistical software for analyzing the health effects of multiple concurrent exposures via bayesian kernel machine regression
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6102907/
https://www.ncbi.nlm.nih.gov/pubmed/30126431
http://dx.doi.org/10.1186/s12940-018-0413-y
work_keys_str_mv AT bobbjenniferf statisticalsoftwareforanalyzingthehealtheffectsofmultipleconcurrentexposuresviabayesiankernelmachineregression
AT claushennbirgit statisticalsoftwareforanalyzingthehealtheffectsofmultipleconcurrentexposuresviabayesiankernelmachineregression
AT valerilinda statisticalsoftwareforanalyzingthehealtheffectsofmultipleconcurrentexposuresviabayesiankernelmachineregression
AT coullbrenta statisticalsoftwareforanalyzingthehealtheffectsofmultipleconcurrentexposuresviabayesiankernelmachineregression