Cargando…
Statistical software for analyzing the health effects of multiple concurrent exposures via Bayesian kernel machine regression
BACKGROUND: Estimating the health effects of multi-pollutant mixtures is of increasing interest in environmental epidemiology. Recently, a new approach for estimating the health effects of mixtures, Bayesian kernel machine regression (BKMR), has been developed. This method estimates the multivariabl...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6102907/ https://www.ncbi.nlm.nih.gov/pubmed/30126431 http://dx.doi.org/10.1186/s12940-018-0413-y |
_version_ | 1783349265853054976 |
---|---|
author | Bobb, Jennifer F. Claus Henn, Birgit Valeri, Linda Coull, Brent A. |
author_facet | Bobb, Jennifer F. Claus Henn, Birgit Valeri, Linda Coull, Brent A. |
author_sort | Bobb, Jennifer F. |
collection | PubMed |
description | BACKGROUND: Estimating the health effects of multi-pollutant mixtures is of increasing interest in environmental epidemiology. Recently, a new approach for estimating the health effects of mixtures, Bayesian kernel machine regression (BKMR), has been developed. This method estimates the multivariable exposure-response function in a flexible and parsimonious way, conducts variable selection on the (potentially high-dimensional) vector of exposures, and allows for a grouped variable selection approach that can accommodate highly correlated exposures. However, the application of this novel method has been limited by a lack of available software, the need to derive interpretable output in a computationally efficient manner, and the inability to apply the method to non-continuous outcome variables. METHODS: This paper addresses these limitations by (i) introducing an open-source software package in the R programming language, the bkmr R package, (ii) demonstrating methods for visualizing high-dimensional exposure-response functions, and for estimating scientifically relevant summaries, (iii) illustrating a probit regression implementation of BKMR for binary outcomes, and (iv) describing a fast version of BKMR that utilizes a Gaussian predictive process approach. All of the methods are illustrated using fully reproducible examples with the provided R code. RESULTS: Applying the methods to a continuous outcome example illustrated the ability of the BKMR implementation to estimate the health effects of multi-pollutant mixtures in the context of a highly nonlinear, biologically-based dose-response function, and to estimate overall, single-exposure, and interactive health effects. The Gaussian predictive process method led to a substantial reduction in the runtime, without a major decrease in accuracy. In the setting of a larger number of exposures and a dichotomous outcome, the probit BKMR implementation was able to correctly identify the variables included in the exposure-response function and yielded interpretable quantities on the scale of a latent continuous outcome or on the scale of the outcome probability. CONCLUSIONS: This newly developed software, integrated suite of tools, and extended methodology makes BKMR accessible for use across a broad range of epidemiological applications in which multiple risk factors have complex effects on health. |
format | Online Article Text |
id | pubmed-6102907 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-61029072018-08-30 Statistical software for analyzing the health effects of multiple concurrent exposures via Bayesian kernel machine regression Bobb, Jennifer F. Claus Henn, Birgit Valeri, Linda Coull, Brent A. Environ Health Methodology BACKGROUND: Estimating the health effects of multi-pollutant mixtures is of increasing interest in environmental epidemiology. Recently, a new approach for estimating the health effects of mixtures, Bayesian kernel machine regression (BKMR), has been developed. This method estimates the multivariable exposure-response function in a flexible and parsimonious way, conducts variable selection on the (potentially high-dimensional) vector of exposures, and allows for a grouped variable selection approach that can accommodate highly correlated exposures. However, the application of this novel method has been limited by a lack of available software, the need to derive interpretable output in a computationally efficient manner, and the inability to apply the method to non-continuous outcome variables. METHODS: This paper addresses these limitations by (i) introducing an open-source software package in the R programming language, the bkmr R package, (ii) demonstrating methods for visualizing high-dimensional exposure-response functions, and for estimating scientifically relevant summaries, (iii) illustrating a probit regression implementation of BKMR for binary outcomes, and (iv) describing a fast version of BKMR that utilizes a Gaussian predictive process approach. All of the methods are illustrated using fully reproducible examples with the provided R code. RESULTS: Applying the methods to a continuous outcome example illustrated the ability of the BKMR implementation to estimate the health effects of multi-pollutant mixtures in the context of a highly nonlinear, biologically-based dose-response function, and to estimate overall, single-exposure, and interactive health effects. The Gaussian predictive process method led to a substantial reduction in the runtime, without a major decrease in accuracy. In the setting of a larger number of exposures and a dichotomous outcome, the probit BKMR implementation was able to correctly identify the variables included in the exposure-response function and yielded interpretable quantities on the scale of a latent continuous outcome or on the scale of the outcome probability. CONCLUSIONS: This newly developed software, integrated suite of tools, and extended methodology makes BKMR accessible for use across a broad range of epidemiological applications in which multiple risk factors have complex effects on health. BioMed Central 2018-08-20 /pmc/articles/PMC6102907/ /pubmed/30126431 http://dx.doi.org/10.1186/s12940-018-0413-y Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Bobb, Jennifer F. Claus Henn, Birgit Valeri, Linda Coull, Brent A. Statistical software for analyzing the health effects of multiple concurrent exposures via Bayesian kernel machine regression |
title | Statistical software for analyzing the health effects of multiple concurrent exposures via Bayesian kernel machine regression |
title_full | Statistical software for analyzing the health effects of multiple concurrent exposures via Bayesian kernel machine regression |
title_fullStr | Statistical software for analyzing the health effects of multiple concurrent exposures via Bayesian kernel machine regression |
title_full_unstemmed | Statistical software for analyzing the health effects of multiple concurrent exposures via Bayesian kernel machine regression |
title_short | Statistical software for analyzing the health effects of multiple concurrent exposures via Bayesian kernel machine regression |
title_sort | statistical software for analyzing the health effects of multiple concurrent exposures via bayesian kernel machine regression |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6102907/ https://www.ncbi.nlm.nih.gov/pubmed/30126431 http://dx.doi.org/10.1186/s12940-018-0413-y |
work_keys_str_mv | AT bobbjenniferf statisticalsoftwareforanalyzingthehealtheffectsofmultipleconcurrentexposuresviabayesiankernelmachineregression AT claushennbirgit statisticalsoftwareforanalyzingthehealtheffectsofmultipleconcurrentexposuresviabayesiankernelmachineregression AT valerilinda statisticalsoftwareforanalyzingthehealtheffectsofmultipleconcurrentexposuresviabayesiankernelmachineregression AT coullbrenta statisticalsoftwareforanalyzingthehealtheffectsofmultipleconcurrentexposuresviabayesiankernelmachineregression |