Cargando…
Socioexposomics of COVID-19 across New Jersey: a comparison of geostatistical and machine learning approaches
BACKGROUND: Disparities in adverse COVID-19 health outcomes have been associated with multiple social and environmental stressors. However, research is needed to evaluate the consistency and efficiency of methods for studying these associations at local scales. OBJECTIVE: To assess socioexposomic as...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group US
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9889956/ https://www.ncbi.nlm.nih.gov/pubmed/36725924 http://dx.doi.org/10.1038/s41370-023-00518-0 |
_version_ | 1784880847029010432 |
---|---|
author | Ren, Xiang Mi, Zhongyuan Georgopoulos, Panos G. |
author_facet | Ren, Xiang Mi, Zhongyuan Georgopoulos, Panos G. |
author_sort | Ren, Xiang |
collection | PubMed |
description | BACKGROUND: Disparities in adverse COVID-19 health outcomes have been associated with multiple social and environmental stressors. However, research is needed to evaluate the consistency and efficiency of methods for studying these associations at local scales. OBJECTIVE: To assess socioexposomic associations with COVID-19 outcomes across New Jersey and evaluate consistency of findings from multiple modeling approaches. METHODS: We retrieved data for COVID-19 cases and deaths for the 565 municipalities of New Jersey up to the end of the first phase of the pandemic, and calculated mortality rates with and without long-term-care (LTC) facility deaths. We considered 84 spatially heterogeneous environmental, demographic and socioeconomic factors from publicly available databases, including air pollution, proximity to industrial sites/facilities, transportation-related noise, occupation and commuting, neighborhood and housing characteristics, age structure, racial/ethnic composition, poverty, etc. Six geostatistical models (Poisson/Negative-Binomial regression, Poison/Negative-Binomial mixed effect model, Poisson/Negative-Binomial Bersag-York-Mollie spatial model) and two Machine Learning (ML) methods (Random Forest, Extreme Gradient Boosting) were implemented to assess association patterns. The Shapley effects plot was established for explainable ML and change of support validation was introduced to compare performances of different approaches. RESULTS: We found robust positive associations of COVID-19 mortality with historic exposures to NO(2), population density, percentage of minority and below high school education, and other social and environmental factors. Exclusion of LTC deaths does not significantly affect correlations for most factors but findings can be substantially influenced by model structures and assumptions. The best performing geostatistical models involved flexible structures representing data variations. ML methods captured association patterns consistent with the best performing geostatistical models, and furthermore detected consistent nonlinear associations not captured by geostatistical models. SIGNIFICANCE: The findings of this work improve the understanding of how social and environmental disparities impacted COVID-19 outcomes across New Jersey. [Image: see text] |
format | Online Article Text |
id | pubmed-9889956 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group US |
record_format | MEDLINE/PubMed |
spelling | pubmed-98899562023-02-01 Socioexposomics of COVID-19 across New Jersey: a comparison of geostatistical and machine learning approaches Ren, Xiang Mi, Zhongyuan Georgopoulos, Panos G. J Expo Sci Environ Epidemiol Article BACKGROUND: Disparities in adverse COVID-19 health outcomes have been associated with multiple social and environmental stressors. However, research is needed to evaluate the consistency and efficiency of methods for studying these associations at local scales. OBJECTIVE: To assess socioexposomic associations with COVID-19 outcomes across New Jersey and evaluate consistency of findings from multiple modeling approaches. METHODS: We retrieved data for COVID-19 cases and deaths for the 565 municipalities of New Jersey up to the end of the first phase of the pandemic, and calculated mortality rates with and without long-term-care (LTC) facility deaths. We considered 84 spatially heterogeneous environmental, demographic and socioeconomic factors from publicly available databases, including air pollution, proximity to industrial sites/facilities, transportation-related noise, occupation and commuting, neighborhood and housing characteristics, age structure, racial/ethnic composition, poverty, etc. Six geostatistical models (Poisson/Negative-Binomial regression, Poison/Negative-Binomial mixed effect model, Poisson/Negative-Binomial Bersag-York-Mollie spatial model) and two Machine Learning (ML) methods (Random Forest, Extreme Gradient Boosting) were implemented to assess association patterns. The Shapley effects plot was established for explainable ML and change of support validation was introduced to compare performances of different approaches. RESULTS: We found robust positive associations of COVID-19 mortality with historic exposures to NO(2), population density, percentage of minority and below high school education, and other social and environmental factors. Exclusion of LTC deaths does not significantly affect correlations for most factors but findings can be substantially influenced by model structures and assumptions. The best performing geostatistical models involved flexible structures representing data variations. ML methods captured association patterns consistent with the best performing geostatistical models, and furthermore detected consistent nonlinear associations not captured by geostatistical models. SIGNIFICANCE: The findings of this work improve the understanding of how social and environmental disparities impacted COVID-19 outcomes across New Jersey. [Image: see text] Nature Publishing Group US 2023-02-01 /pmc/articles/PMC9889956/ /pubmed/36725924 http://dx.doi.org/10.1038/s41370-023-00518-0 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Ren, Xiang Mi, Zhongyuan Georgopoulos, Panos G. Socioexposomics of COVID-19 across New Jersey: a comparison of geostatistical and machine learning approaches |
title | Socioexposomics of COVID-19 across New Jersey: a comparison of geostatistical and machine learning approaches |
title_full | Socioexposomics of COVID-19 across New Jersey: a comparison of geostatistical and machine learning approaches |
title_fullStr | Socioexposomics of COVID-19 across New Jersey: a comparison of geostatistical and machine learning approaches |
title_full_unstemmed | Socioexposomics of COVID-19 across New Jersey: a comparison of geostatistical and machine learning approaches |
title_short | Socioexposomics of COVID-19 across New Jersey: a comparison of geostatistical and machine learning approaches |
title_sort | socioexposomics of covid-19 across new jersey: a comparison of geostatistical and machine learning approaches |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9889956/ https://www.ncbi.nlm.nih.gov/pubmed/36725924 http://dx.doi.org/10.1038/s41370-023-00518-0 |
work_keys_str_mv | AT renxiang socioexposomicsofcovid19acrossnewjerseyacomparisonofgeostatisticalandmachinelearningapproaches AT mizhongyuan socioexposomicsofcovid19acrossnewjerseyacomparisonofgeostatisticalandmachinelearningapproaches AT georgopoulospanosg socioexposomicsofcovid19acrossnewjerseyacomparisonofgeostatisticalandmachinelearningapproaches |