Cargando…

Socioexposomics of COVID-19 across New Jersey: a comparison of geostatistical and machine learning approaches

BACKGROUND: Disparities in adverse COVID-19 health outcomes have been associated with multiple social and environmental stressors. However, research is needed to evaluate the consistency and efficiency of methods for studying these associations at local scales. OBJECTIVE: To assess socioexposomic as...

Descripción completa

Detalles Bibliográficos
Autores principales: Ren, Xiang, Mi, Zhongyuan, Georgopoulos, Panos G.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group US 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9889956/
https://www.ncbi.nlm.nih.gov/pubmed/36725924
http://dx.doi.org/10.1038/s41370-023-00518-0
_version_ 1784880847029010432
author Ren, Xiang
Mi, Zhongyuan
Georgopoulos, Panos G.
author_facet Ren, Xiang
Mi, Zhongyuan
Georgopoulos, Panos G.
author_sort Ren, Xiang
collection PubMed
description BACKGROUND: Disparities in adverse COVID-19 health outcomes have been associated with multiple social and environmental stressors. However, research is needed to evaluate the consistency and efficiency of methods for studying these associations at local scales. OBJECTIVE: To assess socioexposomic associations with COVID-19 outcomes across New Jersey and evaluate consistency of findings from multiple modeling approaches. METHODS: We retrieved data for COVID-19 cases and deaths for the 565 municipalities of New Jersey up to the end of the first phase of the pandemic, and calculated mortality rates with and without long-term-care (LTC) facility deaths. We considered 84 spatially heterogeneous environmental, demographic and socioeconomic factors from publicly available databases, including air pollution, proximity to industrial sites/facilities, transportation-related noise, occupation and commuting, neighborhood and housing characteristics, age structure, racial/ethnic composition, poverty, etc. Six geostatistical models (Poisson/Negative-Binomial regression, Poison/Negative-Binomial mixed effect model, Poisson/Negative-Binomial Bersag-York-Mollie spatial model) and two Machine Learning (ML) methods (Random Forest, Extreme Gradient Boosting) were implemented to assess association patterns. The Shapley effects plot was established for explainable ML and change of support validation was introduced to compare performances of different approaches. RESULTS: We found robust positive associations of COVID-19 mortality with historic exposures to NO(2), population density, percentage of minority and below high school education, and other social and environmental factors. Exclusion of LTC deaths does not significantly affect correlations for most factors but findings can be substantially influenced by model structures and assumptions. The best performing geostatistical models involved flexible structures representing data variations. ML methods captured association patterns consistent with the best performing geostatistical models, and furthermore detected consistent nonlinear associations not captured by geostatistical models. SIGNIFICANCE: The findings of this work improve the understanding of how social and environmental disparities impacted COVID-19 outcomes across New Jersey. [Image: see text]
format Online
Article
Text
id pubmed-9889956
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group US
record_format MEDLINE/PubMed
spelling pubmed-98899562023-02-01 Socioexposomics of COVID-19 across New Jersey: a comparison of geostatistical and machine learning approaches Ren, Xiang Mi, Zhongyuan Georgopoulos, Panos G. J Expo Sci Environ Epidemiol Article BACKGROUND: Disparities in adverse COVID-19 health outcomes have been associated with multiple social and environmental stressors. However, research is needed to evaluate the consistency and efficiency of methods for studying these associations at local scales. OBJECTIVE: To assess socioexposomic associations with COVID-19 outcomes across New Jersey and evaluate consistency of findings from multiple modeling approaches. METHODS: We retrieved data for COVID-19 cases and deaths for the 565 municipalities of New Jersey up to the end of the first phase of the pandemic, and calculated mortality rates with and without long-term-care (LTC) facility deaths. We considered 84 spatially heterogeneous environmental, demographic and socioeconomic factors from publicly available databases, including air pollution, proximity to industrial sites/facilities, transportation-related noise, occupation and commuting, neighborhood and housing characteristics, age structure, racial/ethnic composition, poverty, etc. Six geostatistical models (Poisson/Negative-Binomial regression, Poison/Negative-Binomial mixed effect model, Poisson/Negative-Binomial Bersag-York-Mollie spatial model) and two Machine Learning (ML) methods (Random Forest, Extreme Gradient Boosting) were implemented to assess association patterns. The Shapley effects plot was established for explainable ML and change of support validation was introduced to compare performances of different approaches. RESULTS: We found robust positive associations of COVID-19 mortality with historic exposures to NO(2), population density, percentage of minority and below high school education, and other social and environmental factors. Exclusion of LTC deaths does not significantly affect correlations for most factors but findings can be substantially influenced by model structures and assumptions. The best performing geostatistical models involved flexible structures representing data variations. ML methods captured association patterns consistent with the best performing geostatistical models, and furthermore detected consistent nonlinear associations not captured by geostatistical models. SIGNIFICANCE: The findings of this work improve the understanding of how social and environmental disparities impacted COVID-19 outcomes across New Jersey. [Image: see text] Nature Publishing Group US 2023-02-01 /pmc/articles/PMC9889956/ /pubmed/36725924 http://dx.doi.org/10.1038/s41370-023-00518-0 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Ren, Xiang
Mi, Zhongyuan
Georgopoulos, Panos G.
Socioexposomics of COVID-19 across New Jersey: a comparison of geostatistical and machine learning approaches
title Socioexposomics of COVID-19 across New Jersey: a comparison of geostatistical and machine learning approaches
title_full Socioexposomics of COVID-19 across New Jersey: a comparison of geostatistical and machine learning approaches
title_fullStr Socioexposomics of COVID-19 across New Jersey: a comparison of geostatistical and machine learning approaches
title_full_unstemmed Socioexposomics of COVID-19 across New Jersey: a comparison of geostatistical and machine learning approaches
title_short Socioexposomics of COVID-19 across New Jersey: a comparison of geostatistical and machine learning approaches
title_sort socioexposomics of covid-19 across new jersey: a comparison of geostatistical and machine learning approaches
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9889956/
https://www.ncbi.nlm.nih.gov/pubmed/36725924
http://dx.doi.org/10.1038/s41370-023-00518-0
work_keys_str_mv AT renxiang socioexposomicsofcovid19acrossnewjerseyacomparisonofgeostatisticalandmachinelearningapproaches
AT mizhongyuan socioexposomicsofcovid19acrossnewjerseyacomparisonofgeostatisticalandmachinelearningapproaches
AT georgopoulospanosg socioexposomicsofcovid19acrossnewjerseyacomparisonofgeostatisticalandmachinelearningapproaches