Cargando…

73432 Assessment of multi-pollutant ambient air composition on type 2 diabetes mellitus using machine learning.

ABSTRACT IMPACT: We explored the use of machine learning to explore how multi-pollutant air quality is related to type 2 diabetes, which is more representative than the single pollutant models often employed to assess this relationship. OBJECTIVES/GOALS: Single pollutant air pollution models have co...

Descripción completa

Detalles Bibliográficos
Autores principales: Riches, Naomi Oiwa, Gouripeddi, Ramkiran, Facelli, Julio C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cambridge University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8827755/
http://dx.doi.org/10.1017/cts.2021.522
_version_ 1784647704272437248
author Riches, Naomi Oiwa
Gouripeddi, Ramkiran
Facelli, Julio C.
author_facet Riches, Naomi Oiwa
Gouripeddi, Ramkiran
Facelli, Julio C.
author_sort Riches, Naomi Oiwa
collection PubMed
description ABSTRACT IMPACT: We explored the use of machine learning to explore how multi-pollutant air quality is related to type 2 diabetes, which is more representative than the single pollutant models often employed to assess this relationship. OBJECTIVES/GOALS: Single pollutant air pollution models have correlated air pollution components with type 2 diabetes mellitus (DM). However, air pollution is a complex mixture, therefore, we explored the relationship between multi-pollutant air quality and DM incidence using machine learning. METHODS/STUDY POPULATION: Annual diabetes incidence from the CDC for each US county was downloaded for the years 2007-2016. Daily air pollution concentrations for PM2.5, PM10, CO, SO2, NO2, and O3 were downloaded from the US EPA for the years 2006-2015. K-means clustering, an unsupervised machine learning method, was employed to partition all air pollution components, for each day and county monitored, into the optimal number of clusters. Change in DM incidence was matched to air pollution clusters by county, lagged by one year. Additionally, NASA satellite-derived air pollution data will be compared to EPA data to inspect as a potential source for future clustering analysis of counties that do not have an EPA monitor. RESULTS/ANTICIPATED RESULTS: The largest increase of annual DM incidence was associated with the cluster having the highest average PM10, PM2.5, and CO, and the second greatest average NO2 concentrations. Inversely, the most significant decrease of annual DM incidence was associated with the cluster having the lowest PM10, PM2.5, and CO. While average PM10, PM2.5, SO2, NO2, and CO showed a rising tendency with elevating change of DM incidence, ozone did not show any such trend. It is anticipated that the NASA satellite-derived air pollution data will approximate the EPA air quality data and will be usable in assessing the air pollution-DM relationship for areas currently not monitored by the EPA. DISCUSSION/SIGNIFICANCE OF FINDINGS: Using an unsupervised k-means algorithm, we showed multiple ambient air components were related to increased incidence of T2DM even when average concentrations were below the National Ambient Air Quality Standards. This work could help guide policy making regarding air quality standards in the future.
format Online
Article
Text
id pubmed-8827755
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Cambridge University Press
record_format MEDLINE/PubMed
spelling pubmed-88277552022-02-28 73432 Assessment of multi-pollutant ambient air composition on type 2 diabetes mellitus using machine learning. Riches, Naomi Oiwa Gouripeddi, Ramkiran Facelli, Julio C. J Clin Transl Sci Data Science/Biostatistics/Informatics ABSTRACT IMPACT: We explored the use of machine learning to explore how multi-pollutant air quality is related to type 2 diabetes, which is more representative than the single pollutant models often employed to assess this relationship. OBJECTIVES/GOALS: Single pollutant air pollution models have correlated air pollution components with type 2 diabetes mellitus (DM). However, air pollution is a complex mixture, therefore, we explored the relationship between multi-pollutant air quality and DM incidence using machine learning. METHODS/STUDY POPULATION: Annual diabetes incidence from the CDC for each US county was downloaded for the years 2007-2016. Daily air pollution concentrations for PM2.5, PM10, CO, SO2, NO2, and O3 were downloaded from the US EPA for the years 2006-2015. K-means clustering, an unsupervised machine learning method, was employed to partition all air pollution components, for each day and county monitored, into the optimal number of clusters. Change in DM incidence was matched to air pollution clusters by county, lagged by one year. Additionally, NASA satellite-derived air pollution data will be compared to EPA data to inspect as a potential source for future clustering analysis of counties that do not have an EPA monitor. RESULTS/ANTICIPATED RESULTS: The largest increase of annual DM incidence was associated with the cluster having the highest average PM10, PM2.5, and CO, and the second greatest average NO2 concentrations. Inversely, the most significant decrease of annual DM incidence was associated with the cluster having the lowest PM10, PM2.5, and CO. While average PM10, PM2.5, SO2, NO2, and CO showed a rising tendency with elevating change of DM incidence, ozone did not show any such trend. It is anticipated that the NASA satellite-derived air pollution data will approximate the EPA air quality data and will be usable in assessing the air pollution-DM relationship for areas currently not monitored by the EPA. DISCUSSION/SIGNIFICANCE OF FINDINGS: Using an unsupervised k-means algorithm, we showed multiple ambient air components were related to increased incidence of T2DM even when average concentrations were below the National Ambient Air Quality Standards. This work could help guide policy making regarding air quality standards in the future. Cambridge University Press 2021-03-30 /pmc/articles/PMC8827755/ http://dx.doi.org/10.1017/cts.2021.522 Text en © The Association for Clinical and Translational Science 2021 https://creativecommons.org/licenses/by/4.0/This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Data Science/Biostatistics/Informatics
Riches, Naomi Oiwa
Gouripeddi, Ramkiran
Facelli, Julio C.
73432 Assessment of multi-pollutant ambient air composition on type 2 diabetes mellitus using machine learning.
title 73432 Assessment of multi-pollutant ambient air composition on type 2 diabetes mellitus using machine learning.
title_full 73432 Assessment of multi-pollutant ambient air composition on type 2 diabetes mellitus using machine learning.
title_fullStr 73432 Assessment of multi-pollutant ambient air composition on type 2 diabetes mellitus using machine learning.
title_full_unstemmed 73432 Assessment of multi-pollutant ambient air composition on type 2 diabetes mellitus using machine learning.
title_short 73432 Assessment of multi-pollutant ambient air composition on type 2 diabetes mellitus using machine learning.
title_sort 73432 assessment of multi-pollutant ambient air composition on type 2 diabetes mellitus using machine learning.
topic Data Science/Biostatistics/Informatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8827755/
http://dx.doi.org/10.1017/cts.2021.522
work_keys_str_mv AT richesnaomioiwa 73432assessmentofmultipollutantambientaircompositionontype2diabetesmellitususingmachinelearning
AT gouripeddiramkiran 73432assessmentofmultipollutantambientaircompositionontype2diabetesmellitususingmachinelearning
AT facellijulioc 73432assessmentofmultipollutantambientaircompositionontype2diabetesmellitususingmachinelearning