Cargando…
73432 Assessment of multi-pollutant ambient air composition on type 2 diabetes mellitus using machine learning.
ABSTRACT IMPACT: We explored the use of machine learning to explore how multi-pollutant air quality is related to type 2 diabetes, which is more representative than the single pollutant models often employed to assess this relationship. OBJECTIVES/GOALS: Single pollutant air pollution models have co...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cambridge University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8827755/ http://dx.doi.org/10.1017/cts.2021.522 |
_version_ | 1784647704272437248 |
---|---|
author | Riches, Naomi Oiwa Gouripeddi, Ramkiran Facelli, Julio C. |
author_facet | Riches, Naomi Oiwa Gouripeddi, Ramkiran Facelli, Julio C. |
author_sort | Riches, Naomi Oiwa |
collection | PubMed |
description | ABSTRACT IMPACT: We explored the use of machine learning to explore how multi-pollutant air quality is related to type 2 diabetes, which is more representative than the single pollutant models often employed to assess this relationship. OBJECTIVES/GOALS: Single pollutant air pollution models have correlated air pollution components with type 2 diabetes mellitus (DM). However, air pollution is a complex mixture, therefore, we explored the relationship between multi-pollutant air quality and DM incidence using machine learning. METHODS/STUDY POPULATION: Annual diabetes incidence from the CDC for each US county was downloaded for the years 2007-2016. Daily air pollution concentrations for PM2.5, PM10, CO, SO2, NO2, and O3 were downloaded from the US EPA for the years 2006-2015. K-means clustering, an unsupervised machine learning method, was employed to partition all air pollution components, for each day and county monitored, into the optimal number of clusters. Change in DM incidence was matched to air pollution clusters by county, lagged by one year. Additionally, NASA satellite-derived air pollution data will be compared to EPA data to inspect as a potential source for future clustering analysis of counties that do not have an EPA monitor. RESULTS/ANTICIPATED RESULTS: The largest increase of annual DM incidence was associated with the cluster having the highest average PM10, PM2.5, and CO, and the second greatest average NO2 concentrations. Inversely, the most significant decrease of annual DM incidence was associated with the cluster having the lowest PM10, PM2.5, and CO. While average PM10, PM2.5, SO2, NO2, and CO showed a rising tendency with elevating change of DM incidence, ozone did not show any such trend. It is anticipated that the NASA satellite-derived air pollution data will approximate the EPA air quality data and will be usable in assessing the air pollution-DM relationship for areas currently not monitored by the EPA. DISCUSSION/SIGNIFICANCE OF FINDINGS: Using an unsupervised k-means algorithm, we showed multiple ambient air components were related to increased incidence of T2DM even when average concentrations were below the National Ambient Air Quality Standards. This work could help guide policy making regarding air quality standards in the future. |
format | Online Article Text |
id | pubmed-8827755 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Cambridge University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-88277552022-02-28 73432 Assessment of multi-pollutant ambient air composition on type 2 diabetes mellitus using machine learning. Riches, Naomi Oiwa Gouripeddi, Ramkiran Facelli, Julio C. J Clin Transl Sci Data Science/Biostatistics/Informatics ABSTRACT IMPACT: We explored the use of machine learning to explore how multi-pollutant air quality is related to type 2 diabetes, which is more representative than the single pollutant models often employed to assess this relationship. OBJECTIVES/GOALS: Single pollutant air pollution models have correlated air pollution components with type 2 diabetes mellitus (DM). However, air pollution is a complex mixture, therefore, we explored the relationship between multi-pollutant air quality and DM incidence using machine learning. METHODS/STUDY POPULATION: Annual diabetes incidence from the CDC for each US county was downloaded for the years 2007-2016. Daily air pollution concentrations for PM2.5, PM10, CO, SO2, NO2, and O3 were downloaded from the US EPA for the years 2006-2015. K-means clustering, an unsupervised machine learning method, was employed to partition all air pollution components, for each day and county monitored, into the optimal number of clusters. Change in DM incidence was matched to air pollution clusters by county, lagged by one year. Additionally, NASA satellite-derived air pollution data will be compared to EPA data to inspect as a potential source for future clustering analysis of counties that do not have an EPA monitor. RESULTS/ANTICIPATED RESULTS: The largest increase of annual DM incidence was associated with the cluster having the highest average PM10, PM2.5, and CO, and the second greatest average NO2 concentrations. Inversely, the most significant decrease of annual DM incidence was associated with the cluster having the lowest PM10, PM2.5, and CO. While average PM10, PM2.5, SO2, NO2, and CO showed a rising tendency with elevating change of DM incidence, ozone did not show any such trend. It is anticipated that the NASA satellite-derived air pollution data will approximate the EPA air quality data and will be usable in assessing the air pollution-DM relationship for areas currently not monitored by the EPA. DISCUSSION/SIGNIFICANCE OF FINDINGS: Using an unsupervised k-means algorithm, we showed multiple ambient air components were related to increased incidence of T2DM even when average concentrations were below the National Ambient Air Quality Standards. This work could help guide policy making regarding air quality standards in the future. Cambridge University Press 2021-03-30 /pmc/articles/PMC8827755/ http://dx.doi.org/10.1017/cts.2021.522 Text en © The Association for Clinical and Translational Science 2021 https://creativecommons.org/licenses/by/4.0/This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Data Science/Biostatistics/Informatics Riches, Naomi Oiwa Gouripeddi, Ramkiran Facelli, Julio C. 73432 Assessment of multi-pollutant ambient air composition on type 2 diabetes mellitus using machine learning. |
title | 73432 Assessment of multi-pollutant ambient air composition on type 2 diabetes mellitus using machine learning. |
title_full | 73432 Assessment of multi-pollutant ambient air composition on type 2 diabetes mellitus using machine learning. |
title_fullStr | 73432 Assessment of multi-pollutant ambient air composition on type 2 diabetes mellitus using machine learning. |
title_full_unstemmed | 73432 Assessment of multi-pollutant ambient air composition on type 2 diabetes mellitus using machine learning. |
title_short | 73432 Assessment of multi-pollutant ambient air composition on type 2 diabetes mellitus using machine learning. |
title_sort | 73432 assessment of multi-pollutant ambient air composition on type 2 diabetes mellitus using machine learning. |
topic | Data Science/Biostatistics/Informatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8827755/ http://dx.doi.org/10.1017/cts.2021.522 |
work_keys_str_mv | AT richesnaomioiwa 73432assessmentofmultipollutantambientaircompositionontype2diabetesmellitususingmachinelearning AT gouripeddiramkiran 73432assessmentofmultipollutantambientaircompositionontype2diabetesmellitususingmachinelearning AT facellijulioc 73432assessmentofmultipollutantambientaircompositionontype2diabetesmellitususingmachinelearning |