Cargando…

Predicting ambient PM(2.5) concentrations in Ulaanbaatar, Mongolia with machine learning approaches

BACKGROUND: Accurately assessing individual ambient air pollution exposure is a crucial part of epidemiological studies looking at the adverse health effect of poor air quality. This is particularly challenging in developing countries with high levels of air pollution but having sparse monitoring ne...

Descripción completa

Detalles Bibliográficos
Autores principales: Enebish, Temuulen, Chau, Khang, Jadamba, Batbayar, Franklin, Meredith
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9871862/
https://www.ncbi.nlm.nih.gov/pubmed/32747729
http://dx.doi.org/10.1038/s41370-020-0257-8
Descripción
Sumario:BACKGROUND: Accurately assessing individual ambient air pollution exposure is a crucial part of epidemiological studies looking at the adverse health effect of poor air quality. This is particularly challenging in developing countries with high levels of air pollution but having sparse monitoring networks with a lack of consistent data. METHODS: We evaluated the performance of 6 different machine learning algorithms in predicting fine particulate matter (PM(2.5)) concentrations in Ulaanbaatar, Mongolia from 2010 to 2018. We found that the algorithms produce robust results based on performance metrics. RESULTS: Random forest (RF) and gradient boosting models performed the best with leave-one-location-out cross-validated R(2) of 0.82 for when using data from the entire study period. After applying tuned models on the hold-out test set, R(2) increased to 0.96 for the RF and 0.90 for the gradient boosting model. We also predicted PM(2.5) concentrations for each administrative area (khoroo) of the city using RF and maps of predictions show spatiotemporal variations that are in line with the location of the ger district, city center, and population density. CONCLUSION: Our results provide evidence of the advantage and feasibility of machine learning approaches in predicting ambient PM(2.5) levels in a setting with limited resources and extreme air pollution levels.