Cargando…

Comparing Methods to Impute Missing Daily Ground-Level PM(10) Concentrations between 2010–2017 in South Africa

Good quality and completeness of ambient air quality monitoring data is central in supporting actions towards mitigating the impact of ambient air pollution. In South Africa, however, availability of continuous ground-level air pollution monitoring data is scarce and incomplete. To address this issu...

Descripción completa

Detalles Bibliográficos
Autores principales: Arowosegbe, Oluwaseyi Olalekan, Röösli, Martin, Künzli, Nino, Saucy, Apolline, Adebayo-Ojo, Temitope Christina, Jeebhay, Mohamed F., Dalvie, Mohammed Aqiel, de Hoogh, Kees
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8037804/
https://www.ncbi.nlm.nih.gov/pubmed/33805155
http://dx.doi.org/10.3390/ijerph18073374
_version_ 1783677229119569920
author Arowosegbe, Oluwaseyi Olalekan
Röösli, Martin
Künzli, Nino
Saucy, Apolline
Adebayo-Ojo, Temitope Christina
Jeebhay, Mohamed F.
Dalvie, Mohammed Aqiel
de Hoogh, Kees
author_facet Arowosegbe, Oluwaseyi Olalekan
Röösli, Martin
Künzli, Nino
Saucy, Apolline
Adebayo-Ojo, Temitope Christina
Jeebhay, Mohamed F.
Dalvie, Mohammed Aqiel
de Hoogh, Kees
author_sort Arowosegbe, Oluwaseyi Olalekan
collection PubMed
description Good quality and completeness of ambient air quality monitoring data is central in supporting actions towards mitigating the impact of ambient air pollution. In South Africa, however, availability of continuous ground-level air pollution monitoring data is scarce and incomplete. To address this issue, we developed and compared different modeling approaches to impute missing daily average particulate matter (PM(10)) data between 2010 and 2017 using spatiotemporal predictor variables. The random forest (RF) machine learning method was used to explore the relationship between average daily PM(10) concentrations and spatiotemporal predictors like meteorological, land use and source-related variables. National (8 models), provincial (32) and site-specific (44) RF models were developed to impute missing daily PM(10) data. The annual national, provincial and site-specific RF cross-validation (CV) models explained on average 78%, 70% and 55% of ground-level PM(10) concentrations, respectively. The spatial components of the national and provincial CV RF models explained on average 22% and 48%, while the temporal components of the national, provincial and site-specific CV RF models explained on average 78%, 68% and 57% of ground-level PM(10) concentrations, respectively. This study demonstrates a feasible approach based on RF to impute missing measurement data in areas where data collection is sparse and incomplete.
format Online
Article
Text
id pubmed-8037804
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-80378042021-04-12 Comparing Methods to Impute Missing Daily Ground-Level PM(10) Concentrations between 2010–2017 in South Africa Arowosegbe, Oluwaseyi Olalekan Röösli, Martin Künzli, Nino Saucy, Apolline Adebayo-Ojo, Temitope Christina Jeebhay, Mohamed F. Dalvie, Mohammed Aqiel de Hoogh, Kees Int J Environ Res Public Health Article Good quality and completeness of ambient air quality monitoring data is central in supporting actions towards mitigating the impact of ambient air pollution. In South Africa, however, availability of continuous ground-level air pollution monitoring data is scarce and incomplete. To address this issue, we developed and compared different modeling approaches to impute missing daily average particulate matter (PM(10)) data between 2010 and 2017 using spatiotemporal predictor variables. The random forest (RF) machine learning method was used to explore the relationship between average daily PM(10) concentrations and spatiotemporal predictors like meteorological, land use and source-related variables. National (8 models), provincial (32) and site-specific (44) RF models were developed to impute missing daily PM(10) data. The annual national, provincial and site-specific RF cross-validation (CV) models explained on average 78%, 70% and 55% of ground-level PM(10) concentrations, respectively. The spatial components of the national and provincial CV RF models explained on average 22% and 48%, while the temporal components of the national, provincial and site-specific CV RF models explained on average 78%, 68% and 57% of ground-level PM(10) concentrations, respectively. This study demonstrates a feasible approach based on RF to impute missing measurement data in areas where data collection is sparse and incomplete. MDPI 2021-03-24 /pmc/articles/PMC8037804/ /pubmed/33805155 http://dx.doi.org/10.3390/ijerph18073374 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ).
spellingShingle Article
Arowosegbe, Oluwaseyi Olalekan
Röösli, Martin
Künzli, Nino
Saucy, Apolline
Adebayo-Ojo, Temitope Christina
Jeebhay, Mohamed F.
Dalvie, Mohammed Aqiel
de Hoogh, Kees
Comparing Methods to Impute Missing Daily Ground-Level PM(10) Concentrations between 2010–2017 in South Africa
title Comparing Methods to Impute Missing Daily Ground-Level PM(10) Concentrations between 2010–2017 in South Africa
title_full Comparing Methods to Impute Missing Daily Ground-Level PM(10) Concentrations between 2010–2017 in South Africa
title_fullStr Comparing Methods to Impute Missing Daily Ground-Level PM(10) Concentrations between 2010–2017 in South Africa
title_full_unstemmed Comparing Methods to Impute Missing Daily Ground-Level PM(10) Concentrations between 2010–2017 in South Africa
title_short Comparing Methods to Impute Missing Daily Ground-Level PM(10) Concentrations between 2010–2017 in South Africa
title_sort comparing methods to impute missing daily ground-level pm(10) concentrations between 2010–2017 in south africa
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8037804/
https://www.ncbi.nlm.nih.gov/pubmed/33805155
http://dx.doi.org/10.3390/ijerph18073374
work_keys_str_mv AT arowosegbeoluwaseyiolalekan comparingmethodstoimputemissingdailygroundlevelpm10concentrationsbetween20102017insouthafrica
AT rooslimartin comparingmethodstoimputemissingdailygroundlevelpm10concentrationsbetween20102017insouthafrica
AT kunzlinino comparingmethodstoimputemissingdailygroundlevelpm10concentrationsbetween20102017insouthafrica
AT saucyapolline comparingmethodstoimputemissingdailygroundlevelpm10concentrationsbetween20102017insouthafrica
AT adebayoojotemitopechristina comparingmethodstoimputemissingdailygroundlevelpm10concentrationsbetween20102017insouthafrica
AT jeebhaymohamedf comparingmethodstoimputemissingdailygroundlevelpm10concentrationsbetween20102017insouthafrica
AT dalviemohammedaqiel comparingmethodstoimputemissingdailygroundlevelpm10concentrationsbetween20102017insouthafrica
AT dehooghkees comparingmethodstoimputemissingdailygroundlevelpm10concentrationsbetween20102017insouthafrica