Cargando…

Prophet forecasting model: a machine learning approach to predict the concentration of air pollutants (PM(2.5), PM(10), O(3), NO(2), SO(2), CO) in Seoul, South Korea

Amidst recent industrialization in South Korea, Seoul has experienced high levels of air pollution, an issue that is magnified due to a lack of effective air pollution prediction techniques. In this study, the Prophet forecasting model (PFM) was used to predict both short-term and long-term air poll...

Descripción completa

Detalles Bibliográficos
Autores principales: Shen, Justin, Valagolam, Davesh, McCalla, Serena
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7500321/
https://www.ncbi.nlm.nih.gov/pubmed/32983651
http://dx.doi.org/10.7717/peerj.9961
_version_ 1783583840409747456
author Shen, Justin
Valagolam, Davesh
McCalla, Serena
author_facet Shen, Justin
Valagolam, Davesh
McCalla, Serena
author_sort Shen, Justin
collection PubMed
description Amidst recent industrialization in South Korea, Seoul has experienced high levels of air pollution, an issue that is magnified due to a lack of effective air pollution prediction techniques. In this study, the Prophet forecasting model (PFM) was used to predict both short-term and long-term air pollution in Seoul. The air pollutants forecasted in this study were PM(2.5), PM(10), O(3), NO(2), SO(2), and CO, air pollutants responsible for numerous health conditions upon long-term exposure. Current chemical models to predict air pollution require complex source lists making them difficult to use. Machine learning models have also been implemented however their requirement of meteorological parameters render the models ineffective as additional models and infrastructure need to be in place to model meteorology. To address this, a model needs to be created that can accurately predict pollution based on time. A dataset containing three years worth of hourly air quality measurements in Seoul was sourced from the Seoul Open Data Plaza. To optimize the model, PFM has the following parameters: model type, changepoints, seasonality, holidays, and error. Cross validation was performed on the 2017–18 data; then, the model predicted 2019 values. To compare the predicted and actual values and determine the accuracy of the model, the statistical indicators: mean squared error (MSE), mean absolute error (MAE), root mean squared error (RMSE), and coverage were used. PFM predicted PM(2.5) and PM(10) with a MAE value of 12.6 µg/m(3) and 19.6 µg/m(3), respectively. PFM also predicted SO(2) and CO with a MAE value of 0.00124 ppm and 0.207 ppm, respectively. PFM’s prediction of PM(2.5) and PM(10) had a MAE approximately 2 times and 4 times less, respectively, than comparable models. PFM’s prediction of SO(2)and CO had a MAE approximately five times and 50 times less, respectively, than comparable models. In most cases, PFM’s ability to accurately forecast the concentration of air pollutants in Seoul up to one year in advance outperformed similar models proposed in literature. This study addresses the limitations of the prior two PFM studies by expanding the modelled air pollutants from three pollutants to six pollutants while increasing the prediction time from 3 days to 1 year. This is also the first research to use PFM in Seoul, Korea. To achieve more accurate results, a larger air pollution dataset needs to be implemented with PFM. In the future, PFM should be used to predict and model air pollution in other regions, especially those without advanced infrastructure to model meteorology alongside air pollution. In Seoul, Seoul’s government can use PFM to accurately predict air pollution concentrations and plan accordingly.
format Online
Article
Text
id pubmed-7500321
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-75003212020-09-25 Prophet forecasting model: a machine learning approach to predict the concentration of air pollutants (PM(2.5), PM(10), O(3), NO(2), SO(2), CO) in Seoul, South Korea Shen, Justin Valagolam, Davesh McCalla, Serena PeerJ Atmospheric Chemistry Amidst recent industrialization in South Korea, Seoul has experienced high levels of air pollution, an issue that is magnified due to a lack of effective air pollution prediction techniques. In this study, the Prophet forecasting model (PFM) was used to predict both short-term and long-term air pollution in Seoul. The air pollutants forecasted in this study were PM(2.5), PM(10), O(3), NO(2), SO(2), and CO, air pollutants responsible for numerous health conditions upon long-term exposure. Current chemical models to predict air pollution require complex source lists making them difficult to use. Machine learning models have also been implemented however their requirement of meteorological parameters render the models ineffective as additional models and infrastructure need to be in place to model meteorology. To address this, a model needs to be created that can accurately predict pollution based on time. A dataset containing three years worth of hourly air quality measurements in Seoul was sourced from the Seoul Open Data Plaza. To optimize the model, PFM has the following parameters: model type, changepoints, seasonality, holidays, and error. Cross validation was performed on the 2017–18 data; then, the model predicted 2019 values. To compare the predicted and actual values and determine the accuracy of the model, the statistical indicators: mean squared error (MSE), mean absolute error (MAE), root mean squared error (RMSE), and coverage were used. PFM predicted PM(2.5) and PM(10) with a MAE value of 12.6 µg/m(3) and 19.6 µg/m(3), respectively. PFM also predicted SO(2) and CO with a MAE value of 0.00124 ppm and 0.207 ppm, respectively. PFM’s prediction of PM(2.5) and PM(10) had a MAE approximately 2 times and 4 times less, respectively, than comparable models. PFM’s prediction of SO(2)and CO had a MAE approximately five times and 50 times less, respectively, than comparable models. In most cases, PFM’s ability to accurately forecast the concentration of air pollutants in Seoul up to one year in advance outperformed similar models proposed in literature. This study addresses the limitations of the prior two PFM studies by expanding the modelled air pollutants from three pollutants to six pollutants while increasing the prediction time from 3 days to 1 year. This is also the first research to use PFM in Seoul, Korea. To achieve more accurate results, a larger air pollution dataset needs to be implemented with PFM. In the future, PFM should be used to predict and model air pollution in other regions, especially those without advanced infrastructure to model meteorology alongside air pollution. In Seoul, Seoul’s government can use PFM to accurately predict air pollution concentrations and plan accordingly. PeerJ Inc. 2020-09-15 /pmc/articles/PMC7500321/ /pubmed/32983651 http://dx.doi.org/10.7717/peerj.9961 Text en ©2020 Shen et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Atmospheric Chemistry
Shen, Justin
Valagolam, Davesh
McCalla, Serena
Prophet forecasting model: a machine learning approach to predict the concentration of air pollutants (PM(2.5), PM(10), O(3), NO(2), SO(2), CO) in Seoul, South Korea
title Prophet forecasting model: a machine learning approach to predict the concentration of air pollutants (PM(2.5), PM(10), O(3), NO(2), SO(2), CO) in Seoul, South Korea
title_full Prophet forecasting model: a machine learning approach to predict the concentration of air pollutants (PM(2.5), PM(10), O(3), NO(2), SO(2), CO) in Seoul, South Korea
title_fullStr Prophet forecasting model: a machine learning approach to predict the concentration of air pollutants (PM(2.5), PM(10), O(3), NO(2), SO(2), CO) in Seoul, South Korea
title_full_unstemmed Prophet forecasting model: a machine learning approach to predict the concentration of air pollutants (PM(2.5), PM(10), O(3), NO(2), SO(2), CO) in Seoul, South Korea
title_short Prophet forecasting model: a machine learning approach to predict the concentration of air pollutants (PM(2.5), PM(10), O(3), NO(2), SO(2), CO) in Seoul, South Korea
title_sort prophet forecasting model: a machine learning approach to predict the concentration of air pollutants (pm(2.5), pm(10), o(3), no(2), so(2), co) in seoul, south korea
topic Atmospheric Chemistry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7500321/
https://www.ncbi.nlm.nih.gov/pubmed/32983651
http://dx.doi.org/10.7717/peerj.9961
work_keys_str_mv AT shenjustin prophetforecastingmodelamachinelearningapproachtopredicttheconcentrationofairpollutantspm25pm10o3no2so2coinseoulsouthkorea
AT valagolamdavesh prophetforecastingmodelamachinelearningapproachtopredicttheconcentrationofairpollutantspm25pm10o3no2so2coinseoulsouthkorea
AT mccallaserena prophetforecastingmodelamachinelearningapproachtopredicttheconcentrationofairpollutantspm25pm10o3no2so2coinseoulsouthkorea