Cargando…

Time series-based PM(2.5) concentration prediction in Jing-Jin-Ji area using machine learning algorithm models

Globally all countries encounter air pollution problems along their development path. As a significant indicator of air quality, PM(2.5) concentration has long been proven to be affecting the population’s death rate. Machine learning algorithms proven to outperform traditional statistical approaches...

Descripción completa

Detalles Bibliográficos
Autores principales: Ma, Xin, Chen, Tengfei, Ge, Rubing, Cui, Caocao, Xu, Fan, Lv, Qi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9519508/
https://www.ncbi.nlm.nih.gov/pubmed/36185154
http://dx.doi.org/10.1016/j.heliyon.2022.e10691
_version_ 1784799415423205376
author Ma, Xin
Chen, Tengfei
Ge, Rubing
Cui, Caocao
Xu, Fan
Lv, Qi
author_facet Ma, Xin
Chen, Tengfei
Ge, Rubing
Cui, Caocao
Xu, Fan
Lv, Qi
author_sort Ma, Xin
collection PubMed
description Globally all countries encounter air pollution problems along their development path. As a significant indicator of air quality, PM(2.5) concentration has long been proven to be affecting the population’s death rate. Machine learning algorithms proven to outperform traditional statistical approaches are widely used in air pollution prediction. However research on the model selection discussion and environmental interpretation of model prediction results is still scarce and urgently needed to lead the policy making on air pollution control. Our research compared four types of machine learning algorisms LinearSVR, K-Nearest Neighbor, Lasso regression, Gradient boosting by looking into their performance in predicting PM(2.5) concentrations among different cities and seasons. The results show that the machine learning model is able to forecast the next day PM(2.5) concentration based on the previous five days' data with better accuracy. The comparative experiments show that based on city level the Gradient Boosting prediction model has better prediction performance with mean absolute error (MAE) of 9 ug/m(3) and root mean square error (RMSE) of 10.25–16.76 ug/m(3), lower compared with the other three models, and based on season level four models have the best prediction performances in winter time and the worst in summer time. And more importantly the demonstration of models' different performances in each city and each season is of great significance in environmental policy implications.
format Online
Article
Text
id pubmed-9519508
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-95195082022-09-30 Time series-based PM(2.5) concentration prediction in Jing-Jin-Ji area using machine learning algorithm models Ma, Xin Chen, Tengfei Ge, Rubing Cui, Caocao Xu, Fan Lv, Qi Heliyon Research Article Globally all countries encounter air pollution problems along their development path. As a significant indicator of air quality, PM(2.5) concentration has long been proven to be affecting the population’s death rate. Machine learning algorithms proven to outperform traditional statistical approaches are widely used in air pollution prediction. However research on the model selection discussion and environmental interpretation of model prediction results is still scarce and urgently needed to lead the policy making on air pollution control. Our research compared four types of machine learning algorisms LinearSVR, K-Nearest Neighbor, Lasso regression, Gradient boosting by looking into their performance in predicting PM(2.5) concentrations among different cities and seasons. The results show that the machine learning model is able to forecast the next day PM(2.5) concentration based on the previous five days' data with better accuracy. The comparative experiments show that based on city level the Gradient Boosting prediction model has better prediction performance with mean absolute error (MAE) of 9 ug/m(3) and root mean square error (RMSE) of 10.25–16.76 ug/m(3), lower compared with the other three models, and based on season level four models have the best prediction performances in winter time and the worst in summer time. And more importantly the demonstration of models' different performances in each city and each season is of great significance in environmental policy implications. Elsevier 2022-09-23 /pmc/articles/PMC9519508/ /pubmed/36185154 http://dx.doi.org/10.1016/j.heliyon.2022.e10691 Text en © 2022 The Author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Research Article
Ma, Xin
Chen, Tengfei
Ge, Rubing
Cui, Caocao
Xu, Fan
Lv, Qi
Time series-based PM(2.5) concentration prediction in Jing-Jin-Ji area using machine learning algorithm models
title Time series-based PM(2.5) concentration prediction in Jing-Jin-Ji area using machine learning algorithm models
title_full Time series-based PM(2.5) concentration prediction in Jing-Jin-Ji area using machine learning algorithm models
title_fullStr Time series-based PM(2.5) concentration prediction in Jing-Jin-Ji area using machine learning algorithm models
title_full_unstemmed Time series-based PM(2.5) concentration prediction in Jing-Jin-Ji area using machine learning algorithm models
title_short Time series-based PM(2.5) concentration prediction in Jing-Jin-Ji area using machine learning algorithm models
title_sort time series-based pm(2.5) concentration prediction in jing-jin-ji area using machine learning algorithm models
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9519508/
https://www.ncbi.nlm.nih.gov/pubmed/36185154
http://dx.doi.org/10.1016/j.heliyon.2022.e10691
work_keys_str_mv AT maxin timeseriesbasedpm25concentrationpredictioninjingjinjiareausingmachinelearningalgorithmmodels
AT chentengfei timeseriesbasedpm25concentrationpredictioninjingjinjiareausingmachinelearningalgorithmmodels
AT gerubing timeseriesbasedpm25concentrationpredictioninjingjinjiareausingmachinelearningalgorithmmodels
AT cuicaocao timeseriesbasedpm25concentrationpredictioninjingjinjiareausingmachinelearningalgorithmmodels
AT xufan timeseriesbasedpm25concentrationpredictioninjingjinjiareausingmachinelearningalgorithmmodels
AT lvqi timeseriesbasedpm25concentrationpredictioninjingjinjiareausingmachinelearningalgorithmmodels