Cargando…

Machine learning methods to predict particulate matter PM (2.5)

Introduction Pollution of air in urban cities across the world has been steadily increasing in recent years. An increasing trend in particulate matter, PM (2.5), is a threat because it can lead to uncontrollable consequences like worsening of asthma and cardiovascular disease. The metric used to mea...

Descripción completa

Detalles Bibliográficos
Autores principales: Palanichamy, Naveen, Haw, Su-Cheng, S, Subramanian, Murugan, Rishanti, Govindasamy, Kuhaneswaran
Formato: Online Artículo Texto
Lenguaje:English
Publicado: F1000 Research Limited 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9723408/
https://www.ncbi.nlm.nih.gov/pubmed/36531254
http://dx.doi.org/10.12688/f1000research.73166.1
_version_ 1784844169264496640
author Palanichamy, Naveen
Haw, Su-Cheng
S, Subramanian
Murugan, Rishanti
Govindasamy, Kuhaneswaran
author_facet Palanichamy, Naveen
Haw, Su-Cheng
S, Subramanian
Murugan, Rishanti
Govindasamy, Kuhaneswaran
author_sort Palanichamy, Naveen
collection PubMed
description Introduction Pollution of air in urban cities across the world has been steadily increasing in recent years. An increasing trend in particulate matter, PM (2.5), is a threat because it can lead to uncontrollable consequences like worsening of asthma and cardiovascular disease. The metric used to measure air quality is the air pollutant index (API). In Malaysia, machine learning (ML) techniques for PM (2.5 )have received less attention as the concentration is on predicting other air pollutants. To fill the research gap, this study focuses on correctly predicting PM (2.5) concentrations in the smart cities of Malaysia by comparing supervised ML techniques, which helps to mitigate its adverse effects. Methods In this paper, ML models for forecasting PM (2.5) concentrations were investigated on Malaysian air quality data sets from 2017 to 2018. The dataset was preprocessed by data cleaning and a normalization process. Next, it was reduced into an informative dataset with location and time factors in the feature extraction process. The dataset was fed into three supervised ML classifiers, which include random forest (RF), artificial neural network (ANN) and long short-term memory (LSTM). Finally, their output was evaluated using the confusion matrix and compared to identify the best model for the accurate prediction of PM (2.5). Results Overall, the experimental result shows an accuracy of 97.7% was obtained by the RF model in comparison with the accuracy of ANN (61.14%) and LSTM (61.77%) in predicting PM (2.5). Discussion RF performed well when compared with ANN and LSTM for the given data with minimum features. RF was able to reach good accuracy as the model learns from the random samples by using decision tree with the maximum vote on the predictions.
format Online
Article
Text
id pubmed-9723408
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher F1000 Research Limited
record_format MEDLINE/PubMed
spelling pubmed-97234082022-12-15 Machine learning methods to predict particulate matter PM (2.5) Palanichamy, Naveen Haw, Su-Cheng S, Subramanian Murugan, Rishanti Govindasamy, Kuhaneswaran F1000Res Research Article Introduction Pollution of air in urban cities across the world has been steadily increasing in recent years. An increasing trend in particulate matter, PM (2.5), is a threat because it can lead to uncontrollable consequences like worsening of asthma and cardiovascular disease. The metric used to measure air quality is the air pollutant index (API). In Malaysia, machine learning (ML) techniques for PM (2.5 )have received less attention as the concentration is on predicting other air pollutants. To fill the research gap, this study focuses on correctly predicting PM (2.5) concentrations in the smart cities of Malaysia by comparing supervised ML techniques, which helps to mitigate its adverse effects. Methods In this paper, ML models for forecasting PM (2.5) concentrations were investigated on Malaysian air quality data sets from 2017 to 2018. The dataset was preprocessed by data cleaning and a normalization process. Next, it was reduced into an informative dataset with location and time factors in the feature extraction process. The dataset was fed into three supervised ML classifiers, which include random forest (RF), artificial neural network (ANN) and long short-term memory (LSTM). Finally, their output was evaluated using the confusion matrix and compared to identify the best model for the accurate prediction of PM (2.5). Results Overall, the experimental result shows an accuracy of 97.7% was obtained by the RF model in comparison with the accuracy of ANN (61.14%) and LSTM (61.77%) in predicting PM (2.5). Discussion RF performed well when compared with ANN and LSTM for the given data with minimum features. RF was able to reach good accuracy as the model learns from the random samples by using decision tree with the maximum vote on the predictions. F1000 Research Limited 2022-04-11 /pmc/articles/PMC9723408/ /pubmed/36531254 http://dx.doi.org/10.12688/f1000research.73166.1 Text en Copyright: © 2022 Palanichamy N et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Palanichamy, Naveen
Haw, Su-Cheng
S, Subramanian
Murugan, Rishanti
Govindasamy, Kuhaneswaran
Machine learning methods to predict particulate matter PM (2.5)
title Machine learning methods to predict particulate matter PM (2.5)
title_full Machine learning methods to predict particulate matter PM (2.5)
title_fullStr Machine learning methods to predict particulate matter PM (2.5)
title_full_unstemmed Machine learning methods to predict particulate matter PM (2.5)
title_short Machine learning methods to predict particulate matter PM (2.5)
title_sort machine learning methods to predict particulate matter pm (2.5)
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9723408/
https://www.ncbi.nlm.nih.gov/pubmed/36531254
http://dx.doi.org/10.12688/f1000research.73166.1
work_keys_str_mv AT palanichamynaveen machinelearningmethodstopredictparticulatematterpm25
AT hawsucheng machinelearningmethodstopredictparticulatematterpm25
AT ssubramanian machinelearningmethodstopredictparticulatematterpm25
AT muruganrishanti machinelearningmethodstopredictparticulatematterpm25
AT govindasamykuhaneswaran machinelearningmethodstopredictparticulatematterpm25