Cargando…

PM2.5 Concentration Prediction Model: A CNN–RF Ensemble Framework

Although many machine learning methods have been widely used to predict PM2.5 concentrations, these single or hybrid methods still have some shortcomings. This study integrated the advantages of convolutional neural network (CNN) feature extraction and the regression ability of random forest (RF) to...

Descripción completa

Detalles Bibliográficos
Autores principales:	Chen, Mei-Hsin, Chen, Yao-Chung, Chou, Tien-Yin, Ning, Fang-Shii
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10002213/ https://www.ncbi.nlm.nih.gov/pubmed/36901088 http://dx.doi.org/10.3390/ijerph20054077

_version_	1784904335411380224
author	Chen, Mei-Hsin Chen, Yao-Chung Chou, Tien-Yin Ning, Fang-Shii
author_facet	Chen, Mei-Hsin Chen, Yao-Chung Chou, Tien-Yin Ning, Fang-Shii
author_sort	Chen, Mei-Hsin
collection	PubMed
description	Although many machine learning methods have been widely used to predict PM2.5 concentrations, these single or hybrid methods still have some shortcomings. This study integrated the advantages of convolutional neural network (CNN) feature extraction and the regression ability of random forest (RF) to propose a novel CNN-RF ensemble framework for PM2.5 concentration modeling. The observational data from 13 monitoring stations in Kaohsiung in 2021 were selected for model training and testing. First, CNN was implemented to extract key meteorological and pollution data. Subsequently, the RF algorithm was employed to train the model with five input factors, namely the extracted features from the CNN and spatiotemporal factors, including the day of the year, the hour of the day, latitude, and longitude. Independent observations from two stations were used to evaluate the models. The findings demonstrated that the proposed CNN–RF model had better modeling capability compared with the independent CNN and RF models: the average improvements in root mean square error (RMSE) and mean absolute error (MAE) ranged from 8.10% to 11.11%, respectively. In addition, the proposed CNN–RF hybrid model has fewer excess residuals at thresholds of 10 μg/m(3), 20 μg/m(3), and 30 μg/m(3). The results revealed that the proposed CNN–RF ensemble framework is a stable, reliable, and accurate method that can generate superior results compared with the single CNN and RF methods. The proposed method could be a valuable reference for readers and may inspire researchers to develop even more effective methods for air pollution modeling. This research has important implications for air pollution research, data analysis, model estimation, and machine learning.
format	Online Article Text
id	pubmed-10002213
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-100022132023-03-11 PM2.5 Concentration Prediction Model: A CNN–RF Ensemble Framework Chen, Mei-Hsin Chen, Yao-Chung Chou, Tien-Yin Ning, Fang-Shii Int J Environ Res Public Health Article Although many machine learning methods have been widely used to predict PM2.5 concentrations, these single or hybrid methods still have some shortcomings. This study integrated the advantages of convolutional neural network (CNN) feature extraction and the regression ability of random forest (RF) to propose a novel CNN-RF ensemble framework for PM2.5 concentration modeling. The observational data from 13 monitoring stations in Kaohsiung in 2021 were selected for model training and testing. First, CNN was implemented to extract key meteorological and pollution data. Subsequently, the RF algorithm was employed to train the model with five input factors, namely the extracted features from the CNN and spatiotemporal factors, including the day of the year, the hour of the day, latitude, and longitude. Independent observations from two stations were used to evaluate the models. The findings demonstrated that the proposed CNN–RF model had better modeling capability compared with the independent CNN and RF models: the average improvements in root mean square error (RMSE) and mean absolute error (MAE) ranged from 8.10% to 11.11%, respectively. In addition, the proposed CNN–RF hybrid model has fewer excess residuals at thresholds of 10 μg/m(3), 20 μg/m(3), and 30 μg/m(3). The results revealed that the proposed CNN–RF ensemble framework is a stable, reliable, and accurate method that can generate superior results compared with the single CNN and RF methods. The proposed method could be a valuable reference for readers and may inspire researchers to develop even more effective methods for air pollution modeling. This research has important implications for air pollution research, data analysis, model estimation, and machine learning. MDPI 2023-02-24 /pmc/articles/PMC10002213/ /pubmed/36901088 http://dx.doi.org/10.3390/ijerph20054077 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Chen, Mei-Hsin Chen, Yao-Chung Chou, Tien-Yin Ning, Fang-Shii PM2.5 Concentration Prediction Model: A CNN–RF Ensemble Framework
title	PM2.5 Concentration Prediction Model: A CNN–RF Ensemble Framework
title_full	PM2.5 Concentration Prediction Model: A CNN–RF Ensemble Framework
title_fullStr	PM2.5 Concentration Prediction Model: A CNN–RF Ensemble Framework
title_full_unstemmed	PM2.5 Concentration Prediction Model: A CNN–RF Ensemble Framework
title_short	PM2.5 Concentration Prediction Model: A CNN–RF Ensemble Framework
title_sort	pm2.5 concentration prediction model: a cnn–rf ensemble framework
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10002213/ https://www.ncbi.nlm.nih.gov/pubmed/36901088 http://dx.doi.org/10.3390/ijerph20054077
work_keys_str_mv	AT chenmeihsin pm25concentrationpredictionmodelacnnrfensembleframework AT chenyaochung pm25concentrationpredictionmodelacnnrfensembleframework AT choutienyin pm25concentrationpredictionmodelacnnrfensembleframework AT ningfangshii pm25concentrationpredictionmodelacnnrfensembleframework

PM2.5 Concentration Prediction Model: A CNN–RF Ensemble Framework

Ejemplares similares