Cargando…

Kriging-Based Land-Use Regression Models That Use Machine Learning Algorithms to Estimate the Monthly BTEX Concentration

This paper uses machine learning to refine a Land-use Regression (LUR) model and to estimate the spatial–temporal variation in BTEX concentrations in Kaohsiung, Taiwan. Using the Taiwanese Environmental Protection Agency (EPA) data of BTEX (benzene, toluene, ethylbenzene, and xylenes) concentrations...

Descripción completa

Detalles Bibliográficos
Autores principales: Hsu, Chin-Yu, Zeng, Yu-Ting, Chen, Yu-Cheng, Chen, Mu-Jean, Lung, Shih-Chun Candice, Wu, Chih-Da
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7579284/
https://www.ncbi.nlm.nih.gov/pubmed/32977562
http://dx.doi.org/10.3390/ijerph17196956
_version_ 1783598554417201152
author Hsu, Chin-Yu
Zeng, Yu-Ting
Chen, Yu-Cheng
Chen, Mu-Jean
Lung, Shih-Chun Candice
Wu, Chih-Da
author_facet Hsu, Chin-Yu
Zeng, Yu-Ting
Chen, Yu-Cheng
Chen, Mu-Jean
Lung, Shih-Chun Candice
Wu, Chih-Da
author_sort Hsu, Chin-Yu
collection PubMed
description This paper uses machine learning to refine a Land-use Regression (LUR) model and to estimate the spatial–temporal variation in BTEX concentrations in Kaohsiung, Taiwan. Using the Taiwanese Environmental Protection Agency (EPA) data of BTEX (benzene, toluene, ethylbenzene, and xylenes) concentrations from 2015 to 2018, which includes local emission sources as a result of Asian cultural characteristics, a new LUR model is developed. The 2019 data was then used as external data to verify the reliability of the model. We used hybrid Kriging-land-use regression (Hybrid Kriging-LUR) models, geographically weighted regression (GWR), and two machine learning algorithms—random forest (RF) and extreme gradient boosting (XGBoost)—for model development. Initially, the proposed Hybrid Kriging-LUR models explained each variation in BTEX from 37% to 52%. Using machine learning algorithms (XGBoost) increased the explanatory power of the models for each BTEX, between 61% and 79%. This study compared each combination of the Hybrid Kriging-LUR model and (i) GWR, (ii) RF, and (iii) XGBoost algorithm to estimate the spatiotemporal variation in BTEX concentration. It is shown that a combination of Hybrid Kriging-LUR and the XGBoost algorithm gives better performance than other integrated methods.
format Online
Article
Text
id pubmed-7579284
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75792842020-10-29 Kriging-Based Land-Use Regression Models That Use Machine Learning Algorithms to Estimate the Monthly BTEX Concentration Hsu, Chin-Yu Zeng, Yu-Ting Chen, Yu-Cheng Chen, Mu-Jean Lung, Shih-Chun Candice Wu, Chih-Da Int J Environ Res Public Health Article This paper uses machine learning to refine a Land-use Regression (LUR) model and to estimate the spatial–temporal variation in BTEX concentrations in Kaohsiung, Taiwan. Using the Taiwanese Environmental Protection Agency (EPA) data of BTEX (benzene, toluene, ethylbenzene, and xylenes) concentrations from 2015 to 2018, which includes local emission sources as a result of Asian cultural characteristics, a new LUR model is developed. The 2019 data was then used as external data to verify the reliability of the model. We used hybrid Kriging-land-use regression (Hybrid Kriging-LUR) models, geographically weighted regression (GWR), and two machine learning algorithms—random forest (RF) and extreme gradient boosting (XGBoost)—for model development. Initially, the proposed Hybrid Kriging-LUR models explained each variation in BTEX from 37% to 52%. Using machine learning algorithms (XGBoost) increased the explanatory power of the models for each BTEX, between 61% and 79%. This study compared each combination of the Hybrid Kriging-LUR model and (i) GWR, (ii) RF, and (iii) XGBoost algorithm to estimate the spatiotemporal variation in BTEX concentration. It is shown that a combination of Hybrid Kriging-LUR and the XGBoost algorithm gives better performance than other integrated methods. MDPI 2020-09-23 2020-10 /pmc/articles/PMC7579284/ /pubmed/32977562 http://dx.doi.org/10.3390/ijerph17196956 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Hsu, Chin-Yu
Zeng, Yu-Ting
Chen, Yu-Cheng
Chen, Mu-Jean
Lung, Shih-Chun Candice
Wu, Chih-Da
Kriging-Based Land-Use Regression Models That Use Machine Learning Algorithms to Estimate the Monthly BTEX Concentration
title Kriging-Based Land-Use Regression Models That Use Machine Learning Algorithms to Estimate the Monthly BTEX Concentration
title_full Kriging-Based Land-Use Regression Models That Use Machine Learning Algorithms to Estimate the Monthly BTEX Concentration
title_fullStr Kriging-Based Land-Use Regression Models That Use Machine Learning Algorithms to Estimate the Monthly BTEX Concentration
title_full_unstemmed Kriging-Based Land-Use Regression Models That Use Machine Learning Algorithms to Estimate the Monthly BTEX Concentration
title_short Kriging-Based Land-Use Regression Models That Use Machine Learning Algorithms to Estimate the Monthly BTEX Concentration
title_sort kriging-based land-use regression models that use machine learning algorithms to estimate the monthly btex concentration
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7579284/
https://www.ncbi.nlm.nih.gov/pubmed/32977562
http://dx.doi.org/10.3390/ijerph17196956
work_keys_str_mv AT hsuchinyu krigingbasedlanduseregressionmodelsthatusemachinelearningalgorithmstoestimatethemonthlybtexconcentration
AT zengyuting krigingbasedlanduseregressionmodelsthatusemachinelearningalgorithmstoestimatethemonthlybtexconcentration
AT chenyucheng krigingbasedlanduseregressionmodelsthatusemachinelearningalgorithmstoestimatethemonthlybtexconcentration
AT chenmujean krigingbasedlanduseregressionmodelsthatusemachinelearningalgorithmstoestimatethemonthlybtexconcentration
AT lungshihchuncandice krigingbasedlanduseregressionmodelsthatusemachinelearningalgorithmstoestimatethemonthlybtexconcentration
AT wuchihda krigingbasedlanduseregressionmodelsthatusemachinelearningalgorithmstoestimatethemonthlybtexconcentration