Cargando…

Error Prediction of Air Quality at Monitoring Stations Using Random Forest in a Total Error Framework

Instead of a flag valid/non-valid usually proposed in the quality control (QC) processes of air quality (AQ), we proposed a method that predicts the p-value of each observation as a value between 0 and 1. We based our error predictions on three approaches: the one proposed by the Working Group on Gu...

Descripción completa

Detalles Bibliográficos
Autores principales:	Lepioufle, Jean-Marie, Marsteen, Leif, Johnsrud, Mona
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8003348/ https://www.ncbi.nlm.nih.gov/pubmed/33808772 http://dx.doi.org/10.3390/s21062160

_version_	1783671668581859328
author	Lepioufle, Jean-Marie Marsteen, Leif Johnsrud, Mona
author_facet	Lepioufle, Jean-Marie Marsteen, Leif Johnsrud, Mona
author_sort	Lepioufle, Jean-Marie
collection	PubMed
description	Instead of a flag valid/non-valid usually proposed in the quality control (QC) processes of air quality (AQ), we proposed a method that predicts the p-value of each observation as a value between 0 and 1. We based our error predictions on three approaches: the one proposed by the Working Group on Guidance for the Demonstration of Equivalence (European Commission (2010)), the one proposed by Wager (Journal of [Formula: see text] , 15, 1625–1651 (2014)) and the one proposed by Lu (Journal of [Formula: see text] , 22, 1–41 (2021)). Total Error framework enables to differentiate the different errors: input, output, structural modeling and remnant. We thus theoretically described a one-site AQ prediction based on a multi-site network using Random Forest for regression in a Total Error framework. We demonstrated the methodology with a dataset of hourly nitrogen dioxide measured by a network of monitoring stations located in Oslo, Norway and implemented the error predictions for the three approaches. The results indicate that a simple one-site AQ prediction based on a multi-site network using Random Forest for regression provides moderate metrics for fixed stations. According to the diagnostic based on predictive qq-plot and among the three approaches used in this study, the approach proposed by Lu provides better error predictions. Furthermore, ensuring a high precision of the error prediction requires efforts on getting accurate input, output and prediction model and limiting our lack of knowledge about the “true” AQ phenomena. We put effort in quantifying each type of error involved in the error prediction to assess the error prediction model and further improving it in terms of performance and precision.
format	Online Article Text
id	pubmed-8003348
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-80033482021-03-28 Error Prediction of Air Quality at Monitoring Stations Using Random Forest in a Total Error Framework Lepioufle, Jean-Marie Marsteen, Leif Johnsrud, Mona Sensors (Basel) Article Instead of a flag valid/non-valid usually proposed in the quality control (QC) processes of air quality (AQ), we proposed a method that predicts the p-value of each observation as a value between 0 and 1. We based our error predictions on three approaches: the one proposed by the Working Group on Guidance for the Demonstration of Equivalence (European Commission (2010)), the one proposed by Wager (Journal of [Formula: see text] , 15, 1625–1651 (2014)) and the one proposed by Lu (Journal of [Formula: see text] , 22, 1–41 (2021)). Total Error framework enables to differentiate the different errors: input, output, structural modeling and remnant. We thus theoretically described a one-site AQ prediction based on a multi-site network using Random Forest for regression in a Total Error framework. We demonstrated the methodology with a dataset of hourly nitrogen dioxide measured by a network of monitoring stations located in Oslo, Norway and implemented the error predictions for the three approaches. The results indicate that a simple one-site AQ prediction based on a multi-site network using Random Forest for regression provides moderate metrics for fixed stations. According to the diagnostic based on predictive qq-plot and among the three approaches used in this study, the approach proposed by Lu provides better error predictions. Furthermore, ensuring a high precision of the error prediction requires efforts on getting accurate input, output and prediction model and limiting our lack of knowledge about the “true” AQ phenomena. We put effort in quantifying each type of error involved in the error prediction to assess the error prediction model and further improving it in terms of performance and precision. MDPI 2021-03-19 /pmc/articles/PMC8003348/ /pubmed/33808772 http://dx.doi.org/10.3390/s21062160 Text en © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Lepioufle, Jean-Marie Marsteen, Leif Johnsrud, Mona Error Prediction of Air Quality at Monitoring Stations Using Random Forest in a Total Error Framework
title	Error Prediction of Air Quality at Monitoring Stations Using Random Forest in a Total Error Framework
title_full	Error Prediction of Air Quality at Monitoring Stations Using Random Forest in a Total Error Framework
title_fullStr	Error Prediction of Air Quality at Monitoring Stations Using Random Forest in a Total Error Framework
title_full_unstemmed	Error Prediction of Air Quality at Monitoring Stations Using Random Forest in a Total Error Framework
title_short	Error Prediction of Air Quality at Monitoring Stations Using Random Forest in a Total Error Framework
title_sort	error prediction of air quality at monitoring stations using random forest in a total error framework
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8003348/ https://www.ncbi.nlm.nih.gov/pubmed/33808772 http://dx.doi.org/10.3390/s21062160
work_keys_str_mv	AT lepiouflejeanmarie errorpredictionofairqualityatmonitoringstationsusingrandomforestinatotalerrorframework AT marsteenleif errorpredictionofairqualityatmonitoringstationsusingrandomforestinatotalerrorframework AT johnsrudmona errorpredictionofairqualityatmonitoringstationsusingrandomforestinatotalerrorframework

Error Prediction of Air Quality at Monitoring Stations Using Random Forest in a Total Error Framework

Ejemplares similares