Cargando…

A Virtual Sensing Concept for Nitrogen and Phosphorus Monitoring Using Machine Learning Techniques

Harmful cyanobacterial bloom (HCB) is problematic for drinking water treatment, and some of its strains can produce toxins that significantly affect human health. To better control eutrophication and HCB, catchment managers need to continuously keep track of nitrogen (N) and phosphorus (P) in the wa...

Descripción completa

Detalles Bibliográficos
Autores principales: Paepae, Thulane, Bokoro, Pitshou N., Kyamakya, Kyandoghere
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9572788/
https://www.ncbi.nlm.nih.gov/pubmed/36236438
http://dx.doi.org/10.3390/s22197338
_version_ 1784810703978233856
author Paepae, Thulane
Bokoro, Pitshou N.
Kyamakya, Kyandoghere
author_facet Paepae, Thulane
Bokoro, Pitshou N.
Kyamakya, Kyandoghere
author_sort Paepae, Thulane
collection PubMed
description Harmful cyanobacterial bloom (HCB) is problematic for drinking water treatment, and some of its strains can produce toxins that significantly affect human health. To better control eutrophication and HCB, catchment managers need to continuously keep track of nitrogen (N) and phosphorus (P) in the water bodies. However, the high-frequency monitoring of these water quality indicators is not economical. In these cases, machine learning techniques may serve as viable alternatives since they can learn directly from the available surrogate data. In the present work, a random forest, extremely randomized trees (ET), extreme gradient boosting, k-nearest neighbors, a light gradient boosting machine, and bagging regressor-based virtual sensors were used to predict N and P in two catchments with contrasting land uses. The effect of data scaling and missing value imputation were also assessed, while the Shapley additive explanations were used to rank feature importance. A specification book, sensitivity analysis, and best practices for developing virtual sensors are discussed. Results show that ET, MinMax scaler, and a multivariate imputer were the best predictive model, scaler, and imputer, respectively. The highest predictive performance, reported in terms of R(2), was 97% in the rural catchment and 82% in an urban catchment.
format Online
Article
Text
id pubmed-9572788
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-95727882022-10-17 A Virtual Sensing Concept for Nitrogen and Phosphorus Monitoring Using Machine Learning Techniques Paepae, Thulane Bokoro, Pitshou N. Kyamakya, Kyandoghere Sensors (Basel) Article Harmful cyanobacterial bloom (HCB) is problematic for drinking water treatment, and some of its strains can produce toxins that significantly affect human health. To better control eutrophication and HCB, catchment managers need to continuously keep track of nitrogen (N) and phosphorus (P) in the water bodies. However, the high-frequency monitoring of these water quality indicators is not economical. In these cases, machine learning techniques may serve as viable alternatives since they can learn directly from the available surrogate data. In the present work, a random forest, extremely randomized trees (ET), extreme gradient boosting, k-nearest neighbors, a light gradient boosting machine, and bagging regressor-based virtual sensors were used to predict N and P in two catchments with contrasting land uses. The effect of data scaling and missing value imputation were also assessed, while the Shapley additive explanations were used to rank feature importance. A specification book, sensitivity analysis, and best practices for developing virtual sensors are discussed. Results show that ET, MinMax scaler, and a multivariate imputer were the best predictive model, scaler, and imputer, respectively. The highest predictive performance, reported in terms of R(2), was 97% in the rural catchment and 82% in an urban catchment. MDPI 2022-09-27 /pmc/articles/PMC9572788/ /pubmed/36236438 http://dx.doi.org/10.3390/s22197338 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Paepae, Thulane
Bokoro, Pitshou N.
Kyamakya, Kyandoghere
A Virtual Sensing Concept for Nitrogen and Phosphorus Monitoring Using Machine Learning Techniques
title A Virtual Sensing Concept for Nitrogen and Phosphorus Monitoring Using Machine Learning Techniques
title_full A Virtual Sensing Concept for Nitrogen and Phosphorus Monitoring Using Machine Learning Techniques
title_fullStr A Virtual Sensing Concept for Nitrogen and Phosphorus Monitoring Using Machine Learning Techniques
title_full_unstemmed A Virtual Sensing Concept for Nitrogen and Phosphorus Monitoring Using Machine Learning Techniques
title_short A Virtual Sensing Concept for Nitrogen and Phosphorus Monitoring Using Machine Learning Techniques
title_sort virtual sensing concept for nitrogen and phosphorus monitoring using machine learning techniques
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9572788/
https://www.ncbi.nlm.nih.gov/pubmed/36236438
http://dx.doi.org/10.3390/s22197338
work_keys_str_mv AT paepaethulane avirtualsensingconceptfornitrogenandphosphorusmonitoringusingmachinelearningtechniques
AT bokoropitshoun avirtualsensingconceptfornitrogenandphosphorusmonitoringusingmachinelearningtechniques
AT kyamakyakyandoghere avirtualsensingconceptfornitrogenandphosphorusmonitoringusingmachinelearningtechniques
AT paepaethulane virtualsensingconceptfornitrogenandphosphorusmonitoringusingmachinelearningtechniques
AT bokoropitshoun virtualsensingconceptfornitrogenandphosphorusmonitoringusingmachinelearningtechniques
AT kyamakyakyandoghere virtualsensingconceptfornitrogenandphosphorusmonitoringusingmachinelearningtechniques