Cargando…
A Virtual Sensing Concept for Nitrogen and Phosphorus Monitoring Using Machine Learning Techniques
Harmful cyanobacterial bloom (HCB) is problematic for drinking water treatment, and some of its strains can produce toxins that significantly affect human health. To better control eutrophication and HCB, catchment managers need to continuously keep track of nitrogen (N) and phosphorus (P) in the wa...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9572788/ https://www.ncbi.nlm.nih.gov/pubmed/36236438 http://dx.doi.org/10.3390/s22197338 |
_version_ | 1784810703978233856 |
---|---|
author | Paepae, Thulane Bokoro, Pitshou N. Kyamakya, Kyandoghere |
author_facet | Paepae, Thulane Bokoro, Pitshou N. Kyamakya, Kyandoghere |
author_sort | Paepae, Thulane |
collection | PubMed |
description | Harmful cyanobacterial bloom (HCB) is problematic for drinking water treatment, and some of its strains can produce toxins that significantly affect human health. To better control eutrophication and HCB, catchment managers need to continuously keep track of nitrogen (N) and phosphorus (P) in the water bodies. However, the high-frequency monitoring of these water quality indicators is not economical. In these cases, machine learning techniques may serve as viable alternatives since they can learn directly from the available surrogate data. In the present work, a random forest, extremely randomized trees (ET), extreme gradient boosting, k-nearest neighbors, a light gradient boosting machine, and bagging regressor-based virtual sensors were used to predict N and P in two catchments with contrasting land uses. The effect of data scaling and missing value imputation were also assessed, while the Shapley additive explanations were used to rank feature importance. A specification book, sensitivity analysis, and best practices for developing virtual sensors are discussed. Results show that ET, MinMax scaler, and a multivariate imputer were the best predictive model, scaler, and imputer, respectively. The highest predictive performance, reported in terms of R(2), was 97% in the rural catchment and 82% in an urban catchment. |
format | Online Article Text |
id | pubmed-9572788 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-95727882022-10-17 A Virtual Sensing Concept for Nitrogen and Phosphorus Monitoring Using Machine Learning Techniques Paepae, Thulane Bokoro, Pitshou N. Kyamakya, Kyandoghere Sensors (Basel) Article Harmful cyanobacterial bloom (HCB) is problematic for drinking water treatment, and some of its strains can produce toxins that significantly affect human health. To better control eutrophication and HCB, catchment managers need to continuously keep track of nitrogen (N) and phosphorus (P) in the water bodies. However, the high-frequency monitoring of these water quality indicators is not economical. In these cases, machine learning techniques may serve as viable alternatives since they can learn directly from the available surrogate data. In the present work, a random forest, extremely randomized trees (ET), extreme gradient boosting, k-nearest neighbors, a light gradient boosting machine, and bagging regressor-based virtual sensors were used to predict N and P in two catchments with contrasting land uses. The effect of data scaling and missing value imputation were also assessed, while the Shapley additive explanations were used to rank feature importance. A specification book, sensitivity analysis, and best practices for developing virtual sensors are discussed. Results show that ET, MinMax scaler, and a multivariate imputer were the best predictive model, scaler, and imputer, respectively. The highest predictive performance, reported in terms of R(2), was 97% in the rural catchment and 82% in an urban catchment. MDPI 2022-09-27 /pmc/articles/PMC9572788/ /pubmed/36236438 http://dx.doi.org/10.3390/s22197338 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Paepae, Thulane Bokoro, Pitshou N. Kyamakya, Kyandoghere A Virtual Sensing Concept for Nitrogen and Phosphorus Monitoring Using Machine Learning Techniques |
title | A Virtual Sensing Concept for Nitrogen and Phosphorus Monitoring Using Machine Learning Techniques |
title_full | A Virtual Sensing Concept for Nitrogen and Phosphorus Monitoring Using Machine Learning Techniques |
title_fullStr | A Virtual Sensing Concept for Nitrogen and Phosphorus Monitoring Using Machine Learning Techniques |
title_full_unstemmed | A Virtual Sensing Concept for Nitrogen and Phosphorus Monitoring Using Machine Learning Techniques |
title_short | A Virtual Sensing Concept for Nitrogen and Phosphorus Monitoring Using Machine Learning Techniques |
title_sort | virtual sensing concept for nitrogen and phosphorus monitoring using machine learning techniques |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9572788/ https://www.ncbi.nlm.nih.gov/pubmed/36236438 http://dx.doi.org/10.3390/s22197338 |
work_keys_str_mv | AT paepaethulane avirtualsensingconceptfornitrogenandphosphorusmonitoringusingmachinelearningtechniques AT bokoropitshoun avirtualsensingconceptfornitrogenandphosphorusmonitoringusingmachinelearningtechniques AT kyamakyakyandoghere avirtualsensingconceptfornitrogenandphosphorusmonitoringusingmachinelearningtechniques AT paepaethulane virtualsensingconceptfornitrogenandphosphorusmonitoringusingmachinelearningtechniques AT bokoropitshoun virtualsensingconceptfornitrogenandphosphorusmonitoringusingmachinelearningtechniques AT kyamakyakyandoghere virtualsensingconceptfornitrogenandphosphorusmonitoringusingmachinelearningtechniques |