Cargando…

Water Quality Prediction Based on Machine Learning and Comprehensive Weighting Methods

In the context of escalating global environmental concerns, the importance of preserving water resources and upholding ecological equilibrium has become increasingly apparent. As a result, the monitoring and prediction of water quality have emerged as vital tasks in achieving these objectives. Howev...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Xianhe, Li, Ying, Qiao, Qian, Tavares, Adriano, Liang, Yanchun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10453428/
https://www.ncbi.nlm.nih.gov/pubmed/37628216
http://dx.doi.org/10.3390/e25081186
_version_ 1785095933768237056
author Wang, Xianhe
Li, Ying
Qiao, Qian
Tavares, Adriano
Liang, Yanchun
author_facet Wang, Xianhe
Li, Ying
Qiao, Qian
Tavares, Adriano
Liang, Yanchun
author_sort Wang, Xianhe
collection PubMed
description In the context of escalating global environmental concerns, the importance of preserving water resources and upholding ecological equilibrium has become increasingly apparent. As a result, the monitoring and prediction of water quality have emerged as vital tasks in achieving these objectives. However, ensuring the accuracy and dependability of water quality prediction has proven to be a challenging endeavor. To address this issue, this study proposes a comprehensive weight-based approach that combines entropy weighting with the Pearson correlation coefficient to select crucial features in water quality prediction. This approach effectively considers both feature correlation and information content, avoiding excessive reliance on a single criterion for feature selection. Through the utilization of this comprehensive approach, a comprehensive evaluation of the contribution and importance of the features was achieved, thereby minimizing subjective bias and uncertainty. By striking a balance among various factors, features with stronger correlation and greater information content can be selected, leading to improved accuracy and robustness in the feature-selection process. Furthermore, this study explored several machine learning models for water quality prediction, including Support Vector Machines (SVMs), Multilayer Perceptron (MLP), Random Forest (RF), XGBoost, and Long Short-Term Memory (LSTM). SVM exhibited commendable performance in predicting Dissolved Oxygen (DO), showcasing excellent generalization capabilities and high prediction accuracy. MLP demonstrated its strength in nonlinear modeling and performed well in predicting multiple water quality parameters. Conversely, the RF and XGBoost models exhibited relatively inferior performance in water quality prediction. In contrast, the LSTM model, a recurrent neural network specialized in processing time series data, demonstrated exceptional abilities in water quality prediction. It effectively captured the dynamic patterns present in time series data, offering stable and accurate predictions for various water quality parameters.
format Online
Article
Text
id pubmed-10453428
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-104534282023-08-26 Water Quality Prediction Based on Machine Learning and Comprehensive Weighting Methods Wang, Xianhe Li, Ying Qiao, Qian Tavares, Adriano Liang, Yanchun Entropy (Basel) Article In the context of escalating global environmental concerns, the importance of preserving water resources and upholding ecological equilibrium has become increasingly apparent. As a result, the monitoring and prediction of water quality have emerged as vital tasks in achieving these objectives. However, ensuring the accuracy and dependability of water quality prediction has proven to be a challenging endeavor. To address this issue, this study proposes a comprehensive weight-based approach that combines entropy weighting with the Pearson correlation coefficient to select crucial features in water quality prediction. This approach effectively considers both feature correlation and information content, avoiding excessive reliance on a single criterion for feature selection. Through the utilization of this comprehensive approach, a comprehensive evaluation of the contribution and importance of the features was achieved, thereby minimizing subjective bias and uncertainty. By striking a balance among various factors, features with stronger correlation and greater information content can be selected, leading to improved accuracy and robustness in the feature-selection process. Furthermore, this study explored several machine learning models for water quality prediction, including Support Vector Machines (SVMs), Multilayer Perceptron (MLP), Random Forest (RF), XGBoost, and Long Short-Term Memory (LSTM). SVM exhibited commendable performance in predicting Dissolved Oxygen (DO), showcasing excellent generalization capabilities and high prediction accuracy. MLP demonstrated its strength in nonlinear modeling and performed well in predicting multiple water quality parameters. Conversely, the RF and XGBoost models exhibited relatively inferior performance in water quality prediction. In contrast, the LSTM model, a recurrent neural network specialized in processing time series data, demonstrated exceptional abilities in water quality prediction. It effectively captured the dynamic patterns present in time series data, offering stable and accurate predictions for various water quality parameters. MDPI 2023-08-09 /pmc/articles/PMC10453428/ /pubmed/37628216 http://dx.doi.org/10.3390/e25081186 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Wang, Xianhe
Li, Ying
Qiao, Qian
Tavares, Adriano
Liang, Yanchun
Water Quality Prediction Based on Machine Learning and Comprehensive Weighting Methods
title Water Quality Prediction Based on Machine Learning and Comprehensive Weighting Methods
title_full Water Quality Prediction Based on Machine Learning and Comprehensive Weighting Methods
title_fullStr Water Quality Prediction Based on Machine Learning and Comprehensive Weighting Methods
title_full_unstemmed Water Quality Prediction Based on Machine Learning and Comprehensive Weighting Methods
title_short Water Quality Prediction Based on Machine Learning and Comprehensive Weighting Methods
title_sort water quality prediction based on machine learning and comprehensive weighting methods
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10453428/
https://www.ncbi.nlm.nih.gov/pubmed/37628216
http://dx.doi.org/10.3390/e25081186
work_keys_str_mv AT wangxianhe waterqualitypredictionbasedonmachinelearningandcomprehensiveweightingmethods
AT liying waterqualitypredictionbasedonmachinelearningandcomprehensiveweightingmethods
AT qiaoqian waterqualitypredictionbasedonmachinelearningandcomprehensiveweightingmethods
AT tavaresadriano waterqualitypredictionbasedonmachinelearningandcomprehensiveweightingmethods
AT liangyanchun waterqualitypredictionbasedonmachinelearningandcomprehensiveweightingmethods