
Improving Water Quality Index Prediction Using Regression Learning Models

Rivers are the main sources of freshwater supply for the world population. However, many economic activities contribute to river water pollution. River water quality can be monitored using various parameters, such as the pH level, dissolved oxygen, total suspended solids, and the chemical properties...


Bibliographic Details
Main authors: Mohd Zebaral Hoque, Jesmeen, Ab. Aziz, Nor Azlina, Alelyani, Salem, Mohana, Mohamed, Hosain, Maruf
Format: Online Article Text
Language: English
Published: MDPI 2022
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9602497/
https://www.ncbi.nlm.nih.gov/pubmed/36294286
http://dx.doi.org/10.3390/ijerph192013702
author Mohd Zebaral Hoque, Jesmeen
Ab. Aziz, Nor Azlina
Alelyani, Salem
Mohana, Mohamed
Hosain, Maruf
collection PubMed
description Rivers are the main sources of freshwater supply for the world's population. However, many economic activities contribute to river water pollution. River water quality can be monitored using various parameters, such as the pH level, dissolved oxygen, total suspended solids, and chemical properties. Analyzing the trends and patterns of these parameters enables the prediction of water quality, so that relevant authorities can take proactive measures to prevent water pollution and predict the effectiveness of water restoration measures. Machine learning regression algorithms can be applied for this purpose. Here, eight machine learning regression techniques (decision tree regression, linear regression, ridge, Lasso, support vector regression, random forest regression, extra tree regression, and the artificial neural network) are applied to water quality index prediction. Historical data from Indian rivers are adopted for this study. The data cover six water parameters. Twelve other features are then derived from the original six parameters. The performances of the models using different algorithms and sets of features are compared. The derived water quality rating scale features are found to contribute to better regression models, while linear regression and ridge offer the best performance. The best mean squared error achieved is 0, with a correlation coefficient of 1.
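The two evaluation metrics named in the description, mean squared error and the correlation coefficient, can be illustrated with a minimal sketch. It assumes the correlation coefficient is Pearson's r, which the record does not state, and it uses made-up index values rather than data from the study. The function and variable names are hypothetical. It only shows why a model whose predictions match the observations exactly scores an error of 0 and a correlation of 1.

```python
from math import sqrt


def mean_squared_error(y_true, y_pred):
    """Average of the squared differences between observed and predicted values."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def pearson_r(y_true, y_pred):
    """Pearson correlation coefficient between observed and predicted values."""
    n = len(y_true)
    if n != len(y_pred) or n < 2:
        raise ValueError("inputs must have equal length of at least 2")
    mean_t = sum(y_true) / n
    mean_p = sum(y_pred) / n
    cov = sum((t - mean_t) * (p - mean_p) for t, p in zip(y_true, y_pred))
    var_t = sum((t - mean_t) ** 2 for t in y_true)
    var_p = sum((p - mean_p) ** 2 for p in y_pred)
    if var_t == 0 or var_p == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / sqrt(var_t * var_p)


# Hypothetical water quality index values, not taken from the study.
wqi_observed = [72.5, 64.0, 88.25, 51.0]
wqi_perfect = list(wqi_observed)           # predictions identical to observations
wqi_shifted = [v + 2.0 for v in wqi_observed]  # constant offset of 2 index points

mse_perfect = mean_squared_error(wqi_observed, wqi_perfect)
r_perfect = pearson_r(wqi_observed, wqi_perfect)
mse_shifted = mean_squared_error(wqi_observed, wqi_shifted)
r_shifted = pearson_r(wqi_observed, wqi_shifted)
```

The shifted case is included because a correlation of 1 alone does not imply accurate predictions: a constant offset leaves the correlation unchanged while the mean squared error becomes positive, which is why the two metrics are reported together.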
format Online
Article
Text
id pubmed-9602497
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-9602497 2022-10-27 Int J Environ Res Public Health Article MDPI 2022-10-21 /pmc/articles/PMC9602497/ /pubmed/36294286 http://dx.doi.org/10.3390/ijerph192013702 Text en © 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
title Improving Water Quality Index Prediction Using Regression Learning Models
topic Article