Predicting Heavy Metal Concentrations in Shallow Aquifer Systems Based on Low-Cost Physiochemical Parameters Using Machine Learning Techniques

Bibliographic Details
Main authors: Huynh, Thi-Minh-Trang, Ni, Chuen-Fa, Su, Yu-Sheng, Nguyen, Vo-Chau-Ngan, Lee, I-Hsien, Lin, Chi-Ping, Nguyen, Hoang-Hiep
Format: Online Article Text
Language: English
Published: MDPI 2022
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9566676/
https://www.ncbi.nlm.nih.gov/pubmed/36231480
http://dx.doi.org/10.3390/ijerph191912180
author Huynh, Thi-Minh-Trang
Ni, Chuen-Fa
Su, Yu-Sheng
Nguyen, Vo-Chau-Ngan
Lee, I-Hsien
Lin, Chi-Ping
Nguyen, Hoang-Hiep
collection PubMed
description Monitoring ex-situ water parameters, namely heavy metals, needs time and laboratory work for water sampling and analytical processes, which can retard the response to ongoing pollution events. Previous studies have successfully applied fast modeling techniques such as artificial intelligence algorithms to predict heavy metals. However, neither low-cost feature predictability nor explainability assessments have been considered in the modeling process. This study proposes a reliable and explainable framework to find an effective model and feature set to predict heavy metals in groundwater. The integrated assessment framework has four steps: model selection uncertainty, feature selection uncertainty, predictive uncertainty, and model interpretability. The results show that Random Forest is the most suitable model, and quick-measure parameters can be used as predictors for arsenic (As), iron (Fe), and manganese (Mn). Although the model performance is auspicious, it likely produces significant uncertainties. The findings also demonstrate that arsenic is related to nutrients and spatial distribution, while Fe and Mn are affected by spatial distribution and salinity. Some limitations and suggestions are also discussed to improve the prediction accuracy and interpretability.
format Online
Article
Text
id pubmed-9566676
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-9566676 2022-10-15 Int J Environ Res Public Health Article MDPI 2022-09-26 /pmc/articles/PMC9566676/ /pubmed/36231480 http://dx.doi.org/10.3390/ijerph191912180 Text en © 2022 by the authors. Licensee MDPI, Basel, Switzerland. https://creativecommons.org/licenses/by/4.0/
This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
title Predicting Heavy Metal Concentrations in Shallow Aquifer Systems Based on Low-Cost Physiochemical Parameters Using Machine Learning Techniques
topic Article