Cargando…

PyLEnM: A Machine Learning Framework for Long-Term Groundwater Contamination Monitoring Strategies

[Image: see text] In this study, we have developed a comprehensive machine learning (ML) framework for long-term groundwater contamination monitoring as the Python package PyLEnM (Python for Long-term Environmental Monitoring). PyLEnM aims to establish the seamless data-to-ML pipeline with various u...

Descripción completa

Detalles Bibliográficos
Autores principales: Meray, Aurelien O., Sturla, Savannah, Siddiquee, Masudur R., Serata, Rebecca, Uhlemann, Sebastian, Gonzalez-Raymat, Hansell, Denham, Miles, Upadhyay, Himanshu, Lagos, Leonel E., Eddy-Dilek, Carol, Wainwright, Haruko M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2022
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9069689/
https://www.ncbi.nlm.nih.gov/pubmed/35427133
http://dx.doi.org/10.1021/acs.est.1c07440
_version_ 1784700484414603264
author Meray, Aurelien O.
Sturla, Savannah
Siddiquee, Masudur R.
Serata, Rebecca
Uhlemann, Sebastian
Gonzalez-Raymat, Hansell
Denham, Miles
Upadhyay, Himanshu
Lagos, Leonel E.
Eddy-Dilek, Carol
Wainwright, Haruko M.
author_facet Meray, Aurelien O.
Sturla, Savannah
Siddiquee, Masudur R.
Serata, Rebecca
Uhlemann, Sebastian
Gonzalez-Raymat, Hansell
Denham, Miles
Upadhyay, Himanshu
Lagos, Leonel E.
Eddy-Dilek, Carol
Wainwright, Haruko M.
author_sort Meray, Aurelien O.
collection PubMed
description [Image: see text] In this study, we have developed a comprehensive machine learning (ML) framework for long-term groundwater contamination monitoring as the Python package PyLEnM (Python for Long-term Environmental Monitoring). PyLEnM aims to establish the seamless data-to-ML pipeline with various utility functions, such as quality assurance and quality control (QA/QC), coincident/colocated data identification, the automated ingestion and processing of publicly available spatial data layers, and novel data summarization/visualization. The key ML innovations include (1) time series/multianalyte clustering to find the well groups that have similar groundwater dynamics and to inform spatial interpolation and well optimization, (2) the automated model selection and parameter tuning, comparing multiple regression models for spatial interpolation, (3) the proxy-based spatial interpolation method by including spatial data layers or in situ measurable variables as predictors for contaminant concentrations and groundwater levels, and (4) the new well optimization algorithm to identify the most effective subset of wells for maintaining the spatial interpolation ability for long-term monitoring. We demonstrate our methodology using the monitoring data at the Savannah River Site F-Area. Through this open-source PyLEnM package, we aim to improve the transparency of data analytics at contaminated sites, empowering concerned citizens as well as improving public relations.
format Online
Article
Text
id pubmed-9069689
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-90696892022-05-06 PyLEnM: A Machine Learning Framework for Long-Term Groundwater Contamination Monitoring Strategies Meray, Aurelien O. Sturla, Savannah Siddiquee, Masudur R. Serata, Rebecca Uhlemann, Sebastian Gonzalez-Raymat, Hansell Denham, Miles Upadhyay, Himanshu Lagos, Leonel E. Eddy-Dilek, Carol Wainwright, Haruko M. Environ Sci Technol [Image: see text] In this study, we have developed a comprehensive machine learning (ML) framework for long-term groundwater contamination monitoring as the Python package PyLEnM (Python for Long-term Environmental Monitoring). PyLEnM aims to establish the seamless data-to-ML pipeline with various utility functions, such as quality assurance and quality control (QA/QC), coincident/colocated data identification, the automated ingestion and processing of publicly available spatial data layers, and novel data summarization/visualization. The key ML innovations include (1) time series/multianalyte clustering to find the well groups that have similar groundwater dynamics and to inform spatial interpolation and well optimization, (2) the automated model selection and parameter tuning, comparing multiple regression models for spatial interpolation, (3) the proxy-based spatial interpolation method by including spatial data layers or in situ measurable variables as predictors for contaminant concentrations and groundwater levels, and (4) the new well optimization algorithm to identify the most effective subset of wells for maintaining the spatial interpolation ability for long-term monitoring. We demonstrate our methodology using the monitoring data at the Savannah River Site F-Area. Through this open-source PyLEnM package, we aim to improve the transparency of data analytics at contaminated sites, empowering concerned citizens as well as improving public relations. American Chemical Society 2022-04-15 2022-05-03 /pmc/articles/PMC9069689/ /pubmed/35427133 http://dx.doi.org/10.1021/acs.est.1c07440 Text en © 2022 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by-nc-nd/4.0/Permits non-commercial access and re-use, provided that author attribution and integrity are maintained; but does not permit creation of adaptations or other derivative works (https://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Meray, Aurelien O.
Sturla, Savannah
Siddiquee, Masudur R.
Serata, Rebecca
Uhlemann, Sebastian
Gonzalez-Raymat, Hansell
Denham, Miles
Upadhyay, Himanshu
Lagos, Leonel E.
Eddy-Dilek, Carol
Wainwright, Haruko M.
PyLEnM: A Machine Learning Framework for Long-Term Groundwater Contamination Monitoring Strategies
title PyLEnM: A Machine Learning Framework for Long-Term Groundwater Contamination Monitoring Strategies
title_full PyLEnM: A Machine Learning Framework for Long-Term Groundwater Contamination Monitoring Strategies
title_fullStr PyLEnM: A Machine Learning Framework for Long-Term Groundwater Contamination Monitoring Strategies
title_full_unstemmed PyLEnM: A Machine Learning Framework for Long-Term Groundwater Contamination Monitoring Strategies
title_short PyLEnM: A Machine Learning Framework for Long-Term Groundwater Contamination Monitoring Strategies
title_sort pylenm: a machine learning framework for long-term groundwater contamination monitoring strategies
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9069689/
https://www.ncbi.nlm.nih.gov/pubmed/35427133
http://dx.doi.org/10.1021/acs.est.1c07440
work_keys_str_mv AT merayaurelieno pylenmamachinelearningframeworkforlongtermgroundwatercontaminationmonitoringstrategies
AT sturlasavannah pylenmamachinelearningframeworkforlongtermgroundwatercontaminationmonitoringstrategies
AT siddiqueemasudurr pylenmamachinelearningframeworkforlongtermgroundwatercontaminationmonitoringstrategies
AT seratarebecca pylenmamachinelearningframeworkforlongtermgroundwatercontaminationmonitoringstrategies
AT uhlemannsebastian pylenmamachinelearningframeworkforlongtermgroundwatercontaminationmonitoringstrategies
AT gonzalezraymathansell pylenmamachinelearningframeworkforlongtermgroundwatercontaminationmonitoringstrategies
AT denhammiles pylenmamachinelearningframeworkforlongtermgroundwatercontaminationmonitoringstrategies
AT upadhyayhimanshu pylenmamachinelearningframeworkforlongtermgroundwatercontaminationmonitoringstrategies
AT lagosleonele pylenmamachinelearningframeworkforlongtermgroundwatercontaminationmonitoringstrategies
AT eddydilekcarol pylenmamachinelearningframeworkforlongtermgroundwatercontaminationmonitoringstrategies
AT wainwrightharukom pylenmamachinelearningframeworkforlongtermgroundwatercontaminationmonitoringstrategies