Cargando…

Identification Framework of Contaminant Spill in Rivers Using Machine Learning with Breakthrough Curve Analysis

To minimize the damage from contaminant accidents in rivers, early identification of the contaminant source is crucial. Thus, in this study, a framework combining Machine Learning (ML) and the Transient Storage zone Model (TSM) was developed to predict the spill location and mass of a contaminant so...

Descripción completa

Detalles Bibliográficos
Autores principales: Kwon, Siyoon, Noh, Hyoseob, Seo, Il Won, Jung, Sung Hyun, Baek, Donghae
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7908193/
https://www.ncbi.nlm.nih.gov/pubmed/33498931
http://dx.doi.org/10.3390/ijerph18031023
_version_ 1783655656112259072
author Kwon, Siyoon
Noh, Hyoseob
Seo, Il Won
Jung, Sung Hyun
Baek, Donghae
author_facet Kwon, Siyoon
Noh, Hyoseob
Seo, Il Won
Jung, Sung Hyun
Baek, Donghae
author_sort Kwon, Siyoon
collection PubMed
description To minimize the damage from contaminant accidents in rivers, early identification of the contaminant source is crucial. Thus, in this study, a framework combining Machine Learning (ML) and the Transient Storage zone Model (TSM) was developed to predict the spill location and mass of a contaminant source. The TSM model was employed to simulate non-Fickian Breakthrough Curves (BTCs), which entails relevant information of the contaminant source. Then, the ML models were used to identify the BTC features, characterized by 21 variables, to predict the spill location and mass. The proposed framework was applied to the Gam Creek, South Korea, in which two tracer tests were conducted. In this study, six ML methods were applied for the prediction of spill location and mass, while the most relevant BTC features were selected by Recursive Feature Elimination Cross-Validation (RFECV). Model applications to field data showed that the ensemble Decision tree models, Random Forest (RF) and Xgboost (XGB), were the most efficient and feasible in predicting the contaminant source.
format Online
Article
Text
id pubmed-7908193
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-79081932021-02-27 Identification Framework of Contaminant Spill in Rivers Using Machine Learning with Breakthrough Curve Analysis Kwon, Siyoon Noh, Hyoseob Seo, Il Won Jung, Sung Hyun Baek, Donghae Int J Environ Res Public Health Article To minimize the damage from contaminant accidents in rivers, early identification of the contaminant source is crucial. Thus, in this study, a framework combining Machine Learning (ML) and the Transient Storage zone Model (TSM) was developed to predict the spill location and mass of a contaminant source. The TSM model was employed to simulate non-Fickian Breakthrough Curves (BTCs), which entails relevant information of the contaminant source. Then, the ML models were used to identify the BTC features, characterized by 21 variables, to predict the spill location and mass. The proposed framework was applied to the Gam Creek, South Korea, in which two tracer tests were conducted. In this study, six ML methods were applied for the prediction of spill location and mass, while the most relevant BTC features were selected by Recursive Feature Elimination Cross-Validation (RFECV). Model applications to field data showed that the ensemble Decision tree models, Random Forest (RF) and Xgboost (XGB), were the most efficient and feasible in predicting the contaminant source. MDPI 2021-01-24 2021-02 /pmc/articles/PMC7908193/ /pubmed/33498931 http://dx.doi.org/10.3390/ijerph18031023 Text en © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Kwon, Siyoon
Noh, Hyoseob
Seo, Il Won
Jung, Sung Hyun
Baek, Donghae
Identification Framework of Contaminant Spill in Rivers Using Machine Learning with Breakthrough Curve Analysis
title Identification Framework of Contaminant Spill in Rivers Using Machine Learning with Breakthrough Curve Analysis
title_full Identification Framework of Contaminant Spill in Rivers Using Machine Learning with Breakthrough Curve Analysis
title_fullStr Identification Framework of Contaminant Spill in Rivers Using Machine Learning with Breakthrough Curve Analysis
title_full_unstemmed Identification Framework of Contaminant Spill in Rivers Using Machine Learning with Breakthrough Curve Analysis
title_short Identification Framework of Contaminant Spill in Rivers Using Machine Learning with Breakthrough Curve Analysis
title_sort identification framework of contaminant spill in rivers using machine learning with breakthrough curve analysis
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7908193/
https://www.ncbi.nlm.nih.gov/pubmed/33498931
http://dx.doi.org/10.3390/ijerph18031023
work_keys_str_mv AT kwonsiyoon identificationframeworkofcontaminantspillinriversusingmachinelearningwithbreakthroughcurveanalysis
AT nohhyoseob identificationframeworkofcontaminantspillinriversusingmachinelearningwithbreakthroughcurveanalysis
AT seoilwon identificationframeworkofcontaminantspillinriversusingmachinelearningwithbreakthroughcurveanalysis
AT jungsunghyun identificationframeworkofcontaminantspillinriversusingmachinelearningwithbreakthroughcurveanalysis
AT baekdonghae identificationframeworkofcontaminantspillinriversusingmachinelearningwithbreakthroughcurveanalysis