Cargando…

Predicting bacterial transport through saturated porous media using an automated machine learning model

Escherichia coli, as an indicator of fecal contamination, can move from manure-amended soil to groundwater under rainfall or irrigation events. Predicting its vertical transport in the subsurface is essential for the development of engineering solutions to reduce the risk of microbiological contamin...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Fengxian, Zhou, Bin, Yang, Liqiong, Chen, Xijuan, Zhuang, Jie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10206036/
https://www.ncbi.nlm.nih.gov/pubmed/37234532
http://dx.doi.org/10.3389/fmicb.2023.1152059
_version_ 1785046140103688192
author Chen, Fengxian
Zhou, Bin
Yang, Liqiong
Chen, Xijuan
Zhuang, Jie
author_facet Chen, Fengxian
Zhou, Bin
Yang, Liqiong
Chen, Xijuan
Zhuang, Jie
author_sort Chen, Fengxian
collection PubMed
description Escherichia coli, as an indicator of fecal contamination, can move from manure-amended soil to groundwater under rainfall or irrigation events. Predicting its vertical transport in the subsurface is essential for the development of engineering solutions to reduce the risk of microbiological contamination. In this study, we collected 377 datasets from 61 published papers addressing E. coli transport through saturated porous media and trained six types of machine learning algorithms to predict bacterial transport. Eight variables, including bacterial concentration, porous medium type, median grain size, ionic strength, pore water velocity, column length, saturated hydraulic conductivity, and organic matter content were used as input variables while the first-order attachment coefficient and spatial removal rate were set as target variables. The eight input variables have low correlations with the target variables, namely, they cannot predict target variables independently. However, using the predictive models, input variables can effectively predict the target variables. For scenarios with higher bacterial retention, such as smaller median grain size, the predictive models showed better performance. Among six types of machine learning algorithms, Gradient Boosting Machine and Extreme Gradient Boosting outperformed other algorithms. In most predictive models, pore water velocity, ionic strength, median grain size, and column length showed higher importance than other input variables. This study provided a valuable tool to evaluate the transport risk of E.coli in the subsurface under saturated water flow conditions. It also proved the feasibility of data-driven methods that could be used for predicting other contaminants’ transport in the environment.
format Online
Article
Text
id pubmed-10206036
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-102060362023-05-25 Predicting bacterial transport through saturated porous media using an automated machine learning model Chen, Fengxian Zhou, Bin Yang, Liqiong Chen, Xijuan Zhuang, Jie Front Microbiol Microbiology Escherichia coli, as an indicator of fecal contamination, can move from manure-amended soil to groundwater under rainfall or irrigation events. Predicting its vertical transport in the subsurface is essential for the development of engineering solutions to reduce the risk of microbiological contamination. In this study, we collected 377 datasets from 61 published papers addressing E. coli transport through saturated porous media and trained six types of machine learning algorithms to predict bacterial transport. Eight variables, including bacterial concentration, porous medium type, median grain size, ionic strength, pore water velocity, column length, saturated hydraulic conductivity, and organic matter content were used as input variables while the first-order attachment coefficient and spatial removal rate were set as target variables. The eight input variables have low correlations with the target variables, namely, they cannot predict target variables independently. However, using the predictive models, input variables can effectively predict the target variables. For scenarios with higher bacterial retention, such as smaller median grain size, the predictive models showed better performance. Among six types of machine learning algorithms, Gradient Boosting Machine and Extreme Gradient Boosting outperformed other algorithms. In most predictive models, pore water velocity, ionic strength, median grain size, and column length showed higher importance than other input variables. This study provided a valuable tool to evaluate the transport risk of E.coli in the subsurface under saturated water flow conditions. It also proved the feasibility of data-driven methods that could be used for predicting other contaminants’ transport in the environment. Frontiers Media S.A. 2023-05-10 /pmc/articles/PMC10206036/ /pubmed/37234532 http://dx.doi.org/10.3389/fmicb.2023.1152059 Text en Copyright © 2023 Chen, Zhou, Yang, Chen and Zhuang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Microbiology
Chen, Fengxian
Zhou, Bin
Yang, Liqiong
Chen, Xijuan
Zhuang, Jie
Predicting bacterial transport through saturated porous media using an automated machine learning model
title Predicting bacterial transport through saturated porous media using an automated machine learning model
title_full Predicting bacterial transport through saturated porous media using an automated machine learning model
title_fullStr Predicting bacterial transport through saturated porous media using an automated machine learning model
title_full_unstemmed Predicting bacterial transport through saturated porous media using an automated machine learning model
title_short Predicting bacterial transport through saturated porous media using an automated machine learning model
title_sort predicting bacterial transport through saturated porous media using an automated machine learning model
topic Microbiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10206036/
https://www.ncbi.nlm.nih.gov/pubmed/37234532
http://dx.doi.org/10.3389/fmicb.2023.1152059
work_keys_str_mv AT chenfengxian predictingbacterialtransportthroughsaturatedporousmediausinganautomatedmachinelearningmodel
AT zhoubin predictingbacterialtransportthroughsaturatedporousmediausinganautomatedmachinelearningmodel
AT yangliqiong predictingbacterialtransportthroughsaturatedporousmediausinganautomatedmachinelearningmodel
AT chenxijuan predictingbacterialtransportthroughsaturatedporousmediausinganautomatedmachinelearningmodel
AT zhuangjie predictingbacterialtransportthroughsaturatedporousmediausinganautomatedmachinelearningmodel