Cargando…

Mapped Predictions of Manganese and Arsenic in an Alluvial Aquifer Using Boosted Regression Trees

Manganese (Mn) concentrations and the probability of arsenic (As) exceeding the drinking‐water standard of 10 μg/L were predicted in the Mississippi River Valley alluvial aquifer (MRVA) using boosted regression trees (BRT). BRT, a type of ensemble‐tree machine‐learning model, were created using pred...

Descripción completa

Detalles Bibliográficos
Autores principales: Knierim, Katherine J., Kingsbury, James A., Belitz, Kenneth, Stackelberg, Paul E., Minsley, Burke J., Rigby, J.R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Blackwell Publishing Ltd 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9302655/
https://www.ncbi.nlm.nih.gov/pubmed/34951475
http://dx.doi.org/10.1111/gwat.13164
_version_ 1784751678506926080
author Knierim, Katherine J.
Kingsbury, James A.
Belitz, Kenneth
Stackelberg, Paul E.
Minsley, Burke J.
Rigby, J.R.
author_facet Knierim, Katherine J.
Kingsbury, James A.
Belitz, Kenneth
Stackelberg, Paul E.
Minsley, Burke J.
Rigby, J.R.
author_sort Knierim, Katherine J.
collection PubMed
description Manganese (Mn) concentrations and the probability of arsenic (As) exceeding the drinking‐water standard of 10 μg/L were predicted in the Mississippi River Valley alluvial aquifer (MRVA) using boosted regression trees (BRT). BRT, a type of ensemble‐tree machine‐learning model, were created using predictor variables that affect Mn and As distribution in groundwater. These variables included iron (Fe) concentrations and specific conductance predicted from previously developed BRT models, groundwater flux and age estimates from MODFLOW, and hydrologic characteristics. The models also included results from the first airborne geophysical survey conducted in the United States to target an entire aquifer system. Predictions of high Mn and As occurred where Fe was high. Predicted high Mn concentrations were correlated with fraction of young groundwater (less than 65 years) computed from MODFLOW results. High probabilities of As exceedance were predicted where groundwater was relatively old and airborne electromagnetic resistivity was high, typically proximal to streams. Two‐variable partial‐dependence plots and sensitivity analysis were used to provide insight into the factors controlling Mn and As distribution in groundwater. The maps of predicted Mn concentrations and As exceedance probabilities can be used to identify areas where these constituents may be high, and that could be targeted for further study. This paper shows that incorporation of a selected set of process‐informed data, such as MODFLOW results and airborne geophysics, into a machine‐learning model improves model interpretability. Incorporation of process‐rich information into machine‐learning models will likely be useful for addressing a wide range of problems of interest to groundwater hydrologists.
format Online
Article
Text
id pubmed-9302655
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Blackwell Publishing Ltd
record_format MEDLINE/PubMed
spelling pubmed-93026552022-07-22 Mapped Predictions of Manganese and Arsenic in an Alluvial Aquifer Using Boosted Regression Trees Knierim, Katherine J. Kingsbury, James A. Belitz, Kenneth Stackelberg, Paul E. Minsley, Burke J. Rigby, J.R. Ground Water Research Papers/ Manganese (Mn) concentrations and the probability of arsenic (As) exceeding the drinking‐water standard of 10 μg/L were predicted in the Mississippi River Valley alluvial aquifer (MRVA) using boosted regression trees (BRT). BRT, a type of ensemble‐tree machine‐learning model, were created using predictor variables that affect Mn and As distribution in groundwater. These variables included iron (Fe) concentrations and specific conductance predicted from previously developed BRT models, groundwater flux and age estimates from MODFLOW, and hydrologic characteristics. The models also included results from the first airborne geophysical survey conducted in the United States to target an entire aquifer system. Predictions of high Mn and As occurred where Fe was high. Predicted high Mn concentrations were correlated with fraction of young groundwater (less than 65 years) computed from MODFLOW results. High probabilities of As exceedance were predicted where groundwater was relatively old and airborne electromagnetic resistivity was high, typically proximal to streams. Two‐variable partial‐dependence plots and sensitivity analysis were used to provide insight into the factors controlling Mn and As distribution in groundwater. The maps of predicted Mn concentrations and As exceedance probabilities can be used to identify areas where these constituents may be high, and that could be targeted for further study. This paper shows that incorporation of a selected set of process‐informed data, such as MODFLOW results and airborne geophysics, into a machine‐learning model improves model interpretability. Incorporation of process‐rich information into machine‐learning models will likely be useful for addressing a wide range of problems of interest to groundwater hydrologists. Blackwell Publishing Ltd 2022-01-07 2022 /pmc/articles/PMC9302655/ /pubmed/34951475 http://dx.doi.org/10.1111/gwat.13164 Text en Published 2022. This article is a U.S. Government work and is in the public domain in the USA. Groundwater published by Wiley Periodicals LLC on behalf of National Ground Water Association. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non‐commercial and no modifications or adaptations are made.
spellingShingle Research Papers/
Knierim, Katherine J.
Kingsbury, James A.
Belitz, Kenneth
Stackelberg, Paul E.
Minsley, Burke J.
Rigby, J.R.
Mapped Predictions of Manganese and Arsenic in an Alluvial Aquifer Using Boosted Regression Trees
title Mapped Predictions of Manganese and Arsenic in an Alluvial Aquifer Using Boosted Regression Trees
title_full Mapped Predictions of Manganese and Arsenic in an Alluvial Aquifer Using Boosted Regression Trees
title_fullStr Mapped Predictions of Manganese and Arsenic in an Alluvial Aquifer Using Boosted Regression Trees
title_full_unstemmed Mapped Predictions of Manganese and Arsenic in an Alluvial Aquifer Using Boosted Regression Trees
title_short Mapped Predictions of Manganese and Arsenic in an Alluvial Aquifer Using Boosted Regression Trees
title_sort mapped predictions of manganese and arsenic in an alluvial aquifer using boosted regression trees
topic Research Papers/
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9302655/
https://www.ncbi.nlm.nih.gov/pubmed/34951475
http://dx.doi.org/10.1111/gwat.13164
work_keys_str_mv AT knierimkatherinej mappedpredictionsofmanganeseandarsenicinanalluvialaquiferusingboostedregressiontrees
AT kingsburyjamesa mappedpredictionsofmanganeseandarsenicinanalluvialaquiferusingboostedregressiontrees
AT belitzkenneth mappedpredictionsofmanganeseandarsenicinanalluvialaquiferusingboostedregressiontrees
AT stackelbergpaule mappedpredictionsofmanganeseandarsenicinanalluvialaquiferusingboostedregressiontrees
AT minsleyburkej mappedpredictionsofmanganeseandarsenicinanalluvialaquiferusingboostedregressiontrees
AT rigbyjr mappedpredictionsofmanganeseandarsenicinanalluvialaquiferusingboostedregressiontrees