Cargando…

A Random Forest approach to predict the spatial distribution of sediment pollution in an estuarine system

Modeling the magnitude and distribution of sediment-bound pollutants in estuaries is often limited by incomplete knowledge of the site and inadequate sample density. To address these modeling limitations, a decision-support tool framework was conceived that predicts sediment contamination from the s...

Descripción completa

Detalles Bibliográficos
Autores principales: Walsh, Eric S., Kreakie, Betty J., Cantwell, Mark G., Nacci, Diane
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5524344/
https://www.ncbi.nlm.nih.gov/pubmed/28738089
http://dx.doi.org/10.1371/journal.pone.0179473
_version_ 1783252450716680192
author Walsh, Eric S.
Kreakie, Betty J.
Cantwell, Mark G.
Nacci, Diane
author_facet Walsh, Eric S.
Kreakie, Betty J.
Cantwell, Mark G.
Nacci, Diane
author_sort Walsh, Eric S.
collection PubMed
description Modeling the magnitude and distribution of sediment-bound pollutants in estuaries is often limited by incomplete knowledge of the site and inadequate sample density. To address these modeling limitations, a decision-support tool framework was conceived that predicts sediment contamination from the sub-estuary to broader estuary extent. For this study, a Random Forest (RF) model was implemented to predict the distribution of a model contaminant, triclosan (5-chloro-2-(2,4-dichlorophenoxy)phenol) (TCS), in Narragansett Bay, Rhode Island, USA. TCS is an unregulated contaminant used in many personal care products. The RF explanatory variables were associated with TCS transport and fate (proxies) and direct and indirect environmental entry. The continuous RF TCS concentration predictions were discretized into three levels of contamination (low, medium, and high) for three different quantile thresholds. The RF model explained 63% of the variance with a minimum number of variables. Total organic carbon (TOC) (transport and fate proxy) was a strong predictor of TCS contamination causing a mean squared error increase of 59% when compared to permutations of randomized values of TOC. Additionally, combined sewer overflow discharge (environmental entry) and sand (transport and fate proxy) were strong predictors. The discretization models identified a TCS area of greatest concern in the northern reach of Narragansett Bay (Providence River sub-estuary), which was validated with independent test samples. This decision-support tool performed well at the sub-estuary extent and provided the means to identify areas of concern and prioritize bay-wide sampling.
format Online
Article
Text
id pubmed-5524344
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-55243442017-08-07 A Random Forest approach to predict the spatial distribution of sediment pollution in an estuarine system Walsh, Eric S. Kreakie, Betty J. Cantwell, Mark G. Nacci, Diane PLoS One Research Article Modeling the magnitude and distribution of sediment-bound pollutants in estuaries is often limited by incomplete knowledge of the site and inadequate sample density. To address these modeling limitations, a decision-support tool framework was conceived that predicts sediment contamination from the sub-estuary to broader estuary extent. For this study, a Random Forest (RF) model was implemented to predict the distribution of a model contaminant, triclosan (5-chloro-2-(2,4-dichlorophenoxy)phenol) (TCS), in Narragansett Bay, Rhode Island, USA. TCS is an unregulated contaminant used in many personal care products. The RF explanatory variables were associated with TCS transport and fate (proxies) and direct and indirect environmental entry. The continuous RF TCS concentration predictions were discretized into three levels of contamination (low, medium, and high) for three different quantile thresholds. The RF model explained 63% of the variance with a minimum number of variables. Total organic carbon (TOC) (transport and fate proxy) was a strong predictor of TCS contamination causing a mean squared error increase of 59% when compared to permutations of randomized values of TOC. Additionally, combined sewer overflow discharge (environmental entry) and sand (transport and fate proxy) were strong predictors. The discretization models identified a TCS area of greatest concern in the northern reach of Narragansett Bay (Providence River sub-estuary), which was validated with independent test samples. This decision-support tool performed well at the sub-estuary extent and provided the means to identify areas of concern and prioritize bay-wide sampling. Public Library of Science 2017-07-24 /pmc/articles/PMC5524344/ /pubmed/28738089 http://dx.doi.org/10.1371/journal.pone.0179473 Text en https://creativecommons.org/publicdomain/zero/1.0/ This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication.
spellingShingle Research Article
Walsh, Eric S.
Kreakie, Betty J.
Cantwell, Mark G.
Nacci, Diane
A Random Forest approach to predict the spatial distribution of sediment pollution in an estuarine system
title A Random Forest approach to predict the spatial distribution of sediment pollution in an estuarine system
title_full A Random Forest approach to predict the spatial distribution of sediment pollution in an estuarine system
title_fullStr A Random Forest approach to predict the spatial distribution of sediment pollution in an estuarine system
title_full_unstemmed A Random Forest approach to predict the spatial distribution of sediment pollution in an estuarine system
title_short A Random Forest approach to predict the spatial distribution of sediment pollution in an estuarine system
title_sort random forest approach to predict the spatial distribution of sediment pollution in an estuarine system
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5524344/
https://www.ncbi.nlm.nih.gov/pubmed/28738089
http://dx.doi.org/10.1371/journal.pone.0179473
work_keys_str_mv AT walsherics arandomforestapproachtopredictthespatialdistributionofsedimentpollutioninanestuarinesystem
AT kreakiebettyj arandomforestapproachtopredictthespatialdistributionofsedimentpollutioninanestuarinesystem
AT cantwellmarkg arandomforestapproachtopredictthespatialdistributionofsedimentpollutioninanestuarinesystem
AT naccidiane arandomforestapproachtopredictthespatialdistributionofsedimentpollutioninanestuarinesystem
AT walsherics randomforestapproachtopredictthespatialdistributionofsedimentpollutioninanestuarinesystem
AT kreakiebettyj randomforestapproachtopredictthespatialdistributionofsedimentpollutioninanestuarinesystem
AT cantwellmarkg randomforestapproachtopredictthespatialdistributionofsedimentpollutioninanestuarinesystem
AT naccidiane randomforestapproachtopredictthespatialdistributionofsedimentpollutioninanestuarinesystem