Cargando…
A Random Forest approach to predict the spatial distribution of sediment pollution in an estuarine system
Modeling the magnitude and distribution of sediment-bound pollutants in estuaries is often limited by incomplete knowledge of the site and inadequate sample density. To address these modeling limitations, a decision-support tool framework was conceived that predicts sediment contamination from the s...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5524344/ https://www.ncbi.nlm.nih.gov/pubmed/28738089 http://dx.doi.org/10.1371/journal.pone.0179473 |
_version_ | 1783252450716680192 |
---|---|
author | Walsh, Eric S. Kreakie, Betty J. Cantwell, Mark G. Nacci, Diane |
author_facet | Walsh, Eric S. Kreakie, Betty J. Cantwell, Mark G. Nacci, Diane |
author_sort | Walsh, Eric S. |
collection | PubMed |
description | Modeling the magnitude and distribution of sediment-bound pollutants in estuaries is often limited by incomplete knowledge of the site and inadequate sample density. To address these modeling limitations, a decision-support tool framework was conceived that predicts sediment contamination from the sub-estuary to broader estuary extent. For this study, a Random Forest (RF) model was implemented to predict the distribution of a model contaminant, triclosan (5-chloro-2-(2,4-dichlorophenoxy)phenol) (TCS), in Narragansett Bay, Rhode Island, USA. TCS is an unregulated contaminant used in many personal care products. The RF explanatory variables were associated with TCS transport and fate (proxies) and direct and indirect environmental entry. The continuous RF TCS concentration predictions were discretized into three levels of contamination (low, medium, and high) for three different quantile thresholds. The RF model explained 63% of the variance with a minimum number of variables. Total organic carbon (TOC) (transport and fate proxy) was a strong predictor of TCS contamination causing a mean squared error increase of 59% when compared to permutations of randomized values of TOC. Additionally, combined sewer overflow discharge (environmental entry) and sand (transport and fate proxy) were strong predictors. The discretization models identified a TCS area of greatest concern in the northern reach of Narragansett Bay (Providence River sub-estuary), which was validated with independent test samples. This decision-support tool performed well at the sub-estuary extent and provided the means to identify areas of concern and prioritize bay-wide sampling. |
format | Online Article Text |
id | pubmed-5524344 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-55243442017-08-07 A Random Forest approach to predict the spatial distribution of sediment pollution in an estuarine system Walsh, Eric S. Kreakie, Betty J. Cantwell, Mark G. Nacci, Diane PLoS One Research Article Modeling the magnitude and distribution of sediment-bound pollutants in estuaries is often limited by incomplete knowledge of the site and inadequate sample density. To address these modeling limitations, a decision-support tool framework was conceived that predicts sediment contamination from the sub-estuary to broader estuary extent. For this study, a Random Forest (RF) model was implemented to predict the distribution of a model contaminant, triclosan (5-chloro-2-(2,4-dichlorophenoxy)phenol) (TCS), in Narragansett Bay, Rhode Island, USA. TCS is an unregulated contaminant used in many personal care products. The RF explanatory variables were associated with TCS transport and fate (proxies) and direct and indirect environmental entry. The continuous RF TCS concentration predictions were discretized into three levels of contamination (low, medium, and high) for three different quantile thresholds. The RF model explained 63% of the variance with a minimum number of variables. Total organic carbon (TOC) (transport and fate proxy) was a strong predictor of TCS contamination causing a mean squared error increase of 59% when compared to permutations of randomized values of TOC. Additionally, combined sewer overflow discharge (environmental entry) and sand (transport and fate proxy) were strong predictors. The discretization models identified a TCS area of greatest concern in the northern reach of Narragansett Bay (Providence River sub-estuary), which was validated with independent test samples. This decision-support tool performed well at the sub-estuary extent and provided the means to identify areas of concern and prioritize bay-wide sampling. Public Library of Science 2017-07-24 /pmc/articles/PMC5524344/ /pubmed/28738089 http://dx.doi.org/10.1371/journal.pone.0179473 Text en https://creativecommons.org/publicdomain/zero/1.0/ This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication. |
spellingShingle | Research Article Walsh, Eric S. Kreakie, Betty J. Cantwell, Mark G. Nacci, Diane A Random Forest approach to predict the spatial distribution of sediment pollution in an estuarine system |
title | A Random Forest approach to predict the spatial distribution of sediment pollution in an estuarine system |
title_full | A Random Forest approach to predict the spatial distribution of sediment pollution in an estuarine system |
title_fullStr | A Random Forest approach to predict the spatial distribution of sediment pollution in an estuarine system |
title_full_unstemmed | A Random Forest approach to predict the spatial distribution of sediment pollution in an estuarine system |
title_short | A Random Forest approach to predict the spatial distribution of sediment pollution in an estuarine system |
title_sort | random forest approach to predict the spatial distribution of sediment pollution in an estuarine system |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5524344/ https://www.ncbi.nlm.nih.gov/pubmed/28738089 http://dx.doi.org/10.1371/journal.pone.0179473 |
work_keys_str_mv | AT walsherics arandomforestapproachtopredictthespatialdistributionofsedimentpollutioninanestuarinesystem AT kreakiebettyj arandomforestapproachtopredictthespatialdistributionofsedimentpollutioninanestuarinesystem AT cantwellmarkg arandomforestapproachtopredictthespatialdistributionofsedimentpollutioninanestuarinesystem AT naccidiane arandomforestapproachtopredictthespatialdistributionofsedimentpollutioninanestuarinesystem AT walsherics randomforestapproachtopredictthespatialdistributionofsedimentpollutioninanestuarinesystem AT kreakiebettyj randomforestapproachtopredictthespatialdistributionofsedimentpollutioninanestuarinesystem AT cantwellmarkg randomforestapproachtopredictthespatialdistributionofsedimentpollutioninanestuarinesystem AT naccidiane randomforestapproachtopredictthespatialdistributionofsedimentpollutioninanestuarinesystem |