Species distribution models for invasive Eurasian watermilfoil highlight the importance of data quality and limitations of discrimination accuracy metrics

AIM: Availability of uniformly collected presence, absence, and abundance data remains a key challenge in species distribution modeling (SDM). For invasive species, abundance and impacts are highly variable across landscapes, and quality occurrence and abundance data are critical for predicting loca...

Bibliographic Details
Main Authors: Thomas, Shyam M., Verhoeven, Michael R., Walsh, Jake R., Larkin, Daniel J., Hansen, Gretchen J. A.
Format: Online Article Text
Language: English
Published: John Wiley and Sons Inc. 2021
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8462136/
https://www.ncbi.nlm.nih.gov/pubmed/34594521
http://dx.doi.org/10.1002/ece3.8002
_version_ 1784572135789821952
author Thomas, Shyam M.
Verhoeven, Michael R.
Walsh, Jake R.
Larkin, Daniel J.
Hansen, Gretchen J. A.
author_facet Thomas, Shyam M.
Verhoeven, Michael R.
Walsh, Jake R.
Larkin, Daniel J.
Hansen, Gretchen J. A.
author_sort Thomas, Shyam M.
collection PubMed
description AIM: Availability of uniformly collected presence, absence, and abundance data remains a key challenge in species distribution modeling (SDM). For invasive species, abundance and impacts are highly variable across landscapes, and quality occurrence and abundance data are critical for predicting locations at high risk for invasion and impacts, respectively. We leverage a large aquatic vegetation dataset comprising point‐level survey data that includes information on the invasive plant Myriophyllum spicatum (Eurasian watermilfoil) to: (a) develop SDMs to predict invasion and impact from environmental variables based on presence–absence, presence‐only, and abundance data, and (b) compare evaluation metrics based on functional and discrimination accuracy for presence–absence and presence‐only SDMs. LOCATION: Minnesota, USA. METHODS: Eurasian watermilfoil presence–absence and abundance information were gathered from 468 surveyed lakes, and 801 unsurveyed lakes were leveraged as pseudoabsences for presence‐only models. A Random Forest algorithm was used to model the distribution and abundance of Eurasian watermilfoil as a function of lake‐specific predictors, both with and without a spatial autocovariate. Occurrence‐based SDMs were evaluated using conventional discrimination accuracy metrics and functional accuracy metrics assessing correlation between predicted suitability and observed abundance. RESULTS: Water temperature degree days and maximum lake depth were two leading predictors influencing both invasion risk and abundance, but they were relatively less important for predicting abundance than other water quality measures. Road density was a strong predictor of Eurasian watermilfoil invasion risk but not abundance. Model evaluations highlighted significant differences: Presence–absence models had high functional accuracy despite low discrimination accuracy, whereas presence‐only models showed the opposite pattern. 
MAIN CONCLUSION: Complementing presence–absence data with abundance information offers a richer understanding of invasive Eurasian watermilfoil's ecological niche and enables evaluation of the model's functional accuracy. Conventional discrimination accuracy measures were misleading when models were developed using pseudoabsences. We thus caution against the overuse of presence‐only models and suggest directing more effort toward systematic monitoring programs that yield high‐quality data.
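The contrast the abstract draws between discrimination accuracy and functional accuracy can be illustrated with a minimal sketch. It assumes AUC as the discrimination metric and a Spearman rank correlation between predicted suitability and observed abundance as the functional metric; the abstract names neither statistic, so both choices, the function names, and the toy lake values below are hypothetical and are not taken from the study.

```python
from typing import List, Sequence


def average_ranks(values: Sequence[float]) -> List[float]:
    """Return 1-based ranks; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Discrimination accuracy: rank-based (Mann-Whitney) AUC for 0/1 labels."""
    ranks = average_ranks(scores)
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    pos_rank_sum = sum(r for r, y in zip(ranks, labels) if y == 1)
    return (pos_rank_sum - n_pos * (n_pos + 1) / 2) / (n_pos * n_neg)


def spearman(x: Sequence[float], y: Sequence[float]) -> float:
    """Functional accuracy (assumed form): Spearman correlation of x and y."""
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = sum(rx) / len(rx), sum(ry) / len(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5


if __name__ == "__main__":
    # Hypothetical lakes: 1 = milfoil present, 0 = absent (or pseudoabsence).
    presence = [1, 1, 1, 0, 0, 0]
    suitability = [0.9, 0.6, 0.4, 0.5, 0.2, 0.1]  # hypothetical model output
    print("AUC:", auc(presence, suitability))

    # Hypothetical abundance (e.g. frequency of occurrence) in occupied lakes.
    occupied_suitability = [0.9, 0.6, 0.4]
    observed_abundance = [0.35, 0.50, 0.10]
    print("Spearman:", spearman(occupied_suitability, observed_abundance))
```

Because the two statistics are computed from different observations (labels versus abundance), a model can score well on one and poorly on the other, which is the pattern the abstract reports for presence–absence versus presence‐only models.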
format Online
Article
Text
id pubmed-8462136
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-8462136 2021-09-29 Species distribution models for invasive Eurasian watermilfoil highlight the importance of data quality and limitations of discrimination accuracy metrics Thomas, Shyam M. Verhoeven, Michael R. Walsh, Jake R. Larkin, Daniel J. Hansen, Gretchen J. A. Ecol Evol Original Research AIM: Availability of uniformly collected presence, absence, and abundance data remains a key challenge in species distribution modeling (SDM). For invasive species, abundance and impacts are highly variable across landscapes, and quality occurrence and abundance data are critical for predicting locations at high risk for invasion and impacts, respectively. We leverage a large aquatic vegetation dataset comprising point‐level survey data that includes information on the invasive plant Myriophyllum spicatum (Eurasian watermilfoil) to: (a) develop SDMs to predict invasion and impact from environmental variables based on presence–absence, presence‐only, and abundance data, and (b) compare evaluation metrics based on functional and discrimination accuracy for presence–absence and presence‐only SDMs. LOCATION: Minnesota, USA. METHODS: Eurasian watermilfoil presence–absence and abundance information were gathered from 468 surveyed lakes, and 801 unsurveyed lakes were leveraged as pseudoabsences for presence‐only models. A Random Forest algorithm was used to model the distribution and abundance of Eurasian watermilfoil as a function of lake‐specific predictors, both with and without a spatial autocovariate. Occurrence‐based SDMs were evaluated using conventional discrimination accuracy metrics and functional accuracy metrics assessing correlation between predicted suitability and observed abundance. RESULTS: Water temperature degree days and maximum lake depth were two leading predictors influencing both invasion risk and abundance, but they were relatively less important for predicting abundance than other water quality measures. 
Road density was a strong predictor of Eurasian watermilfoil invasion risk but not abundance. Model evaluations highlighted significant differences: Presence–absence models had high functional accuracy despite low discrimination accuracy, whereas presence‐only models showed the opposite pattern. MAIN CONCLUSION: Complementing presence–absence data with abundance information offers a richer understanding of invasive Eurasian watermilfoil's ecological niche and enables evaluation of the model's functional accuracy. Conventional discrimination accuracy measures were misleading when models were developed using pseudoabsences. We thus caution against the overuse of presence‐only models and suggest directing more effort toward systematic monitoring programs that yield high‐quality data. John Wiley and Sons Inc. 2021-08-13 /pmc/articles/PMC8462136/ /pubmed/34594521 http://dx.doi.org/10.1002/ece3.8002 Text en © 2021 The Authors. Ecology and Evolution published by John Wiley & Sons Ltd. This is an open access article under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Research
Thomas, Shyam M.
Verhoeven, Michael R.
Walsh, Jake R.
Larkin, Daniel J.
Hansen, Gretchen J. A.
Species distribution models for invasive Eurasian watermilfoil highlight the importance of data quality and limitations of discrimination accuracy metrics
title Species distribution models for invasive Eurasian watermilfoil highlight the importance of data quality and limitations of discrimination accuracy metrics
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8462136/
https://www.ncbi.nlm.nih.gov/pubmed/34594521
http://dx.doi.org/10.1002/ece3.8002