Cargando…
Pseudoabsence Generation Strategies for Species Distribution Models
BACKGROUND: Species distribution models require selection of species, study extent and spatial unit, statistical methods, variables, and assessment metrics. If absence data are not available, another important consideration is pseudoabsence generation. Different strategies for pseudoabsence generati...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3432107/ https://www.ncbi.nlm.nih.gov/pubmed/22952985 http://dx.doi.org/10.1371/journal.pone.0044486 |
_version_ | 1782242168018042880 |
---|---|
author | Hanberry, Brice B. He, Hong S. Palik, Brian J. |
author_facet | Hanberry, Brice B. He, Hong S. Palik, Brian J. |
author_sort | Hanberry, Brice B. |
collection | PubMed |
description | BACKGROUND: Species distribution models require selection of species, study extent and spatial unit, statistical methods, variables, and assessment metrics. If absence data are not available, another important consideration is pseudoabsence generation. Different strategies for pseudoabsence generation can produce varying spatial representation of species. METHODOLOGY: We considered model outcomes from four different strategies for generating pseudoabsences. We generating pseudoabsences randomly by 1) selection from the entire study extent, 2) a two-step process of selection first from the entire study extent, followed by selection for pseudoabsences from areas with predicted probability <25%, 3) selection from plots surveyed without detection of species presence, 4) a two-step process of selection first for pseudoabsences from plots surveyed without detection of species presence, followed by selection for pseudoabsences from the areas with predicted probability <25%. We used Random Forests as our statistical method and sixteen predictor variables to model tree species with at least 150 records from Forest Inventory and Analysis surveys in the Laurentian Mixed Forest province of Minnesota. CONCLUSIONS: Pseudoabsence generation strategy completely affected the area predicted as present for species distribution models and may be one of the most influential determinants of models. All the pseudoabsence strategies produced mean AUC values of at least 0.87. More importantly than accuracy metrics, the two-step strategies over-predicted species presence, due to too much environmental distance between the pseudoabsences and recorded presences, whereas models based on random pseudoabsences under-predicted species presence, due to too little environmental distance between the pseudoabsences and recorded presences. Models using pseudoabsences from surveyed plots produced a balance between areas with high and low predicted probabilities and the strongest relationship between density and area with predicted probabilities ≥75%. Because of imperfect accuracy assessment, the best assessment currently may be evaluation of whether the species has been sufficiently but not excessively predicted to occur. |
format | Online Article Text |
id | pubmed-3432107 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-34321072012-09-05 Pseudoabsence Generation Strategies for Species Distribution Models Hanberry, Brice B. He, Hong S. Palik, Brian J. PLoS One Research Article BACKGROUND: Species distribution models require selection of species, study extent and spatial unit, statistical methods, variables, and assessment metrics. If absence data are not available, another important consideration is pseudoabsence generation. Different strategies for pseudoabsence generation can produce varying spatial representation of species. METHODOLOGY: We considered model outcomes from four different strategies for generating pseudoabsences. We generating pseudoabsences randomly by 1) selection from the entire study extent, 2) a two-step process of selection first from the entire study extent, followed by selection for pseudoabsences from areas with predicted probability <25%, 3) selection from plots surveyed without detection of species presence, 4) a two-step process of selection first for pseudoabsences from plots surveyed without detection of species presence, followed by selection for pseudoabsences from the areas with predicted probability <25%. We used Random Forests as our statistical method and sixteen predictor variables to model tree species with at least 150 records from Forest Inventory and Analysis surveys in the Laurentian Mixed Forest province of Minnesota. CONCLUSIONS: Pseudoabsence generation strategy completely affected the area predicted as present for species distribution models and may be one of the most influential determinants of models. All the pseudoabsence strategies produced mean AUC values of at least 0.87. More importantly than accuracy metrics, the two-step strategies over-predicted species presence, due to too much environmental distance between the pseudoabsences and recorded presences, whereas models based on random pseudoabsences under-predicted species presence, due to too little environmental distance between the pseudoabsences and recorded presences. Models using pseudoabsences from surveyed plots produced a balance between areas with high and low predicted probabilities and the strongest relationship between density and area with predicted probabilities ≥75%. Because of imperfect accuracy assessment, the best assessment currently may be evaluation of whether the species has been sufficiently but not excessively predicted to occur. Public Library of Science 2012-08-31 /pmc/articles/PMC3432107/ /pubmed/22952985 http://dx.doi.org/10.1371/journal.pone.0044486 Text en © 2012 This is an open-access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Hanberry, Brice B. He, Hong S. Palik, Brian J. Pseudoabsence Generation Strategies for Species Distribution Models |
title | Pseudoabsence Generation Strategies for Species Distribution Models |
title_full | Pseudoabsence Generation Strategies for Species Distribution Models |
title_fullStr | Pseudoabsence Generation Strategies for Species Distribution Models |
title_full_unstemmed | Pseudoabsence Generation Strategies for Species Distribution Models |
title_short | Pseudoabsence Generation Strategies for Species Distribution Models |
title_sort | pseudoabsence generation strategies for species distribution models |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3432107/ https://www.ncbi.nlm.nih.gov/pubmed/22952985 http://dx.doi.org/10.1371/journal.pone.0044486 |
work_keys_str_mv | AT hanberrybriceb pseudoabsencegenerationstrategiesforspeciesdistributionmodels AT hehongs pseudoabsencegenerationstrategiesforspeciesdistributionmodels AT palikbrianj pseudoabsencegenerationstrategiesforspeciesdistributionmodels |