Cargando…

Population distribution models: species distributions are better modeled using biologically relevant data partitions

BACKGROUND: Predicting the geographic distribution of widespread species through modeling is problematic for several reasons including high rates of omission errors. One potential source of error for modeling widespread species is that subspecies and/or races of species are frequently pooled for ana...

Descripción completa

Detalles Bibliográficos
Autores principales: Gonzalez, Sergio C, Soto-Centeno, J Angel, Reed, David L
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3184255/
https://www.ncbi.nlm.nih.gov/pubmed/21929792
http://dx.doi.org/10.1186/1472-6785-11-20
_version_ 1782213081686867968
author Gonzalez, Sergio C
Soto-Centeno, J Angel
Reed, David L
author_facet Gonzalez, Sergio C
Soto-Centeno, J Angel
Reed, David L
author_sort Gonzalez, Sergio C
collection PubMed
description BACKGROUND: Predicting the geographic distribution of widespread species through modeling is problematic for several reasons including high rates of omission errors. One potential source of error for modeling widespread species is that subspecies and/or races of species are frequently pooled for analyses, which may mask biologically relevant spatial variation within the distribution of a single widespread species. We contrast a presence-only maximum entropy model for the widely distributed oldfield mouse (Peromyscus polionotus) that includes all available presence locations for this species, with two composite maximum entropy models. The composite models either subdivided the total species distribution into four geographic quadrants or by fifteen subspecies to capture spatially relevant variation in P. polionotus distributions. RESULTS: Despite high Area Under the ROC Curve (AUC) values for all models, the composite species distribution model of P. polionotus generated from individual subspecies models represented the known distribution of the species much better than did the models produced by partitioning data into geographic quadrants or modeling the whole species as a single unit. CONCLUSIONS: Because the AUC values failed to describe the differences in the predictability of the three modeling strategies, we suggest using omission curves in addition to AUC values to assess model performance. Dividing the data of a widespread species into biologically relevant partitions greatly increased the performance of our distribution model; therefore, this approach may prove to be quite practical and informative for a wide range of modeling applications.
format Online
Article
Text
id pubmed-3184255
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-31842552011-10-02 Population distribution models: species distributions are better modeled using biologically relevant data partitions Gonzalez, Sergio C Soto-Centeno, J Angel Reed, David L BMC Ecol Research Article BACKGROUND: Predicting the geographic distribution of widespread species through modeling is problematic for several reasons including high rates of omission errors. One potential source of error for modeling widespread species is that subspecies and/or races of species are frequently pooled for analyses, which may mask biologically relevant spatial variation within the distribution of a single widespread species. We contrast a presence-only maximum entropy model for the widely distributed oldfield mouse (Peromyscus polionotus) that includes all available presence locations for this species, with two composite maximum entropy models. The composite models either subdivided the total species distribution into four geographic quadrants or by fifteen subspecies to capture spatially relevant variation in P. polionotus distributions. RESULTS: Despite high Area Under the ROC Curve (AUC) values for all models, the composite species distribution model of P. polionotus generated from individual subspecies models represented the known distribution of the species much better than did the models produced by partitioning data into geographic quadrants or modeling the whole species as a single unit. CONCLUSIONS: Because the AUC values failed to describe the differences in the predictability of the three modeling strategies, we suggest using omission curves in addition to AUC values to assess model performance. Dividing the data of a widespread species into biologically relevant partitions greatly increased the performance of our distribution model; therefore, this approach may prove to be quite practical and informative for a wide range of modeling applications. BioMed Central 2011-09-19 /pmc/articles/PMC3184255/ /pubmed/21929792 http://dx.doi.org/10.1186/1472-6785-11-20 Text en Copyright ©2011 Gonzalez et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Gonzalez, Sergio C
Soto-Centeno, J Angel
Reed, David L
Population distribution models: species distributions are better modeled using biologically relevant data partitions
title Population distribution models: species distributions are better modeled using biologically relevant data partitions
title_full Population distribution models: species distributions are better modeled using biologically relevant data partitions
title_fullStr Population distribution models: species distributions are better modeled using biologically relevant data partitions
title_full_unstemmed Population distribution models: species distributions are better modeled using biologically relevant data partitions
title_short Population distribution models: species distributions are better modeled using biologically relevant data partitions
title_sort population distribution models: species distributions are better modeled using biologically relevant data partitions
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3184255/
https://www.ncbi.nlm.nih.gov/pubmed/21929792
http://dx.doi.org/10.1186/1472-6785-11-20
work_keys_str_mv AT gonzalezsergioc populationdistributionmodelsspeciesdistributionsarebettermodeledusingbiologicallyrelevantdatapartitions
AT sotocentenojangel populationdistributionmodelsspeciesdistributionsarebettermodeledusingbiologicallyrelevantdatapartitions
AT reeddavidl populationdistributionmodelsspeciesdistributionsarebettermodeledusingbiologicallyrelevantdatapartitions