Cargando…

Using Historical Atlas Data to Develop High-Resolution Distribution Models of Freshwater Fishes

Understanding the spatial pattern of species distributions is fundamental in biogeography, and conservation and resource management applications. Most species distribution models (SDMs) require or prefer species presence and absence data for adequate estimation of model parameters. However, observat...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Jian, Frimpong, Emmanuel A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4468192/
https://www.ncbi.nlm.nih.gov/pubmed/26075902
http://dx.doi.org/10.1371/journal.pone.0129995
_version_ 1782376460479102976
author Huang, Jian
Frimpong, Emmanuel A.
author_facet Huang, Jian
Frimpong, Emmanuel A.
author_sort Huang, Jian
collection PubMed
description Understanding the spatial pattern of species distributions is fundamental in biogeography, and conservation and resource management applications. Most species distribution models (SDMs) require or prefer species presence and absence data for adequate estimation of model parameters. However, observations with unreliable or unreported species absences dominate and limit the implementation of SDMs. Presence-only models generally yield less accurate predictions of species distribution, and make it difficult to incorporate spatial autocorrelation. The availability of large amounts of historical presence records for freshwater fishes of the United States provides an opportunity for deriving reliable absences from data reported as presence-only, when sampling was predominantly community-based. In this study, we used boosted regression trees (BRT), logistic regression, and MaxEnt models to assess the performance of a historical metacommunity database with inferred absences, for modeling fish distributions, investigating the effect of model choice and data properties thereby. With models of the distribution of 76 native, non-game fish species of varied traits and rarity attributes in four river basins across the United States, we show that model accuracy depends on data quality (e.g., sample size, location precision), species’ rarity, statistical modeling technique, and consideration of spatial autocorrelation. The cross-validation area under the receiver-operating-characteristic curve (AUC) tended to be high in the spatial presence-absence models at the highest level of resolution for species with large geographic ranges and small local populations. Prevalence affected training but not validation AUC. The key habitat predictors identified and the fish-habitat relationships evaluated through partial dependence plots corroborated most previous studies. The community-based SDM framework broadens our capability to model species distributions by innovatively removing the constraint of lack of species absence data, thus providing a robust prediction of distribution for stream fishes in other regions where historical data exist, and for other taxa (e.g., benthic macroinvertebrates, birds) usually observed by community-based sampling designs.
format Online
Article
Text
id pubmed-4468192
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-44681922015-06-25 Using Historical Atlas Data to Develop High-Resolution Distribution Models of Freshwater Fishes Huang, Jian Frimpong, Emmanuel A. PLoS One Research Article Understanding the spatial pattern of species distributions is fundamental in biogeography, and conservation and resource management applications. Most species distribution models (SDMs) require or prefer species presence and absence data for adequate estimation of model parameters. However, observations with unreliable or unreported species absences dominate and limit the implementation of SDMs. Presence-only models generally yield less accurate predictions of species distribution, and make it difficult to incorporate spatial autocorrelation. The availability of large amounts of historical presence records for freshwater fishes of the United States provides an opportunity for deriving reliable absences from data reported as presence-only, when sampling was predominantly community-based. In this study, we used boosted regression trees (BRT), logistic regression, and MaxEnt models to assess the performance of a historical metacommunity database with inferred absences, for modeling fish distributions, investigating the effect of model choice and data properties thereby. With models of the distribution of 76 native, non-game fish species of varied traits and rarity attributes in four river basins across the United States, we show that model accuracy depends on data quality (e.g., sample size, location precision), species’ rarity, statistical modeling technique, and consideration of spatial autocorrelation. The cross-validation area under the receiver-operating-characteristic curve (AUC) tended to be high in the spatial presence-absence models at the highest level of resolution for species with large geographic ranges and small local populations. Prevalence affected training but not validation AUC. The key habitat predictors identified and the fish-habitat relationships evaluated through partial dependence plots corroborated most previous studies. The community-based SDM framework broadens our capability to model species distributions by innovatively removing the constraint of lack of species absence data, thus providing a robust prediction of distribution for stream fishes in other regions where historical data exist, and for other taxa (e.g., benthic macroinvertebrates, birds) usually observed by community-based sampling designs. Public Library of Science 2015-06-15 /pmc/articles/PMC4468192/ /pubmed/26075902 http://dx.doi.org/10.1371/journal.pone.0129995 Text en © 2015 Huang, Frimpong http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Huang, Jian
Frimpong, Emmanuel A.
Using Historical Atlas Data to Develop High-Resolution Distribution Models of Freshwater Fishes
title Using Historical Atlas Data to Develop High-Resolution Distribution Models of Freshwater Fishes
title_full Using Historical Atlas Data to Develop High-Resolution Distribution Models of Freshwater Fishes
title_fullStr Using Historical Atlas Data to Develop High-Resolution Distribution Models of Freshwater Fishes
title_full_unstemmed Using Historical Atlas Data to Develop High-Resolution Distribution Models of Freshwater Fishes
title_short Using Historical Atlas Data to Develop High-Resolution Distribution Models of Freshwater Fishes
title_sort using historical atlas data to develop high-resolution distribution models of freshwater fishes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4468192/
https://www.ncbi.nlm.nih.gov/pubmed/26075902
http://dx.doi.org/10.1371/journal.pone.0129995
work_keys_str_mv AT huangjian usinghistoricalatlasdatatodevelophighresolutiondistributionmodelsoffreshwaterfishes
AT frimpongemmanuela usinghistoricalatlasdatatodevelophighresolutiondistributionmodelsoffreshwaterfishes