Cargando…

A machine learning method for estimating the probability of presence using presence‐background data

Estimating the prevalence or the absolute probability of the presence of a species from presence‐background data has become a controversial topic in species distribution modelling. In this paper, we propose a new method by combining both statistics and machine learning algorithms that helps overcome...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Yan, Samarasekara, Chathuri L., Stone, Lewi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9203590/
https://www.ncbi.nlm.nih.gov/pubmed/35784023
http://dx.doi.org/10.1002/ece3.8998
_version_ 1784728740019830784
author Wang, Yan
Samarasekara, Chathuri L.
Stone, Lewi
author_facet Wang, Yan
Samarasekara, Chathuri L.
Stone, Lewi
author_sort Wang, Yan
collection PubMed
description Estimating the prevalence or the absolute probability of the presence of a species from presence‐background data has become a controversial topic in species distribution modelling. In this paper, we propose a new method by combining both statistics and machine learning algorithms that helps overcome some of the known existing problems. We have also revisited the popular but highly controversial Lele and Keim (LK) method by evaluating its performance and assessing the RSPF condition it relies on. Simulations show that the LK method with the RSPF assumptions would render fragile estimation/prediction of the desired probabilities. Rather, we propose the local knowledge condition, which relaxes the predetermined population prevalence condition that has so often been used in much of the existing literature. Simulations demonstrate the performance of the new method utilizing the local knowledge assumption to successfully estimate the probability of presence. The local knowledge extends the local certainty or the prototypical presence location assumption, and has significant implications for demonstrating the necessary condition for identifying absolute (rather than relative) probability of presence from presence background without absence data in species distribution modelling.
format Online
Article
Text
id pubmed-9203590
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-92035902022-07-01 A machine learning method for estimating the probability of presence using presence‐background data Wang, Yan Samarasekara, Chathuri L. Stone, Lewi Ecol Evol Research Articles Estimating the prevalence or the absolute probability of the presence of a species from presence‐background data has become a controversial topic in species distribution modelling. In this paper, we propose a new method by combining both statistics and machine learning algorithms that helps overcome some of the known existing problems. We have also revisited the popular but highly controversial Lele and Keim (LK) method by evaluating its performance and assessing the RSPF condition it relies on. Simulations show that the LK method with the RSPF assumptions would render fragile estimation/prediction of the desired probabilities. Rather, we propose the local knowledge condition, which relaxes the predetermined population prevalence condition that has so often been used in much of the existing literature. Simulations demonstrate the performance of the new method utilizing the local knowledge assumption to successfully estimate the probability of presence. The local knowledge extends the local certainty or the prototypical presence location assumption, and has significant implications for demonstrating the necessary condition for identifying absolute (rather than relative) probability of presence from presence background without absence data in species distribution modelling. John Wiley and Sons Inc. 2022-06-16 /pmc/articles/PMC9203590/ /pubmed/35784023 http://dx.doi.org/10.1002/ece3.8998 Text en © 2022 The Authors. Ecology and Evolution published by John Wiley & Sons Ltd. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Articles
Wang, Yan
Samarasekara, Chathuri L.
Stone, Lewi
A machine learning method for estimating the probability of presence using presence‐background data
title A machine learning method for estimating the probability of presence using presence‐background data
title_full A machine learning method for estimating the probability of presence using presence‐background data
title_fullStr A machine learning method for estimating the probability of presence using presence‐background data
title_full_unstemmed A machine learning method for estimating the probability of presence using presence‐background data
title_short A machine learning method for estimating the probability of presence using presence‐background data
title_sort machine learning method for estimating the probability of presence using presence‐background data
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9203590/
https://www.ncbi.nlm.nih.gov/pubmed/35784023
http://dx.doi.org/10.1002/ece3.8998
work_keys_str_mv AT wangyan amachinelearningmethodforestimatingtheprobabilityofpresenceusingpresencebackgrounddata
AT samarasekarachathuril amachinelearningmethodforestimatingtheprobabilityofpresenceusingpresencebackgrounddata
AT stonelewi amachinelearningmethodforestimatingtheprobabilityofpresenceusingpresencebackgrounddata
AT wangyan machinelearningmethodforestimatingtheprobabilityofpresenceusingpresencebackgrounddata
AT samarasekarachathuril machinelearningmethodforestimatingtheprobabilityofpresenceusingpresencebackgrounddata
AT stonelewi machinelearningmethodforestimatingtheprobabilityofpresenceusingpresencebackgrounddata