Cargando…

A method to implement continuous characters in digital identification keys that estimates the probability of an annotation

PREMISE: Species identification is vital to many disciplines. Digital technology has improved identification tools, but the direct use of characters with continuous states has yet to be fully realized. To achieve full use of continuous characters for identification, I propose a classifier that calcu...

Descripción completa

Detalles Bibliográficos
Autor principal: Tyrrell, Christopher D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6526653/
https://www.ncbi.nlm.nih.gov/pubmed/31139513
http://dx.doi.org/10.1002/aps3.1247
_version_ 1783419932487188480
author Tyrrell, Christopher D.
author_facet Tyrrell, Christopher D.
author_sort Tyrrell, Christopher D.
collection PubMed
description PREMISE: Species identification is vital to many disciplines. Digital technology has improved identification tools, but the direct use of characters with continuous states has yet to be fully realized. To achieve full use of continuous characters for identification, I propose a classifier that calculates a posterior probability (degree of belief) in possible name assignments and an estimate of the relative evidence for the candidate annotations. METHODS: A model for a species is defined using continuous morphological characters, and an algorithm for identification with a naive Bayesian classifier, using the model, is presented. A method of estimating the strength of evidence for candidate species is also described. RESULTS: The proposed method is applied in two example identifications: native vs. invasive Myriophyllum in North America and vegetative Rhipidocladum bamboos in Mexico. In each instance, the new method provides a probability and estimate of the strength of the probability to enhance the name assignment in situations where taxa are difficult to differentiate using discrete character states. DISCUSSION: Naive Bayesian classifiers take advantage of the predictive information inherent in continuous morphological characters. Application of this methodology to plant taxonomy advances our ability to leverage digital technology for improved interactive taxonomic identifications.
format Online
Article
Text
id pubmed-6526653
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-65266532019-05-28 A method to implement continuous characters in digital identification keys that estimates the probability of an annotation Tyrrell, Christopher D. Appl Plant Sci Application Article PREMISE: Species identification is vital to many disciplines. Digital technology has improved identification tools, but the direct use of characters with continuous states has yet to be fully realized. To achieve full use of continuous characters for identification, I propose a classifier that calculates a posterior probability (degree of belief) in possible name assignments and an estimate of the relative evidence for the candidate annotations. METHODS: A model for a species is defined using continuous morphological characters, and an algorithm for identification with a naive Bayesian classifier, using the model, is presented. A method of estimating the strength of evidence for candidate species is also described. RESULTS: The proposed method is applied in two example identifications: native vs. invasive Myriophyllum in North America and vegetative Rhipidocladum bamboos in Mexico. In each instance, the new method provides a probability and estimate of the strength of the probability to enhance the name assignment in situations where taxa are difficult to differentiate using discrete character states. DISCUSSION: Naive Bayesian classifiers take advantage of the predictive information inherent in continuous morphological characters. Application of this methodology to plant taxonomy advances our ability to leverage digital technology for improved interactive taxonomic identifications. John Wiley and Sons Inc. 2019-05-08 /pmc/articles/PMC6526653/ /pubmed/31139513 http://dx.doi.org/10.1002/aps3.1247 Text en © 2019 Tyrrell Applications in Plant Sciences is published by Wiley Periodicals, Inc. on behalf of the Botanical Society of America This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Application Article
Tyrrell, Christopher D.
A method to implement continuous characters in digital identification keys that estimates the probability of an annotation
title A method to implement continuous characters in digital identification keys that estimates the probability of an annotation
title_full A method to implement continuous characters in digital identification keys that estimates the probability of an annotation
title_fullStr A method to implement continuous characters in digital identification keys that estimates the probability of an annotation
title_full_unstemmed A method to implement continuous characters in digital identification keys that estimates the probability of an annotation
title_short A method to implement continuous characters in digital identification keys that estimates the probability of an annotation
title_sort method to implement continuous characters in digital identification keys that estimates the probability of an annotation
topic Application Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6526653/
https://www.ncbi.nlm.nih.gov/pubmed/31139513
http://dx.doi.org/10.1002/aps3.1247
work_keys_str_mv AT tyrrellchristopherd amethodtoimplementcontinuouscharactersindigitalidentificationkeysthatestimatestheprobabilityofanannotation
AT tyrrellchristopherd methodtoimplementcontinuouscharactersindigitalidentificationkeysthatestimatestheprobabilityofanannotation