Cargando…

Visual Recognition Software for Binary Classification and Its Application to Spruce Pollen Identification

Discriminating between black and white spruce (Picea mariana and Picea glauca) is a difficult palynological classification problem that, if solved, would provide valuable data for paleoclimate reconstructions. We developed an open-source visual recognition software (ARLO, Automated Recognition with...

Descripción completa

Detalles Bibliográficos
Autores principales: Tcheng, David K., Nayak, Ashwin K., Fowlkes, Charless C., Punyasena, Surangi W.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4750970/
https://www.ncbi.nlm.nih.gov/pubmed/26867017
http://dx.doi.org/10.1371/journal.pone.0148879
_version_ 1782415512776474624
author Tcheng, David K.
Nayak, Ashwin K.
Fowlkes, Charless C.
Punyasena, Surangi W.
author_facet Tcheng, David K.
Nayak, Ashwin K.
Fowlkes, Charless C.
Punyasena, Surangi W.
author_sort Tcheng, David K.
collection PubMed
description Discriminating between black and white spruce (Picea mariana and Picea glauca) is a difficult palynological classification problem that, if solved, would provide valuable data for paleoclimate reconstructions. We developed an open-source visual recognition software (ARLO, Automated Recognition with Layered Optimization) capable of differentiating between these two species at an accuracy on par with human experts. The system applies pattern recognition and machine learning to the analysis of pollen images and discovers general-purpose image features, defined by simple features of lines and grids of pixels taken at different dimensions, size, spacing, and resolution. It adapts to a given problem by searching for the most effective combination of both feature representation and learning strategy. This results in a powerful and flexible framework for image classification. We worked with images acquired using an automated slide scanner. We first applied a hash-based “pollen spotting” model to segment pollen grains from the slide background. We next tested ARLO’s ability to reconstruct black to white spruce pollen ratios using artificially constructed slides of known ratios. We then developed a more scalable hash-based method of image analysis that was able to distinguish between the pollen of black and white spruce with an estimated accuracy of 83.61%, comparable to human expert performance. Our results demonstrate the capability of machine learning systems to automate challenging taxonomic classifications in pollen analysis, and our success with simple image representations suggests that our approach is generalizable to many other object recognition problems.
format Online
Article
Text
id pubmed-4750970
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-47509702016-02-26 Visual Recognition Software for Binary Classification and Its Application to Spruce Pollen Identification Tcheng, David K. Nayak, Ashwin K. Fowlkes, Charless C. Punyasena, Surangi W. PLoS One Research Article Discriminating between black and white spruce (Picea mariana and Picea glauca) is a difficult palynological classification problem that, if solved, would provide valuable data for paleoclimate reconstructions. We developed an open-source visual recognition software (ARLO, Automated Recognition with Layered Optimization) capable of differentiating between these two species at an accuracy on par with human experts. The system applies pattern recognition and machine learning to the analysis of pollen images and discovers general-purpose image features, defined by simple features of lines and grids of pixels taken at different dimensions, size, spacing, and resolution. It adapts to a given problem by searching for the most effective combination of both feature representation and learning strategy. This results in a powerful and flexible framework for image classification. We worked with images acquired using an automated slide scanner. We first applied a hash-based “pollen spotting” model to segment pollen grains from the slide background. We next tested ARLO’s ability to reconstruct black to white spruce pollen ratios using artificially constructed slides of known ratios. We then developed a more scalable hash-based method of image analysis that was able to distinguish between the pollen of black and white spruce with an estimated accuracy of 83.61%, comparable to human expert performance. Our results demonstrate the capability of machine learning systems to automate challenging taxonomic classifications in pollen analysis, and our success with simple image representations suggests that our approach is generalizable to many other object recognition problems. Public Library of Science 2016-02-11 /pmc/articles/PMC4750970/ /pubmed/26867017 http://dx.doi.org/10.1371/journal.pone.0148879 Text en © 2016 Tcheng et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Tcheng, David K.
Nayak, Ashwin K.
Fowlkes, Charless C.
Punyasena, Surangi W.
Visual Recognition Software for Binary Classification and Its Application to Spruce Pollen Identification
title Visual Recognition Software for Binary Classification and Its Application to Spruce Pollen Identification
title_full Visual Recognition Software for Binary Classification and Its Application to Spruce Pollen Identification
title_fullStr Visual Recognition Software for Binary Classification and Its Application to Spruce Pollen Identification
title_full_unstemmed Visual Recognition Software for Binary Classification and Its Application to Spruce Pollen Identification
title_short Visual Recognition Software for Binary Classification and Its Application to Spruce Pollen Identification
title_sort visual recognition software for binary classification and its application to spruce pollen identification
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4750970/
https://www.ncbi.nlm.nih.gov/pubmed/26867017
http://dx.doi.org/10.1371/journal.pone.0148879
work_keys_str_mv AT tchengdavidk visualrecognitionsoftwareforbinaryclassificationanditsapplicationtosprucepollenidentification
AT nayakashwink visualrecognitionsoftwareforbinaryclassificationanditsapplicationtosprucepollenidentification
AT fowlkescharlessc visualrecognitionsoftwareforbinaryclassificationanditsapplicationtosprucepollenidentification
AT punyasenasurangiw visualrecognitionsoftwareforbinaryclassificationanditsapplicationtosprucepollenidentification