Cargando…

MorphoCluster: Efficient Annotation of Plankton Images by Clustering

In this work, we present MorphoCluster, a software tool for data-driven, fast, and accurate annotation of large image data sets. While already having surpassed the annotation rate of human experts, volume and complexity of marine data will continue to increase in the coming years. Still, this data r...

Descripción completa

Detalles Bibliográficos
Autores principales: Schröder, Simon-Martin, Kiko, Rainer, Koch, Reinhard
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7308937/
https://www.ncbi.nlm.nih.gov/pubmed/32481730
http://dx.doi.org/10.3390/s20113060
_version_ 1783549108192018432
author Schröder, Simon-Martin
Kiko, Rainer
Koch, Reinhard
author_facet Schröder, Simon-Martin
Kiko, Rainer
Koch, Reinhard
author_sort Schröder, Simon-Martin
collection PubMed
description In this work, we present MorphoCluster, a software tool for data-driven, fast, and accurate annotation of large image data sets. While already having surpassed the annotation rate of human experts, volume and complexity of marine data will continue to increase in the coming years. Still, this data requires interpretation. MorphoCluster augments the human ability to discover patterns and perform object classification in large amounts of data by embedding unsupervised clustering in an interactive process. By aggregating similar images into clusters, our novel approach to image annotation increases consistency, multiplies the throughput of an annotator, and allows experts to adapt the granularity of their sorting scheme to the structure in the data. By sorting a set of 1.2 M objects into 280 data-driven classes in 71 h (16 k objects per hour), with 90% of these classes having a precision of 0.889 or higher. This shows that MorphoCluster is at the same time fast, accurate, and consistent; provides a fine-grained and data-driven classification; and enables novelty detection.
format Online
Article
Text
id pubmed-7308937
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-73089372020-06-25 MorphoCluster: Efficient Annotation of Plankton Images by Clustering Schröder, Simon-Martin Kiko, Rainer Koch, Reinhard Sensors (Basel) Article In this work, we present MorphoCluster, a software tool for data-driven, fast, and accurate annotation of large image data sets. While already having surpassed the annotation rate of human experts, volume and complexity of marine data will continue to increase in the coming years. Still, this data requires interpretation. MorphoCluster augments the human ability to discover patterns and perform object classification in large amounts of data by embedding unsupervised clustering in an interactive process. By aggregating similar images into clusters, our novel approach to image annotation increases consistency, multiplies the throughput of an annotator, and allows experts to adapt the granularity of their sorting scheme to the structure in the data. By sorting a set of 1.2 M objects into 280 data-driven classes in 71 h (16 k objects per hour), with 90% of these classes having a precision of 0.889 or higher. This shows that MorphoCluster is at the same time fast, accurate, and consistent; provides a fine-grained and data-driven classification; and enables novelty detection. MDPI 2020-05-28 /pmc/articles/PMC7308937/ /pubmed/32481730 http://dx.doi.org/10.3390/s20113060 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Schröder, Simon-Martin
Kiko, Rainer
Koch, Reinhard
MorphoCluster: Efficient Annotation of Plankton Images by Clustering
title MorphoCluster: Efficient Annotation of Plankton Images by Clustering
title_full MorphoCluster: Efficient Annotation of Plankton Images by Clustering
title_fullStr MorphoCluster: Efficient Annotation of Plankton Images by Clustering
title_full_unstemmed MorphoCluster: Efficient Annotation of Plankton Images by Clustering
title_short MorphoCluster: Efficient Annotation of Plankton Images by Clustering
title_sort morphocluster: efficient annotation of plankton images by clustering
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7308937/
https://www.ncbi.nlm.nih.gov/pubmed/32481730
http://dx.doi.org/10.3390/s20113060
work_keys_str_mv AT schrodersimonmartin morphoclusterefficientannotationofplanktonimagesbyclustering
AT kikorainer morphoclusterefficientannotationofplanktonimagesbyclustering
AT kochreinhard morphoclusterefficientannotationofplanktonimagesbyclustering