Cargando…

Fuzzy Overclustering: Semi-Supervised Classification of Fuzzy Labels with Overclustering and Inverse Cross-Entropy

Deep learning has been successfully applied to many classification problems including underwater challenges. However, a long-standing issue with deep learning is the need for large and consistently labeled datasets. Although current approaches in semi-supervised learning can decrease the required am...

Descripción completa

Detalles Bibliográficos
Autores principales: Schmarje, Lars, Brünger, Johannes, Santarossa, Monty, Schröder, Simon-Martin, Kiko, Rainer, Koch, Reinhard
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8512301/
https://www.ncbi.nlm.nih.gov/pubmed/34640981
http://dx.doi.org/10.3390/s21196661
_version_ 1784582957985431552
author Schmarje, Lars
Brünger, Johannes
Santarossa, Monty
Schröder, Simon-Martin
Kiko, Rainer
Koch, Reinhard
author_facet Schmarje, Lars
Brünger, Johannes
Santarossa, Monty
Schröder, Simon-Martin
Kiko, Rainer
Koch, Reinhard
author_sort Schmarje, Lars
collection PubMed
description Deep learning has been successfully applied to many classification problems including underwater challenges. However, a long-standing issue with deep learning is the need for large and consistently labeled datasets. Although current approaches in semi-supervised learning can decrease the required amount of annotated data by a factor of 10 or even more, this line of research still uses distinct classes. For underwater classification, and uncurated real-world datasets in general, clean class boundaries can often not be given due to a limited information content in the images and transitional stages of the depicted objects. This leads to different experts having different opinions and thus producing fuzzy labels which could also be considered ambiguous or divergent. We propose a novel framework for handling semi-supervised classifications of such fuzzy labels. It is based on the idea of overclustering to detect substructures in these fuzzy labels. We propose a novel loss to improve the overclustering capability of our framework and show the benefit of overclustering for fuzzy labels. We show that our framework is superior to previous state-of-the-art semi-supervised methods when applied to real-world plankton data with fuzzy labels. Moreover, we acquire 5 to 10% more consistent predictions of substructures.
format Online
Article
Text
id pubmed-8512301
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-85123012021-10-14 Fuzzy Overclustering: Semi-Supervised Classification of Fuzzy Labels with Overclustering and Inverse Cross-Entropy Schmarje, Lars Brünger, Johannes Santarossa, Monty Schröder, Simon-Martin Kiko, Rainer Koch, Reinhard Sensors (Basel) Article Deep learning has been successfully applied to many classification problems including underwater challenges. However, a long-standing issue with deep learning is the need for large and consistently labeled datasets. Although current approaches in semi-supervised learning can decrease the required amount of annotated data by a factor of 10 or even more, this line of research still uses distinct classes. For underwater classification, and uncurated real-world datasets in general, clean class boundaries can often not be given due to a limited information content in the images and transitional stages of the depicted objects. This leads to different experts having different opinions and thus producing fuzzy labels which could also be considered ambiguous or divergent. We propose a novel framework for handling semi-supervised classifications of such fuzzy labels. It is based on the idea of overclustering to detect substructures in these fuzzy labels. We propose a novel loss to improve the overclustering capability of our framework and show the benefit of overclustering for fuzzy labels. We show that our framework is superior to previous state-of-the-art semi-supervised methods when applied to real-world plankton data with fuzzy labels. Moreover, we acquire 5 to 10% more consistent predictions of substructures. MDPI 2021-10-07 /pmc/articles/PMC8512301/ /pubmed/34640981 http://dx.doi.org/10.3390/s21196661 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Schmarje, Lars
Brünger, Johannes
Santarossa, Monty
Schröder, Simon-Martin
Kiko, Rainer
Koch, Reinhard
Fuzzy Overclustering: Semi-Supervised Classification of Fuzzy Labels with Overclustering and Inverse Cross-Entropy
title Fuzzy Overclustering: Semi-Supervised Classification of Fuzzy Labels with Overclustering and Inverse Cross-Entropy
title_full Fuzzy Overclustering: Semi-Supervised Classification of Fuzzy Labels with Overclustering and Inverse Cross-Entropy
title_fullStr Fuzzy Overclustering: Semi-Supervised Classification of Fuzzy Labels with Overclustering and Inverse Cross-Entropy
title_full_unstemmed Fuzzy Overclustering: Semi-Supervised Classification of Fuzzy Labels with Overclustering and Inverse Cross-Entropy
title_short Fuzzy Overclustering: Semi-Supervised Classification of Fuzzy Labels with Overclustering and Inverse Cross-Entropy
title_sort fuzzy overclustering: semi-supervised classification of fuzzy labels with overclustering and inverse cross-entropy
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8512301/
https://www.ncbi.nlm.nih.gov/pubmed/34640981
http://dx.doi.org/10.3390/s21196661
work_keys_str_mv AT schmarjelars fuzzyoverclusteringsemisupervisedclassificationoffuzzylabelswithoverclusteringandinversecrossentropy
AT brungerjohannes fuzzyoverclusteringsemisupervisedclassificationoffuzzylabelswithoverclusteringandinversecrossentropy
AT santarossamonty fuzzyoverclusteringsemisupervisedclassificationoffuzzylabelswithoverclusteringandinversecrossentropy
AT schrodersimonmartin fuzzyoverclusteringsemisupervisedclassificationoffuzzylabelswithoverclusteringandinversecrossentropy
AT kikorainer fuzzyoverclusteringsemisupervisedclassificationoffuzzylabelswithoverclusteringandinversecrossentropy
AT kochreinhard fuzzyoverclusteringsemisupervisedclassificationoffuzzylabelswithoverclusteringandinversecrossentropy