Cargando…
Fuzzy Overclustering: Semi-Supervised Classification of Fuzzy Labels with Overclustering and Inverse Cross-Entropy
Deep learning has been successfully applied to many classification problems including underwater challenges. However, a long-standing issue with deep learning is the need for large and consistently labeled datasets. Although current approaches in semi-supervised learning can decrease the required am...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8512301/ https://www.ncbi.nlm.nih.gov/pubmed/34640981 http://dx.doi.org/10.3390/s21196661 |
_version_ | 1784582957985431552 |
---|---|
author | Schmarje, Lars Brünger, Johannes Santarossa, Monty Schröder, Simon-Martin Kiko, Rainer Koch, Reinhard |
author_facet | Schmarje, Lars Brünger, Johannes Santarossa, Monty Schröder, Simon-Martin Kiko, Rainer Koch, Reinhard |
author_sort | Schmarje, Lars |
collection | PubMed |
description | Deep learning has been successfully applied to many classification problems including underwater challenges. However, a long-standing issue with deep learning is the need for large and consistently labeled datasets. Although current approaches in semi-supervised learning can decrease the required amount of annotated data by a factor of 10 or even more, this line of research still uses distinct classes. For underwater classification, and uncurated real-world datasets in general, clean class boundaries can often not be given due to a limited information content in the images and transitional stages of the depicted objects. This leads to different experts having different opinions and thus producing fuzzy labels which could also be considered ambiguous or divergent. We propose a novel framework for handling semi-supervised classifications of such fuzzy labels. It is based on the idea of overclustering to detect substructures in these fuzzy labels. We propose a novel loss to improve the overclustering capability of our framework and show the benefit of overclustering for fuzzy labels. We show that our framework is superior to previous state-of-the-art semi-supervised methods when applied to real-world plankton data with fuzzy labels. Moreover, we acquire 5 to 10% more consistent predictions of substructures. |
format | Online Article Text |
id | pubmed-8512301 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-85123012021-10-14 Fuzzy Overclustering: Semi-Supervised Classification of Fuzzy Labels with Overclustering and Inverse Cross-Entropy Schmarje, Lars Brünger, Johannes Santarossa, Monty Schröder, Simon-Martin Kiko, Rainer Koch, Reinhard Sensors (Basel) Article Deep learning has been successfully applied to many classification problems including underwater challenges. However, a long-standing issue with deep learning is the need for large and consistently labeled datasets. Although current approaches in semi-supervised learning can decrease the required amount of annotated data by a factor of 10 or even more, this line of research still uses distinct classes. For underwater classification, and uncurated real-world datasets in general, clean class boundaries can often not be given due to a limited information content in the images and transitional stages of the depicted objects. This leads to different experts having different opinions and thus producing fuzzy labels which could also be considered ambiguous or divergent. We propose a novel framework for handling semi-supervised classifications of such fuzzy labels. It is based on the idea of overclustering to detect substructures in these fuzzy labels. We propose a novel loss to improve the overclustering capability of our framework and show the benefit of overclustering for fuzzy labels. We show that our framework is superior to previous state-of-the-art semi-supervised methods when applied to real-world plankton data with fuzzy labels. Moreover, we acquire 5 to 10% more consistent predictions of substructures. MDPI 2021-10-07 /pmc/articles/PMC8512301/ /pubmed/34640981 http://dx.doi.org/10.3390/s21196661 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Schmarje, Lars Brünger, Johannes Santarossa, Monty Schröder, Simon-Martin Kiko, Rainer Koch, Reinhard Fuzzy Overclustering: Semi-Supervised Classification of Fuzzy Labels with Overclustering and Inverse Cross-Entropy |
title | Fuzzy Overclustering: Semi-Supervised Classification of Fuzzy Labels with Overclustering and Inverse Cross-Entropy |
title_full | Fuzzy Overclustering: Semi-Supervised Classification of Fuzzy Labels with Overclustering and Inverse Cross-Entropy |
title_fullStr | Fuzzy Overclustering: Semi-Supervised Classification of Fuzzy Labels with Overclustering and Inverse Cross-Entropy |
title_full_unstemmed | Fuzzy Overclustering: Semi-Supervised Classification of Fuzzy Labels with Overclustering and Inverse Cross-Entropy |
title_short | Fuzzy Overclustering: Semi-Supervised Classification of Fuzzy Labels with Overclustering and Inverse Cross-Entropy |
title_sort | fuzzy overclustering: semi-supervised classification of fuzzy labels with overclustering and inverse cross-entropy |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8512301/ https://www.ncbi.nlm.nih.gov/pubmed/34640981 http://dx.doi.org/10.3390/s21196661 |
work_keys_str_mv | AT schmarjelars fuzzyoverclusteringsemisupervisedclassificationoffuzzylabelswithoverclusteringandinversecrossentropy AT brungerjohannes fuzzyoverclusteringsemisupervisedclassificationoffuzzylabelswithoverclusteringandinversecrossentropy AT santarossamonty fuzzyoverclusteringsemisupervisedclassificationoffuzzylabelswithoverclusteringandinversecrossentropy AT schrodersimonmartin fuzzyoverclusteringsemisupervisedclassificationoffuzzylabelswithoverclusteringandinversecrossentropy AT kikorainer fuzzyoverclusteringsemisupervisedclassificationoffuzzylabelswithoverclusteringandinversecrossentropy AT kochreinhard fuzzyoverclusteringsemisupervisedclassificationoffuzzylabelswithoverclusteringandinversecrossentropy |