Cargando…

The Impact of Data Augmentations on Deep Learning-Based Marine Object Classification in Benthic Image Transects

Data augmentation is an established technique in computer vision to foster the generalization of training and to deal with low data volume. Most data augmentation and computer vision research are focused on everyday images such as traffic data. The application of computer vision techniques in domain...

Descripción completa

Detalles Bibliográficos
Autores principales: Tan, Mingkun, Langenkämper, Daniel, Nattkemper, Tim W.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9322900/
https://www.ncbi.nlm.nih.gov/pubmed/35891060
http://dx.doi.org/10.3390/s22145383
_version_ 1784756416591953920
author Tan, Mingkun
Langenkämper, Daniel
Nattkemper, Tim W.
author_facet Tan, Mingkun
Langenkämper, Daniel
Nattkemper, Tim W.
author_sort Tan, Mingkun
collection PubMed
description Data augmentation is an established technique in computer vision to foster the generalization of training and to deal with low data volume. Most data augmentation and computer vision research are focused on everyday images such as traffic data. The application of computer vision techniques in domains like marine sciences has shown to be not that straightforward in the past due to special characteristics, such as very low data volume and class imbalance, because of costly manual annotation by human domain experts, and general low species abundances. However, the data volume acquired today with moving platforms to collect large image collections from remote marine habitats, like the deep benthos, for marine biodiversity assessment and monitoring makes the use of computer vision automatic detection and classification inevitable. In this work, we investigate the effect of data augmentation in the context of taxonomic classification in underwater, i.e., benthic images. First, we show that established data augmentation methods (i.e., geometric and photometric transformations) perform differently in marine image collections compared to established image collections like the Cityscapes dataset, showing everyday traffic images. Some of the methods even decrease the learning performance when applied to marine image collections. Second, we propose new data augmentation combination policies motivated by our observations and compare their effect to those proposed by the AutoAugment algorithm and can show that the proposed augmentation policy outperforms the AutoAugment results for marine image collections. We conclude that in the case of small marine image datasets, background knowledge, and heuristics should sometimes be applied to design an effective data augmentation method.
format Online
Article
Text
id pubmed-9322900
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-93229002022-07-27 The Impact of Data Augmentations on Deep Learning-Based Marine Object Classification in Benthic Image Transects Tan, Mingkun Langenkämper, Daniel Nattkemper, Tim W. Sensors (Basel) Article Data augmentation is an established technique in computer vision to foster the generalization of training and to deal with low data volume. Most data augmentation and computer vision research are focused on everyday images such as traffic data. The application of computer vision techniques in domains like marine sciences has shown to be not that straightforward in the past due to special characteristics, such as very low data volume and class imbalance, because of costly manual annotation by human domain experts, and general low species abundances. However, the data volume acquired today with moving platforms to collect large image collections from remote marine habitats, like the deep benthos, for marine biodiversity assessment and monitoring makes the use of computer vision automatic detection and classification inevitable. In this work, we investigate the effect of data augmentation in the context of taxonomic classification in underwater, i.e., benthic images. First, we show that established data augmentation methods (i.e., geometric and photometric transformations) perform differently in marine image collections compared to established image collections like the Cityscapes dataset, showing everyday traffic images. Some of the methods even decrease the learning performance when applied to marine image collections. Second, we propose new data augmentation combination policies motivated by our observations and compare their effect to those proposed by the AutoAugment algorithm and can show that the proposed augmentation policy outperforms the AutoAugment results for marine image collections. We conclude that in the case of small marine image datasets, background knowledge, and heuristics should sometimes be applied to design an effective data augmentation method. MDPI 2022-07-19 /pmc/articles/PMC9322900/ /pubmed/35891060 http://dx.doi.org/10.3390/s22145383 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Tan, Mingkun
Langenkämper, Daniel
Nattkemper, Tim W.
The Impact of Data Augmentations on Deep Learning-Based Marine Object Classification in Benthic Image Transects
title The Impact of Data Augmentations on Deep Learning-Based Marine Object Classification in Benthic Image Transects
title_full The Impact of Data Augmentations on Deep Learning-Based Marine Object Classification in Benthic Image Transects
title_fullStr The Impact of Data Augmentations on Deep Learning-Based Marine Object Classification in Benthic Image Transects
title_full_unstemmed The Impact of Data Augmentations on Deep Learning-Based Marine Object Classification in Benthic Image Transects
title_short The Impact of Data Augmentations on Deep Learning-Based Marine Object Classification in Benthic Image Transects
title_sort impact of data augmentations on deep learning-based marine object classification in benthic image transects
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9322900/
https://www.ncbi.nlm.nih.gov/pubmed/35891060
http://dx.doi.org/10.3390/s22145383
work_keys_str_mv AT tanmingkun theimpactofdataaugmentationsondeeplearningbasedmarineobjectclassificationinbenthicimagetransects
AT langenkamperdaniel theimpactofdataaugmentationsondeeplearningbasedmarineobjectclassificationinbenthicimagetransects
AT nattkempertimw theimpactofdataaugmentationsondeeplearningbasedmarineobjectclassificationinbenthicimagetransects
AT tanmingkun impactofdataaugmentationsondeeplearningbasedmarineobjectclassificationinbenthicimagetransects
AT langenkamperdaniel impactofdataaugmentationsondeeplearningbasedmarineobjectclassificationinbenthicimagetransects
AT nattkempertimw impactofdataaugmentationsondeeplearningbasedmarineobjectclassificationinbenthicimagetransects