Cargando…

CNN Training with Twenty Samples for Crack Detection via Data Augmentation

The excellent generalization ability of deep learning methods, e.g., convolutional neural networks (CNNs), depends on a large amount of training data, which is difficult to obtain in industrial practices. Data augmentation is regarded commonly as an effective strategy to address this problem. In thi...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Zirui, Yang, Jingjing, Jiang, Haonan, Fan, Xueling
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7506713/
https://www.ncbi.nlm.nih.gov/pubmed/32867223
http://dx.doi.org/10.3390/s20174849
_version_ 1783585077177876480
author Wang, Zirui
Yang, Jingjing
Jiang, Haonan
Fan, Xueling
author_facet Wang, Zirui
Yang, Jingjing
Jiang, Haonan
Fan, Xueling
author_sort Wang, Zirui
collection PubMed
description The excellent generalization ability of deep learning methods, e.g., convolutional neural networks (CNNs), depends on a large amount of training data, which is difficult to obtain in industrial practices. Data augmentation is regarded commonly as an effective strategy to address this problem. In this paper, we attempt to construct a crack detector based on CNN with twenty images via a two-stage data augmentation method. In detail, nine data augmentation methods are compared for crack detection in the model training, respectively. As a result, the rotation method outperforms these methods for augmentation, and by an in-depth exploration of the rotation method, the performance of the detector is further improved. Furthermore, data augmentation is also applied in the inference process to improve the recall of trained models. The identical object has more chances to be detected in the series of augmented images. This trick is essentially a performance–resource trade-off. For more improvement with limited resources, the greedy algorithm is adopted for searching a better combination of data augmentation. The results show that the crack detectors trained on the small dataset are significantly improved via the proposed two-stage data augmentation. Specifically, using 20 images for training, recall in detecting the cracks achieves 96% and [Formula: see text] , which is a variant of F-score for crack detection, achieves 91.18%.
format Online
Article
Text
id pubmed-7506713
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75067132020-09-26 CNN Training with Twenty Samples for Crack Detection via Data Augmentation Wang, Zirui Yang, Jingjing Jiang, Haonan Fan, Xueling Sensors (Basel) Article The excellent generalization ability of deep learning methods, e.g., convolutional neural networks (CNNs), depends on a large amount of training data, which is difficult to obtain in industrial practices. Data augmentation is regarded commonly as an effective strategy to address this problem. In this paper, we attempt to construct a crack detector based on CNN with twenty images via a two-stage data augmentation method. In detail, nine data augmentation methods are compared for crack detection in the model training, respectively. As a result, the rotation method outperforms these methods for augmentation, and by an in-depth exploration of the rotation method, the performance of the detector is further improved. Furthermore, data augmentation is also applied in the inference process to improve the recall of trained models. The identical object has more chances to be detected in the series of augmented images. This trick is essentially a performance–resource trade-off. For more improvement with limited resources, the greedy algorithm is adopted for searching a better combination of data augmentation. The results show that the crack detectors trained on the small dataset are significantly improved via the proposed two-stage data augmentation. Specifically, using 20 images for training, recall in detecting the cracks achieves 96% and [Formula: see text] , which is a variant of F-score for crack detection, achieves 91.18%. MDPI 2020-08-27 /pmc/articles/PMC7506713/ /pubmed/32867223 http://dx.doi.org/10.3390/s20174849 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Wang, Zirui
Yang, Jingjing
Jiang, Haonan
Fan, Xueling
CNN Training with Twenty Samples for Crack Detection via Data Augmentation
title CNN Training with Twenty Samples for Crack Detection via Data Augmentation
title_full CNN Training with Twenty Samples for Crack Detection via Data Augmentation
title_fullStr CNN Training with Twenty Samples for Crack Detection via Data Augmentation
title_full_unstemmed CNN Training with Twenty Samples for Crack Detection via Data Augmentation
title_short CNN Training with Twenty Samples for Crack Detection via Data Augmentation
title_sort cnn training with twenty samples for crack detection via data augmentation
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7506713/
https://www.ncbi.nlm.nih.gov/pubmed/32867223
http://dx.doi.org/10.3390/s20174849
work_keys_str_mv AT wangzirui cnntrainingwithtwentysamplesforcrackdetectionviadataaugmentation
AT yangjingjing cnntrainingwithtwentysamplesforcrackdetectionviadataaugmentation
AT jianghaonan cnntrainingwithtwentysamplesforcrackdetectionviadataaugmentation
AT fanxueling cnntrainingwithtwentysamplesforcrackdetectionviadataaugmentation