Cargando…

CNN Training with Twenty Samples for Crack Detection via Data Augmentation

The excellent generalization ability of deep learning methods, e.g., convolutional neural networks (CNNs), depends on a large amount of training data, which is difficult to obtain in industrial practices. Data augmentation is regarded commonly as an effective strategy to address this problem. In thi...

Descripción completa

Detalles Bibliográficos
Autores principales:	Wang, Zirui, Yang, Jingjing, Jiang, Haonan, Fan, Xueling
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2020
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7506713/ https://www.ncbi.nlm.nih.gov/pubmed/32867223 http://dx.doi.org/10.3390/s20174849

_version_	1783585077177876480
author	Wang, Zirui Yang, Jingjing Jiang, Haonan Fan, Xueling
author_facet	Wang, Zirui Yang, Jingjing Jiang, Haonan Fan, Xueling
author_sort	Wang, Zirui
collection	PubMed
description	The excellent generalization ability of deep learning methods, e.g., convolutional neural networks (CNNs), depends on a large amount of training data, which is difficult to obtain in industrial practices. Data augmentation is regarded commonly as an effective strategy to address this problem. In this paper, we attempt to construct a crack detector based on CNN with twenty images via a two-stage data augmentation method. In detail, nine data augmentation methods are compared for crack detection in the model training, respectively. As a result, the rotation method outperforms these methods for augmentation, and by an in-depth exploration of the rotation method, the performance of the detector is further improved. Furthermore, data augmentation is also applied in the inference process to improve the recall of trained models. The identical object has more chances to be detected in the series of augmented images. This trick is essentially a performance–resource trade-off. For more improvement with limited resources, the greedy algorithm is adopted for searching a better combination of data augmentation. The results show that the crack detectors trained on the small dataset are significantly improved via the proposed two-stage data augmentation. Specifically, using 20 images for training, recall in detecting the cracks achieves 96% and [Formula: see text] , which is a variant of F-score for crack detection, achieves 91.18%.
format	Online Article Text
id	pubmed-7506713
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-75067132020-09-26 CNN Training with Twenty Samples for Crack Detection via Data Augmentation Wang, Zirui Yang, Jingjing Jiang, Haonan Fan, Xueling Sensors (Basel) Article The excellent generalization ability of deep learning methods, e.g., convolutional neural networks (CNNs), depends on a large amount of training data, which is difficult to obtain in industrial practices. Data augmentation is regarded commonly as an effective strategy to address this problem. In this paper, we attempt to construct a crack detector based on CNN with twenty images via a two-stage data augmentation method. In detail, nine data augmentation methods are compared for crack detection in the model training, respectively. As a result, the rotation method outperforms these methods for augmentation, and by an in-depth exploration of the rotation method, the performance of the detector is further improved. Furthermore, data augmentation is also applied in the inference process to improve the recall of trained models. The identical object has more chances to be detected in the series of augmented images. This trick is essentially a performance–resource trade-off. For more improvement with limited resources, the greedy algorithm is adopted for searching a better combination of data augmentation. The results show that the crack detectors trained on the small dataset are significantly improved via the proposed two-stage data augmentation. Specifically, using 20 images for training, recall in detecting the cracks achieves 96% and [Formula: see text] , which is a variant of F-score for crack detection, achieves 91.18%. MDPI 2020-08-27 /pmc/articles/PMC7506713/ /pubmed/32867223 http://dx.doi.org/10.3390/s20174849 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Wang, Zirui Yang, Jingjing Jiang, Haonan Fan, Xueling CNN Training with Twenty Samples for Crack Detection via Data Augmentation
title	CNN Training with Twenty Samples for Crack Detection via Data Augmentation
title_full	CNN Training with Twenty Samples for Crack Detection via Data Augmentation
title_fullStr	CNN Training with Twenty Samples for Crack Detection via Data Augmentation
title_full_unstemmed	CNN Training with Twenty Samples for Crack Detection via Data Augmentation
title_short	CNN Training with Twenty Samples for Crack Detection via Data Augmentation
title_sort	cnn training with twenty samples for crack detection via data augmentation
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7506713/ https://www.ncbi.nlm.nih.gov/pubmed/32867223 http://dx.doi.org/10.3390/s20174849
work_keys_str_mv	AT wangzirui cnntrainingwithtwentysamplesforcrackdetectionviadataaugmentation AT yangjingjing cnntrainingwithtwentysamplesforcrackdetectionviadataaugmentation AT jianghaonan cnntrainingwithtwentysamplesforcrackdetectionviadataaugmentation AT fanxueling cnntrainingwithtwentysamplesforcrackdetectionviadataaugmentation

CNN Training with Twenty Samples for Crack Detection via Data Augmentation

Ejemplares similares