Cargando…
CNN Training with Twenty Samples for Crack Detection via Data Augmentation
The excellent generalization ability of deep learning methods, e.g., convolutional neural networks (CNNs), depends on a large amount of training data, which is difficult to obtain in industrial practices. Data augmentation is regarded commonly as an effective strategy to address this problem. In thi...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7506713/ https://www.ncbi.nlm.nih.gov/pubmed/32867223 http://dx.doi.org/10.3390/s20174849 |
_version_ | 1783585077177876480 |
---|---|
author | Wang, Zirui Yang, Jingjing Jiang, Haonan Fan, Xueling |
author_facet | Wang, Zirui Yang, Jingjing Jiang, Haonan Fan, Xueling |
author_sort | Wang, Zirui |
collection | PubMed |
description | The excellent generalization ability of deep learning methods, e.g., convolutional neural networks (CNNs), depends on a large amount of training data, which is difficult to obtain in industrial practices. Data augmentation is regarded commonly as an effective strategy to address this problem. In this paper, we attempt to construct a crack detector based on CNN with twenty images via a two-stage data augmentation method. In detail, nine data augmentation methods are compared for crack detection in the model training, respectively. As a result, the rotation method outperforms these methods for augmentation, and by an in-depth exploration of the rotation method, the performance of the detector is further improved. Furthermore, data augmentation is also applied in the inference process to improve the recall of trained models. The identical object has more chances to be detected in the series of augmented images. This trick is essentially a performance–resource trade-off. For more improvement with limited resources, the greedy algorithm is adopted for searching a better combination of data augmentation. The results show that the crack detectors trained on the small dataset are significantly improved via the proposed two-stage data augmentation. Specifically, using 20 images for training, recall in detecting the cracks achieves 96% and [Formula: see text] , which is a variant of F-score for crack detection, achieves 91.18%. |
format | Online Article Text |
id | pubmed-7506713 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-75067132020-09-26 CNN Training with Twenty Samples for Crack Detection via Data Augmentation Wang, Zirui Yang, Jingjing Jiang, Haonan Fan, Xueling Sensors (Basel) Article The excellent generalization ability of deep learning methods, e.g., convolutional neural networks (CNNs), depends on a large amount of training data, which is difficult to obtain in industrial practices. Data augmentation is regarded commonly as an effective strategy to address this problem. In this paper, we attempt to construct a crack detector based on CNN with twenty images via a two-stage data augmentation method. In detail, nine data augmentation methods are compared for crack detection in the model training, respectively. As a result, the rotation method outperforms these methods for augmentation, and by an in-depth exploration of the rotation method, the performance of the detector is further improved. Furthermore, data augmentation is also applied in the inference process to improve the recall of trained models. The identical object has more chances to be detected in the series of augmented images. This trick is essentially a performance–resource trade-off. For more improvement with limited resources, the greedy algorithm is adopted for searching a better combination of data augmentation. The results show that the crack detectors trained on the small dataset are significantly improved via the proposed two-stage data augmentation. Specifically, using 20 images for training, recall in detecting the cracks achieves 96% and [Formula: see text] , which is a variant of F-score for crack detection, achieves 91.18%. MDPI 2020-08-27 /pmc/articles/PMC7506713/ /pubmed/32867223 http://dx.doi.org/10.3390/s20174849 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Wang, Zirui Yang, Jingjing Jiang, Haonan Fan, Xueling CNN Training with Twenty Samples for Crack Detection via Data Augmentation |
title | CNN Training with Twenty Samples for Crack Detection via Data Augmentation |
title_full | CNN Training with Twenty Samples for Crack Detection via Data Augmentation |
title_fullStr | CNN Training with Twenty Samples for Crack Detection via Data Augmentation |
title_full_unstemmed | CNN Training with Twenty Samples for Crack Detection via Data Augmentation |
title_short | CNN Training with Twenty Samples for Crack Detection via Data Augmentation |
title_sort | cnn training with twenty samples for crack detection via data augmentation |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7506713/ https://www.ncbi.nlm.nih.gov/pubmed/32867223 http://dx.doi.org/10.3390/s20174849 |
work_keys_str_mv | AT wangzirui cnntrainingwithtwentysamplesforcrackdetectionviadataaugmentation AT yangjingjing cnntrainingwithtwentysamplesforcrackdetectionviadataaugmentation AT jianghaonan cnntrainingwithtwentysamplesforcrackdetectionviadataaugmentation AT fanxueling cnntrainingwithtwentysamplesforcrackdetectionviadataaugmentation |