Cargando…

An Efficient End-to-End Multitask Network Architecture for Defect Inspection

Recently, computer vision-based methods have been successfully applied in many industrial fields. Nevertheless, automated detection of steel surface defects remains a challenge due to the complexity of surface defects. To solve this problem, many models have been proposed, but these models are not g...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zhang, Chunguang, Yang, Heqiu, Ma, Jun, Chen, Huayue
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9785184/ https://www.ncbi.nlm.nih.gov/pubmed/36560212 http://dx.doi.org/10.3390/s22249845

_version_	1784857987263758336
author	Zhang, Chunguang Yang, Heqiu Ma, Jun Chen, Huayue
author_facet	Zhang, Chunguang Yang, Heqiu Ma, Jun Chen, Huayue
author_sort	Zhang, Chunguang
collection	PubMed
description	Recently, computer vision-based methods have been successfully applied in many industrial fields. Nevertheless, automated detection of steel surface defects remains a challenge due to the complexity of surface defects. To solve this problem, many models have been proposed, but these models are not good enough to detect all defects. After analyzing the previous research, we believe that the single-task network cannot fully meet the actual detection needs owing to its own characteristics. To address this problem, an end-to-end multi-task network has been proposed. It consists of one encoder and two decoders. The encoder is used for feature extraction, and the two decoders are used for object detection and semantic segmentation, respectively. In an effort to deal with the challenge of changing defect scales, we propose the Depthwise Separable Atrous Spatial Pyramid Pooling module. This module can obtain dense multi-scale features at a very low computational cost. After that, Residually Connected Depthwise Separable Atrous Convolutional Blocks are used to extract spatial information under low computation for better segmentation prediction. Furthermore, we investigate the impact of training strategies on network performance. The performance of the network can be optimized by adopting the strategy of training the segmentation task first and using the deep supervision training method. At length, the advantages of object detection and semantic segmentation are tactfully combined. Our model achieves mIOU 79.37% and mAP@0.5 78.38% on the NEU dataset. Comparative experiments demonstrate that this method has apparent advantages over other models. Meanwhile, the speed of detection amount to 85.6 FPS on a single GPU, which is acceptable in the practical detection process.
format	Online Article Text
id	pubmed-9785184
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-97851842022-12-24 An Efficient End-to-End Multitask Network Architecture for Defect Inspection Zhang, Chunguang Yang, Heqiu Ma, Jun Chen, Huayue Sensors (Basel) Article Recently, computer vision-based methods have been successfully applied in many industrial fields. Nevertheless, automated detection of steel surface defects remains a challenge due to the complexity of surface defects. To solve this problem, many models have been proposed, but these models are not good enough to detect all defects. After analyzing the previous research, we believe that the single-task network cannot fully meet the actual detection needs owing to its own characteristics. To address this problem, an end-to-end multi-task network has been proposed. It consists of one encoder and two decoders. The encoder is used for feature extraction, and the two decoders are used for object detection and semantic segmentation, respectively. In an effort to deal with the challenge of changing defect scales, we propose the Depthwise Separable Atrous Spatial Pyramid Pooling module. This module can obtain dense multi-scale features at a very low computational cost. After that, Residually Connected Depthwise Separable Atrous Convolutional Blocks are used to extract spatial information under low computation for better segmentation prediction. Furthermore, we investigate the impact of training strategies on network performance. The performance of the network can be optimized by adopting the strategy of training the segmentation task first and using the deep supervision training method. At length, the advantages of object detection and semantic segmentation are tactfully combined. Our model achieves mIOU 79.37% and mAP@0.5 78.38% on the NEU dataset. Comparative experiments demonstrate that this method has apparent advantages over other models. Meanwhile, the speed of detection amount to 85.6 FPS on a single GPU, which is acceptable in the practical detection process. MDPI 2022-12-14 /pmc/articles/PMC9785184/ /pubmed/36560212 http://dx.doi.org/10.3390/s22249845 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Zhang, Chunguang Yang, Heqiu Ma, Jun Chen, Huayue An Efficient End-to-End Multitask Network Architecture for Defect Inspection
title	An Efficient End-to-End Multitask Network Architecture for Defect Inspection
title_full	An Efficient End-to-End Multitask Network Architecture for Defect Inspection
title_fullStr	An Efficient End-to-End Multitask Network Architecture for Defect Inspection
title_full_unstemmed	An Efficient End-to-End Multitask Network Architecture for Defect Inspection
title_short	An Efficient End-to-End Multitask Network Architecture for Defect Inspection
title_sort	efficient end-to-end multitask network architecture for defect inspection
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9785184/ https://www.ncbi.nlm.nih.gov/pubmed/36560212 http://dx.doi.org/10.3390/s22249845
work_keys_str_mv	AT zhangchunguang anefficientendtoendmultitasknetworkarchitecturefordefectinspection AT yangheqiu anefficientendtoendmultitasknetworkarchitecturefordefectinspection AT majun anefficientendtoendmultitasknetworkarchitecturefordefectinspection AT chenhuayue anefficientendtoendmultitasknetworkarchitecturefordefectinspection AT zhangchunguang efficientendtoendmultitasknetworkarchitecturefordefectinspection AT yangheqiu efficientendtoendmultitasknetworkarchitecturefordefectinspection AT majun efficientendtoendmultitasknetworkarchitecturefordefectinspection AT chenhuayue efficientendtoendmultitasknetworkarchitecturefordefectinspection

An Efficient End-to-End Multitask Network Architecture for Defect Inspection

Ejemplares similares