
Object detectors involving a NAS-gate convolutional module and capsule attention module

Several state-of-the-art object detectors have demonstrated outstanding performance by optimizing feature representation through modification of the backbone architecture and exploitation of a feature pyramid. To determine the effectiveness of this approach, we explore the modification of object de...


Bibliographic Details
Main Authors:	Viriyasaranon, Thanaporn, Choi, Jang-Hwan
Format:	Online Article Text
Language:	English
Published:	Nature Publishing Group UK 2022
Subjects:	Article
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8913793/ https://www.ncbi.nlm.nih.gov/pubmed/35273256 http://dx.doi.org/10.1038/s41598-022-07898-7

author	Viriyasaranon, Thanaporn Choi, Jang-Hwan
author_sort	Viriyasaranon, Thanaporn
collection	PubMed
description	Several state-of-the-art object detectors have demonstrated outstanding performance by optimizing feature representation through modification of the backbone architecture and exploitation of a feature pyramid. To determine the effectiveness of this approach, we explore the modification of object detectors’ backbone and feature pyramid by utilizing Neural Architecture Search (NAS) and Capsule Network. We introduce two modules, namely, the NAS-gate convolutional module and the Capsule Attention module. The NAS-gate convolutional module optimizes standard convolution in a backbone network based on differentiable architecture search in cooperation with multiple convolution conditions to overcome object scale variation problems. The Capsule Attention module exploits the strong spatial relationship encoding ability of the capsule network to generate a spatial attention mask, which emphasizes important features and suppresses unnecessary features in the feature pyramid, in order to optimize the feature representation and localization capability of the detectors. Experimental results indicate that the NAS-gate convolutional module can alleviate the object scale variation problem and the Capsule Attention network can help to avoid inaccurate localization. Next, we introduce NASGC-CapANet, which incorporates the two modules, i.e., the NAS-gate convolutional module and the Capsule Attention module. Results of comparisons against state-of-the-art object detectors on the MS COCO val-2017 dataset demonstrate that NASGC-CapANet-based Faster R-CNN significantly outperforms the baseline Faster R-CNN with a ResNet-50 backbone and a ResNet-101 backbone by mAPs of 2.7% and 2.0%, respectively. Furthermore, the NASGC-CapANet-based Cascade R-CNN achieves a box mAP of 43.8% on the MS COCO test-dev dataset.
format	Online Article Text
id	pubmed-8913793
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Nature Publishing Group UK
record_format	MEDLINE/PubMed
spelling	pubmed-8913793 2022-03-14 Object detectors involving a NAS-gate convolutional module and capsule attention module Viriyasaranon, Thanaporn Choi, Jang-Hwan Sci Rep Article Nature Publishing Group UK 2022-03-10 /pmc/articles/PMC8913793/ /pubmed/35273256 http://dx.doi.org/10.1038/s41598-022-07898-7 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/ Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
title	Object detectors involving a NAS-gate convolutional module and capsule attention module
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8913793/ https://www.ncbi.nlm.nih.gov/pubmed/35273256 http://dx.doi.org/10.1038/s41598-022-07898-7
