Cargando…

Long-Range Dependence Involutional Network for Logo Detection

Logo detection is one of the crucial branches in computer vision due to various real-world applications, such as automatic logo detection and recognition, intelligent transportation, and trademark infringement detection. Compared with traditional handcrafted-feature-based methods, deep learning-base...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Xingzhuo, Hou, Sujuan, Zhang, Baisong, Wang, Jing, Jia, Weikuan, Zheng, Yuanjie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9857861/
https://www.ncbi.nlm.nih.gov/pubmed/36673315
http://dx.doi.org/10.3390/e25010174
_version_ 1784873954265006080
author Li, Xingzhuo
Hou, Sujuan
Zhang, Baisong
Wang, Jing
Jia, Weikuan
Zheng, Yuanjie
author_facet Li, Xingzhuo
Hou, Sujuan
Zhang, Baisong
Wang, Jing
Jia, Weikuan
Zheng, Yuanjie
author_sort Li, Xingzhuo
collection PubMed
description Logo detection is one of the crucial branches in computer vision due to various real-world applications, such as automatic logo detection and recognition, intelligent transportation, and trademark infringement detection. Compared with traditional handcrafted-feature-based methods, deep learning-based convolutional neural networks (CNNs) can learn both low-level and high-level image features. Recent decades have witnessed the great feature representation capabilities of deep CNNs and their variants, which have been very good at discovering intricate structures in high-dimensional data and are thereby applicable to many domains including logo detection. However, logo detection remains challenging, as existing detection methods cannot solve well the problems of a multiscale and large aspect ratios. In this paper, we tackle these challenges by developing a novel long-range dependence involutional network (LDI-Net). Specifically, we designed a strategy that combines a new operator and a self-attention mechanism via rethinking the intrinsic principle of convolution called long-range dependence involution (LD involution) to alleviate the detection difficulties caused by large aspect ratios. We also introduce a multilevel representation neural architecture search (MRNAS) to detect multiscale logo objects by constructing a novel multipath topology. In addition, we implemented an adaptive RoI pooling module (ARM) to improve detection efficiency by addressing the problem of logo deformation. Comprehensive experiments on four benchmark logo datasets demonstrate the effectiveness and efficiency of the proposed approach.
format Online
Article
Text
id pubmed-9857861
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-98578612023-01-21 Long-Range Dependence Involutional Network for Logo Detection Li, Xingzhuo Hou, Sujuan Zhang, Baisong Wang, Jing Jia, Weikuan Zheng, Yuanjie Entropy (Basel) Article Logo detection is one of the crucial branches in computer vision due to various real-world applications, such as automatic logo detection and recognition, intelligent transportation, and trademark infringement detection. Compared with traditional handcrafted-feature-based methods, deep learning-based convolutional neural networks (CNNs) can learn both low-level and high-level image features. Recent decades have witnessed the great feature representation capabilities of deep CNNs and their variants, which have been very good at discovering intricate structures in high-dimensional data and are thereby applicable to many domains including logo detection. However, logo detection remains challenging, as existing detection methods cannot solve well the problems of a multiscale and large aspect ratios. In this paper, we tackle these challenges by developing a novel long-range dependence involutional network (LDI-Net). Specifically, we designed a strategy that combines a new operator and a self-attention mechanism via rethinking the intrinsic principle of convolution called long-range dependence involution (LD involution) to alleviate the detection difficulties caused by large aspect ratios. We also introduce a multilevel representation neural architecture search (MRNAS) to detect multiscale logo objects by constructing a novel multipath topology. In addition, we implemented an adaptive RoI pooling module (ARM) to improve detection efficiency by addressing the problem of logo deformation. Comprehensive experiments on four benchmark logo datasets demonstrate the effectiveness and efficiency of the proposed approach. MDPI 2023-01-15 /pmc/articles/PMC9857861/ /pubmed/36673315 http://dx.doi.org/10.3390/e25010174 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Li, Xingzhuo
Hou, Sujuan
Zhang, Baisong
Wang, Jing
Jia, Weikuan
Zheng, Yuanjie
Long-Range Dependence Involutional Network for Logo Detection
title Long-Range Dependence Involutional Network for Logo Detection
title_full Long-Range Dependence Involutional Network for Logo Detection
title_fullStr Long-Range Dependence Involutional Network for Logo Detection
title_full_unstemmed Long-Range Dependence Involutional Network for Logo Detection
title_short Long-Range Dependence Involutional Network for Logo Detection
title_sort long-range dependence involutional network for logo detection
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9857861/
https://www.ncbi.nlm.nih.gov/pubmed/36673315
http://dx.doi.org/10.3390/e25010174
work_keys_str_mv AT lixingzhuo longrangedependenceinvolutionalnetworkforlogodetection
AT housujuan longrangedependenceinvolutionalnetworkforlogodetection
AT zhangbaisong longrangedependenceinvolutionalnetworkforlogodetection
AT wangjing longrangedependenceinvolutionalnetworkforlogodetection
AT jiaweikuan longrangedependenceinvolutionalnetworkforlogodetection
AT zhengyuanjie longrangedependenceinvolutionalnetworkforlogodetection