Cargando…

Fine-grained classification based on multi-scale pyramid convolution networks

The large intra-class variance and small inter-class variance are the key factor affecting fine-grained image classification. Recently, some algorithms have been more accurate and efficient. However, these methods ignore the multi-scale information of the network, resulting in insufficient ability t...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Gaihua, Cheng, Lei, Lin, Jinheng, Dai, Yingying, Zhang, Tianlun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8270455/
https://www.ncbi.nlm.nih.gov/pubmed/34242297
http://dx.doi.org/10.1371/journal.pone.0254054
_version_ 1783720805834686464
author Wang, Gaihua
Cheng, Lei
Lin, Jinheng
Dai, Yingying
Zhang, Tianlun
author_facet Wang, Gaihua
Cheng, Lei
Lin, Jinheng
Dai, Yingying
Zhang, Tianlun
author_sort Wang, Gaihua
collection PubMed
description The large intra-class variance and small inter-class variance are the key factor affecting fine-grained image classification. Recently, some algorithms have been more accurate and efficient. However, these methods ignore the multi-scale information of the network, resulting in insufficient ability to capture subtle changes. To solve this problem, a weakly supervised fine-grained classification network based on multi-scale pyramid is proposed in this paper. It uses pyramid convolution kernel to replace ordinary convolution kernel in residual network, which can expand the receptive field of the convolution kernel and use complementary information of different scales. Meanwhile, the weakly supervised data augmentation network (WS-DAN) is used to prevent over fitting and improve the performance of the model. In addition, a new attention module, which includes spatial attention and channel attention, is introduced to pay more attention to the object part in the image. The comprehensive experiments are carried out on three public benchmarks. It shows that the proposed method can extract subtle feature and achieve classification effectively.
format Online
Article
Text
id pubmed-8270455
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-82704552021-07-21 Fine-grained classification based on multi-scale pyramid convolution networks Wang, Gaihua Cheng, Lei Lin, Jinheng Dai, Yingying Zhang, Tianlun PLoS One Research Article The large intra-class variance and small inter-class variance are the key factor affecting fine-grained image classification. Recently, some algorithms have been more accurate and efficient. However, these methods ignore the multi-scale information of the network, resulting in insufficient ability to capture subtle changes. To solve this problem, a weakly supervised fine-grained classification network based on multi-scale pyramid is proposed in this paper. It uses pyramid convolution kernel to replace ordinary convolution kernel in residual network, which can expand the receptive field of the convolution kernel and use complementary information of different scales. Meanwhile, the weakly supervised data augmentation network (WS-DAN) is used to prevent over fitting and improve the performance of the model. In addition, a new attention module, which includes spatial attention and channel attention, is introduced to pay more attention to the object part in the image. The comprehensive experiments are carried out on three public benchmarks. It shows that the proposed method can extract subtle feature and achieve classification effectively. Public Library of Science 2021-07-09 /pmc/articles/PMC8270455/ /pubmed/34242297 http://dx.doi.org/10.1371/journal.pone.0254054 Text en © 2021 Wang et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Wang, Gaihua
Cheng, Lei
Lin, Jinheng
Dai, Yingying
Zhang, Tianlun
Fine-grained classification based on multi-scale pyramid convolution networks
title Fine-grained classification based on multi-scale pyramid convolution networks
title_full Fine-grained classification based on multi-scale pyramid convolution networks
title_fullStr Fine-grained classification based on multi-scale pyramid convolution networks
title_full_unstemmed Fine-grained classification based on multi-scale pyramid convolution networks
title_short Fine-grained classification based on multi-scale pyramid convolution networks
title_sort fine-grained classification based on multi-scale pyramid convolution networks
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8270455/
https://www.ncbi.nlm.nih.gov/pubmed/34242297
http://dx.doi.org/10.1371/journal.pone.0254054
work_keys_str_mv AT wanggaihua finegrainedclassificationbasedonmultiscalepyramidconvolutionnetworks
AT chenglei finegrainedclassificationbasedonmultiscalepyramidconvolutionnetworks
AT linjinheng finegrainedclassificationbasedonmultiscalepyramidconvolutionnetworks
AT daiyingying finegrainedclassificationbasedonmultiscalepyramidconvolutionnetworks
AT zhangtianlun finegrainedclassificationbasedonmultiscalepyramidconvolutionnetworks