Cargando…

Image local structure information learning for fine-grained visual classification

Learning discriminative visual patterns from image local salient regions is widely used for fine-grained visual classification (FGVC) tasks such as plant or animal species classification. A large number of complex networks have been designed for learning discriminative feature representations. In th...

Descripción completa

Detalles Bibliográficos
Autores principales: Lu, Jin, Zhang, Weichuan, Zhao, Yali, Sun, Changming
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9649701/
https://www.ncbi.nlm.nih.gov/pubmed/36357665
http://dx.doi.org/10.1038/s41598-022-23835-0
_version_ 1784827854491484160
author Lu, Jin
Zhang, Weichuan
Zhao, Yali
Sun, Changming
author_facet Lu, Jin
Zhang, Weichuan
Zhao, Yali
Sun, Changming
author_sort Lu, Jin
collection PubMed
description Learning discriminative visual patterns from image local salient regions is widely used for fine-grained visual classification (FGVC) tasks such as plant or animal species classification. A large number of complex networks have been designed for learning discriminative feature representations. In this paper, we propose a novel local structure information (LSI) learning method for FGVC. Firstly, we indicate that the existing FGVC methods have not properly considered how to extract LSI from an input image for FGVC. Then an LSI extraction technique is introduced which has the ability to properly depict the properties of different local structure features in images. Secondly, a novel LSI learning module is proposed to be added into a given backbone network for enhancing the ability of the network to find salient regions. Thirdly, extensive experiments show that our proposed method achieves better performance on six image datasets. Particularly, the proposed method performs far better on datasets with a limited number of images.
format Online
Article
Text
id pubmed-9649701
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-96497012022-11-15 Image local structure information learning for fine-grained visual classification Lu, Jin Zhang, Weichuan Zhao, Yali Sun, Changming Sci Rep Article Learning discriminative visual patterns from image local salient regions is widely used for fine-grained visual classification (FGVC) tasks such as plant or animal species classification. A large number of complex networks have been designed for learning discriminative feature representations. In this paper, we propose a novel local structure information (LSI) learning method for FGVC. Firstly, we indicate that the existing FGVC methods have not properly considered how to extract LSI from an input image for FGVC. Then an LSI extraction technique is introduced which has the ability to properly depict the properties of different local structure features in images. Secondly, a novel LSI learning module is proposed to be added into a given backbone network for enhancing the ability of the network to find salient regions. Thirdly, extensive experiments show that our proposed method achieves better performance on six image datasets. Particularly, the proposed method performs far better on datasets with a limited number of images. Nature Publishing Group UK 2022-11-10 /pmc/articles/PMC9649701/ /pubmed/36357665 http://dx.doi.org/10.1038/s41598-022-23835-0 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Lu, Jin
Zhang, Weichuan
Zhao, Yali
Sun, Changming
Image local structure information learning for fine-grained visual classification
title Image local structure information learning for fine-grained visual classification
title_full Image local structure information learning for fine-grained visual classification
title_fullStr Image local structure information learning for fine-grained visual classification
title_full_unstemmed Image local structure information learning for fine-grained visual classification
title_short Image local structure information learning for fine-grained visual classification
title_sort image local structure information learning for fine-grained visual classification
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9649701/
https://www.ncbi.nlm.nih.gov/pubmed/36357665
http://dx.doi.org/10.1038/s41598-022-23835-0
work_keys_str_mv AT lujin imagelocalstructureinformationlearningforfinegrainedvisualclassification
AT zhangweichuan imagelocalstructureinformationlearningforfinegrainedvisualclassification
AT zhaoyali imagelocalstructureinformationlearningforfinegrainedvisualclassification
AT sunchangming imagelocalstructureinformationlearningforfinegrainedvisualclassification