Cargando…

Module of Axis-based Nexus Attention for weakly supervised object localization

Weakly supervised object localization tasks remain challenging to identify and segment an entire object rather than only discriminative parts of the object. To tackle this problem, corruption-based approaches have been devised, which involve the training of non-discriminative regions by corrupting (...

Descripción completa

Detalles Bibliográficos
Autores principales: Sohn, Junghyo, Jeon, Eunjin, Jung, Wonsik, Kang, Eunsong, Suk, Heung-Il
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10616293/
https://www.ncbi.nlm.nih.gov/pubmed/37903879
http://dx.doi.org/10.1038/s41598-023-45796-8
_version_ 1785129362861850624
author Sohn, Junghyo
Jeon, Eunjin
Jung, Wonsik
Kang, Eunsong
Suk, Heung-Il
author_facet Sohn, Junghyo
Jeon, Eunjin
Jung, Wonsik
Kang, Eunsong
Suk, Heung-Il
author_sort Sohn, Junghyo
collection PubMed
description Weakly supervised object localization tasks remain challenging to identify and segment an entire object rather than only discriminative parts of the object. To tackle this problem, corruption-based approaches have been devised, which involve the training of non-discriminative regions by corrupting (e.g., erasing) the input images or intermediate feature maps. However, this approach requires an additional hyperparameter, the corrupting threshold, to determine the degree of corruption and can unfavorably disrupt training. It also tends to localize object regions coarsely. In this paper, we propose a novel approach, Module of Axis-based Nexus Attention (MoANA), which helps to adaptively activate less discriminative regions along with the class-discriminative regions without an additional hyperparameter, and elaborately localizes an entire object. Specifically, MoANA consists of three mechanisms (1) triple-view attentions representation, (2) attentions expansion, and (3) features calibration mechanism. Unlike other attention-based methods that train a coarse attention map with the same values across elements in feature maps, MoANA trains fine-grained values in an attention map by assigning different attention values to each element. We validated MoANA by comparing it with various methods. We also analyzed the effect of each component in MoANA and visualized attention maps to provide insights into the calibration.
format Online
Article
Text
id pubmed-10616293
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-106162932023-11-01 Module of Axis-based Nexus Attention for weakly supervised object localization Sohn, Junghyo Jeon, Eunjin Jung, Wonsik Kang, Eunsong Suk, Heung-Il Sci Rep Article Weakly supervised object localization tasks remain challenging to identify and segment an entire object rather than only discriminative parts of the object. To tackle this problem, corruption-based approaches have been devised, which involve the training of non-discriminative regions by corrupting (e.g., erasing) the input images or intermediate feature maps. However, this approach requires an additional hyperparameter, the corrupting threshold, to determine the degree of corruption and can unfavorably disrupt training. It also tends to localize object regions coarsely. In this paper, we propose a novel approach, Module of Axis-based Nexus Attention (MoANA), which helps to adaptively activate less discriminative regions along with the class-discriminative regions without an additional hyperparameter, and elaborately localizes an entire object. Specifically, MoANA consists of three mechanisms (1) triple-view attentions representation, (2) attentions expansion, and (3) features calibration mechanism. Unlike other attention-based methods that train a coarse attention map with the same values across elements in feature maps, MoANA trains fine-grained values in an attention map by assigning different attention values to each element. We validated MoANA by comparing it with various methods. We also analyzed the effect of each component in MoANA and visualized attention maps to provide insights into the calibration. Nature Publishing Group UK 2023-10-30 /pmc/articles/PMC10616293/ /pubmed/37903879 http://dx.doi.org/10.1038/s41598-023-45796-8 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Sohn, Junghyo
Jeon, Eunjin
Jung, Wonsik
Kang, Eunsong
Suk, Heung-Il
Module of Axis-based Nexus Attention for weakly supervised object localization
title Module of Axis-based Nexus Attention for weakly supervised object localization
title_full Module of Axis-based Nexus Attention for weakly supervised object localization
title_fullStr Module of Axis-based Nexus Attention for weakly supervised object localization
title_full_unstemmed Module of Axis-based Nexus Attention for weakly supervised object localization
title_short Module of Axis-based Nexus Attention for weakly supervised object localization
title_sort module of axis-based nexus attention for weakly supervised object localization
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10616293/
https://www.ncbi.nlm.nih.gov/pubmed/37903879
http://dx.doi.org/10.1038/s41598-023-45796-8
work_keys_str_mv AT sohnjunghyo moduleofaxisbasednexusattentionforweaklysupervisedobjectlocalization
AT jeoneunjin moduleofaxisbasednexusattentionforweaklysupervisedobjectlocalization
AT jungwonsik moduleofaxisbasednexusattentionforweaklysupervisedobjectlocalization
AT kangeunsong moduleofaxisbasednexusattentionforweaklysupervisedobjectlocalization
AT sukheungil moduleofaxisbasednexusattentionforweaklysupervisedobjectlocalization