Cargando…
Activation extending based on long-range dependencies for weakly supervised semantic segmentation
Weakly supervised semantic segmentation (WSSS) principally obtains pseudo-labels based on the class activation maps (CAM) to handle expensive annotation resources. However, CAM easily involves false and local activation due to the the lack of annotation information. This paper suggests weakly superv...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10662704/ https://www.ncbi.nlm.nih.gov/pubmed/37988337 http://dx.doi.org/10.1371/journal.pone.0288596 |
_version_ | 1785148589320699904 |
---|---|
author | Liu, Haipeng Zhao, Yibo Wang, Meng Ma, Meiyan Chen, Zhaoyu |
author_facet | Liu, Haipeng Zhao, Yibo Wang, Meng Ma, Meiyan Chen, Zhaoyu |
author_sort | Liu, Haipeng |
collection | PubMed |
description | Weakly supervised semantic segmentation (WSSS) principally obtains pseudo-labels based on the class activation maps (CAM) to handle expensive annotation resources. However, CAM easily involves false and local activation due to the the lack of annotation information. This paper suggests weakly supervised learning as semantic information mining to extend object mask. We proposes a novel architecture to mining semantic information by modeling through long-range dependencies from in-sample and inter-sample. Considering the confusion caused by the long-range dependencies, the images are divided into blocks and carried out self-attention operation on the premise of fewer classes to obtain long-range dependencies, to reduce false predictions. Moreover, we perform global to local weighted self-supervised contrastive learning among image blocks, and the local activation of CAM is transferred to different foreground area. Experiments verified that superior semantic details and more reliable pseudo-labels are captured through these suggested modules. Experiments on PASCAL VOC 2012 demonstrated the proposed model achieves 76.6% and 77.4% mIoU in val and test sets, which is superior to the comparison baselines. |
format | Online Article Text |
id | pubmed-10662704 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-106627042023-11-21 Activation extending based on long-range dependencies for weakly supervised semantic segmentation Liu, Haipeng Zhao, Yibo Wang, Meng Ma, Meiyan Chen, Zhaoyu PLoS One Research Article Weakly supervised semantic segmentation (WSSS) principally obtains pseudo-labels based on the class activation maps (CAM) to handle expensive annotation resources. However, CAM easily involves false and local activation due to the the lack of annotation information. This paper suggests weakly supervised learning as semantic information mining to extend object mask. We proposes a novel architecture to mining semantic information by modeling through long-range dependencies from in-sample and inter-sample. Considering the confusion caused by the long-range dependencies, the images are divided into blocks and carried out self-attention operation on the premise of fewer classes to obtain long-range dependencies, to reduce false predictions. Moreover, we perform global to local weighted self-supervised contrastive learning among image blocks, and the local activation of CAM is transferred to different foreground area. Experiments verified that superior semantic details and more reliable pseudo-labels are captured through these suggested modules. Experiments on PASCAL VOC 2012 demonstrated the proposed model achieves 76.6% and 77.4% mIoU in val and test sets, which is superior to the comparison baselines. Public Library of Science 2023-11-21 /pmc/articles/PMC10662704/ /pubmed/37988337 http://dx.doi.org/10.1371/journal.pone.0288596 Text en © 2023 Liu et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Liu, Haipeng Zhao, Yibo Wang, Meng Ma, Meiyan Chen, Zhaoyu Activation extending based on long-range dependencies for weakly supervised semantic segmentation |
title | Activation extending based on long-range dependencies for weakly supervised semantic segmentation |
title_full | Activation extending based on long-range dependencies for weakly supervised semantic segmentation |
title_fullStr | Activation extending based on long-range dependencies for weakly supervised semantic segmentation |
title_full_unstemmed | Activation extending based on long-range dependencies for weakly supervised semantic segmentation |
title_short | Activation extending based on long-range dependencies for weakly supervised semantic segmentation |
title_sort | activation extending based on long-range dependencies for weakly supervised semantic segmentation |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10662704/ https://www.ncbi.nlm.nih.gov/pubmed/37988337 http://dx.doi.org/10.1371/journal.pone.0288596 |
work_keys_str_mv | AT liuhaipeng activationextendingbasedonlongrangedependenciesforweaklysupervisedsemanticsegmentation AT zhaoyibo activationextendingbasedonlongrangedependenciesforweaklysupervisedsemanticsegmentation AT wangmeng activationextendingbasedonlongrangedependenciesforweaklysupervisedsemanticsegmentation AT mameiyan activationextendingbasedonlongrangedependenciesforweaklysupervisedsemanticsegmentation AT chenzhaoyu activationextendingbasedonlongrangedependenciesforweaklysupervisedsemanticsegmentation |