Cargando…

Feature fusion network based on strip pooling

Contextual information is a key factor affecting semantic segmentation. Recently, many methods have tried to use the self-attention mechanism to capture more contextual information. However, these methods with self-attention mechanism need a huge computation. In order to solve this problem, a novel...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Gaihua, Zhai, Qianyu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8553855/
https://www.ncbi.nlm.nih.gov/pubmed/34711889
http://dx.doi.org/10.1038/s41598-021-00585-z
_version_ 1784591663044231168
author Wang, Gaihua
Zhai, Qianyu
author_facet Wang, Gaihua
Zhai, Qianyu
author_sort Wang, Gaihua
collection PubMed
description Contextual information is a key factor affecting semantic segmentation. Recently, many methods have tried to use the self-attention mechanism to capture more contextual information. However, these methods with self-attention mechanism need a huge computation. In order to solve this problem, a novel self-attention network, called FFANet, is designed to efficiently capture contextual information, which reduces the amount of calculation through strip pooling and linear layers. It proposes the feature fusion (FF) module to calculate the affinity matrix. The affinity matrix can capture the relationship between pixels. Then we multiply the affinity matrix with the feature map, which can selectively increase the weight of the region of interest. Extensive experiments on the public datasets (PASCAL VOC2012, CityScapes) and remote sensing dataset (DLRSD) have been conducted and achieved Mean Iou score 74.5%, 70.3%, and 63.9% respectively. Compared with the current typical algorithms, the proposed method has achieved excellent performance.
format Online
Article
Text
id pubmed-8553855
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-85538552021-11-01 Feature fusion network based on strip pooling Wang, Gaihua Zhai, Qianyu Sci Rep Article Contextual information is a key factor affecting semantic segmentation. Recently, many methods have tried to use the self-attention mechanism to capture more contextual information. However, these methods with self-attention mechanism need a huge computation. In order to solve this problem, a novel self-attention network, called FFANet, is designed to efficiently capture contextual information, which reduces the amount of calculation through strip pooling and linear layers. It proposes the feature fusion (FF) module to calculate the affinity matrix. The affinity matrix can capture the relationship between pixels. Then we multiply the affinity matrix with the feature map, which can selectively increase the weight of the region of interest. Extensive experiments on the public datasets (PASCAL VOC2012, CityScapes) and remote sensing dataset (DLRSD) have been conducted and achieved Mean Iou score 74.5%, 70.3%, and 63.9% respectively. Compared with the current typical algorithms, the proposed method has achieved excellent performance. Nature Publishing Group UK 2021-10-28 /pmc/articles/PMC8553855/ /pubmed/34711889 http://dx.doi.org/10.1038/s41598-021-00585-z Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Wang, Gaihua
Zhai, Qianyu
Feature fusion network based on strip pooling
title Feature fusion network based on strip pooling
title_full Feature fusion network based on strip pooling
title_fullStr Feature fusion network based on strip pooling
title_full_unstemmed Feature fusion network based on strip pooling
title_short Feature fusion network based on strip pooling
title_sort feature fusion network based on strip pooling
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8553855/
https://www.ncbi.nlm.nih.gov/pubmed/34711889
http://dx.doi.org/10.1038/s41598-021-00585-z
work_keys_str_mv AT wanggaihua featurefusionnetworkbasedonstrippooling
AT zhaiqianyu featurefusionnetworkbasedonstrippooling