Cargando…
Feature fusion network based on strip pooling
Contextual information is a key factor affecting semantic segmentation. Recently, many methods have tried to use the self-attention mechanism to capture more contextual information. However, these methods with self-attention mechanism need a huge computation. In order to solve this problem, a novel...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8553855/ https://www.ncbi.nlm.nih.gov/pubmed/34711889 http://dx.doi.org/10.1038/s41598-021-00585-z |
_version_ | 1784591663044231168 |
---|---|
author | Wang, Gaihua Zhai, Qianyu |
author_facet | Wang, Gaihua Zhai, Qianyu |
author_sort | Wang, Gaihua |
collection | PubMed |
description | Contextual information is a key factor affecting semantic segmentation. Recently, many methods have tried to use the self-attention mechanism to capture more contextual information. However, these methods with self-attention mechanism need a huge computation. In order to solve this problem, a novel self-attention network, called FFANet, is designed to efficiently capture contextual information, which reduces the amount of calculation through strip pooling and linear layers. It proposes the feature fusion (FF) module to calculate the affinity matrix. The affinity matrix can capture the relationship between pixels. Then we multiply the affinity matrix with the feature map, which can selectively increase the weight of the region of interest. Extensive experiments on the public datasets (PASCAL VOC2012, CityScapes) and remote sensing dataset (DLRSD) have been conducted and achieved Mean Iou score 74.5%, 70.3%, and 63.9% respectively. Compared with the current typical algorithms, the proposed method has achieved excellent performance. |
format | Online Article Text |
id | pubmed-8553855 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-85538552021-11-01 Feature fusion network based on strip pooling Wang, Gaihua Zhai, Qianyu Sci Rep Article Contextual information is a key factor affecting semantic segmentation. Recently, many methods have tried to use the self-attention mechanism to capture more contextual information. However, these methods with self-attention mechanism need a huge computation. In order to solve this problem, a novel self-attention network, called FFANet, is designed to efficiently capture contextual information, which reduces the amount of calculation through strip pooling and linear layers. It proposes the feature fusion (FF) module to calculate the affinity matrix. The affinity matrix can capture the relationship between pixels. Then we multiply the affinity matrix with the feature map, which can selectively increase the weight of the region of interest. Extensive experiments on the public datasets (PASCAL VOC2012, CityScapes) and remote sensing dataset (DLRSD) have been conducted and achieved Mean Iou score 74.5%, 70.3%, and 63.9% respectively. Compared with the current typical algorithms, the proposed method has achieved excellent performance. Nature Publishing Group UK 2021-10-28 /pmc/articles/PMC8553855/ /pubmed/34711889 http://dx.doi.org/10.1038/s41598-021-00585-z Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Wang, Gaihua Zhai, Qianyu Feature fusion network based on strip pooling |
title | Feature fusion network based on strip pooling |
title_full | Feature fusion network based on strip pooling |
title_fullStr | Feature fusion network based on strip pooling |
title_full_unstemmed | Feature fusion network based on strip pooling |
title_short | Feature fusion network based on strip pooling |
title_sort | feature fusion network based on strip pooling |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8553855/ https://www.ncbi.nlm.nih.gov/pubmed/34711889 http://dx.doi.org/10.1038/s41598-021-00585-z |
work_keys_str_mv | AT wanggaihua featurefusionnetworkbasedonstrippooling AT zhaiqianyu featurefusionnetworkbasedonstrippooling |