Cargando…

TMNet: A Two-Branch Multi-Scale Semantic Segmentation Network for Remote Sensing Images

Pixel-level information of remote sensing images is of great value in many fields. CNN has a strong ability to extract image backbone features, but due to the localization of convolution operation, it is challenging to directly obtain global feature information and contextual semantic interaction, w...

Descripción completa

Detalles Bibliográficos
Autores principales: Gao, Yupeng, Zhang, Shengwei, Zuo, Dongshi, Yan, Weihong, Pan, Xin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10346442/
https://www.ncbi.nlm.nih.gov/pubmed/37447759
http://dx.doi.org/10.3390/s23135909
_version_ 1785073314330312704
author Gao, Yupeng
Zhang, Shengwei
Zuo, Dongshi
Yan, Weihong
Pan, Xin
author_facet Gao, Yupeng
Zhang, Shengwei
Zuo, Dongshi
Yan, Weihong
Pan, Xin
author_sort Gao, Yupeng
collection PubMed
description Pixel-level information of remote sensing images is of great value in many fields. CNN has a strong ability to extract image backbone features, but due to the localization of convolution operation, it is challenging to directly obtain global feature information and contextual semantic interaction, which makes it difficult for a pure CNN model to obtain higher precision results in semantic segmentation of remote sensing images. Inspired by the Swin Transformer with global feature coding capability, we design a two-branch multi-scale semantic segmentation network (TMNet) for remote sensing images. The network adopts the structure of a double encoder and a decoder. The Swin Transformer is used to increase the ability to extract global feature information. A multi-scale feature fusion module (MFM) is designed to merge shallow spatial features from images of different scales into deep features. In addition, the feature enhancement module (FEM) and channel enhancement module (CEM) are proposed and added to the dual encoder to enhance the feature extraction. Experiments were conducted on the WHDLD and Potsdam datasets to verify the excellent performance of TMNet.
format Online
Article
Text
id pubmed-10346442
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-103464422023-07-15 TMNet: A Two-Branch Multi-Scale Semantic Segmentation Network for Remote Sensing Images Gao, Yupeng Zhang, Shengwei Zuo, Dongshi Yan, Weihong Pan, Xin Sensors (Basel) Article Pixel-level information of remote sensing images is of great value in many fields. CNN has a strong ability to extract image backbone features, but due to the localization of convolution operation, it is challenging to directly obtain global feature information and contextual semantic interaction, which makes it difficult for a pure CNN model to obtain higher precision results in semantic segmentation of remote sensing images. Inspired by the Swin Transformer with global feature coding capability, we design a two-branch multi-scale semantic segmentation network (TMNet) for remote sensing images. The network adopts the structure of a double encoder and a decoder. The Swin Transformer is used to increase the ability to extract global feature information. A multi-scale feature fusion module (MFM) is designed to merge shallow spatial features from images of different scales into deep features. In addition, the feature enhancement module (FEM) and channel enhancement module (CEM) are proposed and added to the dual encoder to enhance the feature extraction. Experiments were conducted on the WHDLD and Potsdam datasets to verify the excellent performance of TMNet. MDPI 2023-06-26 /pmc/articles/PMC10346442/ /pubmed/37447759 http://dx.doi.org/10.3390/s23135909 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Gao, Yupeng
Zhang, Shengwei
Zuo, Dongshi
Yan, Weihong
Pan, Xin
TMNet: A Two-Branch Multi-Scale Semantic Segmentation Network for Remote Sensing Images
title TMNet: A Two-Branch Multi-Scale Semantic Segmentation Network for Remote Sensing Images
title_full TMNet: A Two-Branch Multi-Scale Semantic Segmentation Network for Remote Sensing Images
title_fullStr TMNet: A Two-Branch Multi-Scale Semantic Segmentation Network for Remote Sensing Images
title_full_unstemmed TMNet: A Two-Branch Multi-Scale Semantic Segmentation Network for Remote Sensing Images
title_short TMNet: A Two-Branch Multi-Scale Semantic Segmentation Network for Remote Sensing Images
title_sort tmnet: a two-branch multi-scale semantic segmentation network for remote sensing images
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10346442/
https://www.ncbi.nlm.nih.gov/pubmed/37447759
http://dx.doi.org/10.3390/s23135909
work_keys_str_mv AT gaoyupeng tmnetatwobranchmultiscalesemanticsegmentationnetworkforremotesensingimages
AT zhangshengwei tmnetatwobranchmultiscalesemanticsegmentationnetworkforremotesensingimages
AT zuodongshi tmnetatwobranchmultiscalesemanticsegmentationnetworkforremotesensingimages
AT yanweihong tmnetatwobranchmultiscalesemanticsegmentationnetworkforremotesensingimages
AT panxin tmnetatwobranchmultiscalesemanticsegmentationnetworkforremotesensingimages