Cargando…
TMNet: A Two-Branch Multi-Scale Semantic Segmentation Network for Remote Sensing Images
Pixel-level information of remote sensing images is of great value in many fields. CNN has a strong ability to extract image backbone features, but due to the localization of convolution operation, it is challenging to directly obtain global feature information and contextual semantic interaction, w...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10346442/ https://www.ncbi.nlm.nih.gov/pubmed/37447759 http://dx.doi.org/10.3390/s23135909 |
_version_ | 1785073314330312704 |
---|---|
author | Gao, Yupeng Zhang, Shengwei Zuo, Dongshi Yan, Weihong Pan, Xin |
author_facet | Gao, Yupeng Zhang, Shengwei Zuo, Dongshi Yan, Weihong Pan, Xin |
author_sort | Gao, Yupeng |
collection | PubMed |
description | Pixel-level information of remote sensing images is of great value in many fields. CNN has a strong ability to extract image backbone features, but due to the localization of convolution operation, it is challenging to directly obtain global feature information and contextual semantic interaction, which makes it difficult for a pure CNN model to obtain higher precision results in semantic segmentation of remote sensing images. Inspired by the Swin Transformer with global feature coding capability, we design a two-branch multi-scale semantic segmentation network (TMNet) for remote sensing images. The network adopts the structure of a double encoder and a decoder. The Swin Transformer is used to increase the ability to extract global feature information. A multi-scale feature fusion module (MFM) is designed to merge shallow spatial features from images of different scales into deep features. In addition, the feature enhancement module (FEM) and channel enhancement module (CEM) are proposed and added to the dual encoder to enhance the feature extraction. Experiments were conducted on the WHDLD and Potsdam datasets to verify the excellent performance of TMNet. |
format | Online Article Text |
id | pubmed-10346442 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-103464422023-07-15 TMNet: A Two-Branch Multi-Scale Semantic Segmentation Network for Remote Sensing Images Gao, Yupeng Zhang, Shengwei Zuo, Dongshi Yan, Weihong Pan, Xin Sensors (Basel) Article Pixel-level information of remote sensing images is of great value in many fields. CNN has a strong ability to extract image backbone features, but due to the localization of convolution operation, it is challenging to directly obtain global feature information and contextual semantic interaction, which makes it difficult for a pure CNN model to obtain higher precision results in semantic segmentation of remote sensing images. Inspired by the Swin Transformer with global feature coding capability, we design a two-branch multi-scale semantic segmentation network (TMNet) for remote sensing images. The network adopts the structure of a double encoder and a decoder. The Swin Transformer is used to increase the ability to extract global feature information. A multi-scale feature fusion module (MFM) is designed to merge shallow spatial features from images of different scales into deep features. In addition, the feature enhancement module (FEM) and channel enhancement module (CEM) are proposed and added to the dual encoder to enhance the feature extraction. Experiments were conducted on the WHDLD and Potsdam datasets to verify the excellent performance of TMNet. MDPI 2023-06-26 /pmc/articles/PMC10346442/ /pubmed/37447759 http://dx.doi.org/10.3390/s23135909 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Gao, Yupeng Zhang, Shengwei Zuo, Dongshi Yan, Weihong Pan, Xin TMNet: A Two-Branch Multi-Scale Semantic Segmentation Network for Remote Sensing Images |
title | TMNet: A Two-Branch Multi-Scale Semantic Segmentation Network for Remote Sensing Images |
title_full | TMNet: A Two-Branch Multi-Scale Semantic Segmentation Network for Remote Sensing Images |
title_fullStr | TMNet: A Two-Branch Multi-Scale Semantic Segmentation Network for Remote Sensing Images |
title_full_unstemmed | TMNet: A Two-Branch Multi-Scale Semantic Segmentation Network for Remote Sensing Images |
title_short | TMNet: A Two-Branch Multi-Scale Semantic Segmentation Network for Remote Sensing Images |
title_sort | tmnet: a two-branch multi-scale semantic segmentation network for remote sensing images |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10346442/ https://www.ncbi.nlm.nih.gov/pubmed/37447759 http://dx.doi.org/10.3390/s23135909 |
work_keys_str_mv | AT gaoyupeng tmnetatwobranchmultiscalesemanticsegmentationnetworkforremotesensingimages AT zhangshengwei tmnetatwobranchmultiscalesemanticsegmentationnetworkforremotesensingimages AT zuodongshi tmnetatwobranchmultiscalesemanticsegmentationnetworkforremotesensingimages AT yanweihong tmnetatwobranchmultiscalesemanticsegmentationnetworkforremotesensingimages AT panxin tmnetatwobranchmultiscalesemanticsegmentationnetworkforremotesensingimages |