Cargando…

Land Cover Classification of UAV Remote Sensing Based on Transformer–CNN Hybrid Architecture

High-precision land cover maps of remote sensing images based on an intelligent extraction method are an important research field for many scholars. In recent years, deep learning represented by convolutional neural networks has been introduced into the field of land cover remote sensing mapping. In...

Descripción completa

Detalles Bibliográficos
Autores principales: Lu, Tingyu, Wan, Luhe, Qi, Shaoqun, Gao, Meixiang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10256031/
https://www.ncbi.nlm.nih.gov/pubmed/37300015
http://dx.doi.org/10.3390/s23115288
_version_ 1785057015749410816
author Lu, Tingyu
Wan, Luhe
Qi, Shaoqun
Gao, Meixiang
author_facet Lu, Tingyu
Wan, Luhe
Qi, Shaoqun
Gao, Meixiang
author_sort Lu, Tingyu
collection PubMed
description High-precision land cover maps of remote sensing images based on an intelligent extraction method are an important research field for many scholars. In recent years, deep learning represented by convolutional neural networks has been introduced into the field of land cover remote sensing mapping. In view of the problem that a convolution operation is good at extracting local features but has limitations in modeling long-distance dependence relationships, a semantic segmentation network, DE-UNet, with a dual encoder is proposed in this paper. The Swin Transformer and convolutional neural network are used to design the hybrid architecture. The Swin Transformer pays attention to multi-scale global features and learns local features through the convolutional neural network. Integrated features take into account both global and local context information. In the experiment, remote sensing images from UAVs were used to test three deep learning models including DE-UNet. DE-UNet achieved the highest classification accuracy, and the average overall accuracy was 0.28% and 4.81% higher than UNet and UNet++, respectively. It shows that the introduction of a Transformer enhances the model fitting ability.
format Online
Article
Text
id pubmed-10256031
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-102560312023-06-10 Land Cover Classification of UAV Remote Sensing Based on Transformer–CNN Hybrid Architecture Lu, Tingyu Wan, Luhe Qi, Shaoqun Gao, Meixiang Sensors (Basel) Article High-precision land cover maps of remote sensing images based on an intelligent extraction method are an important research field for many scholars. In recent years, deep learning represented by convolutional neural networks has been introduced into the field of land cover remote sensing mapping. In view of the problem that a convolution operation is good at extracting local features but has limitations in modeling long-distance dependence relationships, a semantic segmentation network, DE-UNet, with a dual encoder is proposed in this paper. The Swin Transformer and convolutional neural network are used to design the hybrid architecture. The Swin Transformer pays attention to multi-scale global features and learns local features through the convolutional neural network. Integrated features take into account both global and local context information. In the experiment, remote sensing images from UAVs were used to test three deep learning models including DE-UNet. DE-UNet achieved the highest classification accuracy, and the average overall accuracy was 0.28% and 4.81% higher than UNet and UNet++, respectively. It shows that the introduction of a Transformer enhances the model fitting ability. MDPI 2023-06-02 /pmc/articles/PMC10256031/ /pubmed/37300015 http://dx.doi.org/10.3390/s23115288 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Lu, Tingyu
Wan, Luhe
Qi, Shaoqun
Gao, Meixiang
Land Cover Classification of UAV Remote Sensing Based on Transformer–CNN Hybrid Architecture
title Land Cover Classification of UAV Remote Sensing Based on Transformer–CNN Hybrid Architecture
title_full Land Cover Classification of UAV Remote Sensing Based on Transformer–CNN Hybrid Architecture
title_fullStr Land Cover Classification of UAV Remote Sensing Based on Transformer–CNN Hybrid Architecture
title_full_unstemmed Land Cover Classification of UAV Remote Sensing Based on Transformer–CNN Hybrid Architecture
title_short Land Cover Classification of UAV Remote Sensing Based on Transformer–CNN Hybrid Architecture
title_sort land cover classification of uav remote sensing based on transformer–cnn hybrid architecture
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10256031/
https://www.ncbi.nlm.nih.gov/pubmed/37300015
http://dx.doi.org/10.3390/s23115288
work_keys_str_mv AT lutingyu landcoverclassificationofuavremotesensingbasedontransformercnnhybridarchitecture
AT wanluhe landcoverclassificationofuavremotesensingbasedontransformercnnhybridarchitecture
AT qishaoqun landcoverclassificationofuavremotesensingbasedontransformercnnhybridarchitecture
AT gaomeixiang landcoverclassificationofuavremotesensingbasedontransformercnnhybridarchitecture