Cargando…
Land Cover Classification of UAV Remote Sensing Based on Transformer–CNN Hybrid Architecture
High-precision land cover maps of remote sensing images based on an intelligent extraction method are an important research field for many scholars. In recent years, deep learning represented by convolutional neural networks has been introduced into the field of land cover remote sensing mapping. In...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10256031/ https://www.ncbi.nlm.nih.gov/pubmed/37300015 http://dx.doi.org/10.3390/s23115288 |
_version_ | 1785057015749410816 |
---|---|
author | Lu, Tingyu Wan, Luhe Qi, Shaoqun Gao, Meixiang |
author_facet | Lu, Tingyu Wan, Luhe Qi, Shaoqun Gao, Meixiang |
author_sort | Lu, Tingyu |
collection | PubMed |
description | High-precision land cover maps of remote sensing images based on an intelligent extraction method are an important research field for many scholars. In recent years, deep learning represented by convolutional neural networks has been introduced into the field of land cover remote sensing mapping. In view of the problem that a convolution operation is good at extracting local features but has limitations in modeling long-distance dependence relationships, a semantic segmentation network, DE-UNet, with a dual encoder is proposed in this paper. The Swin Transformer and convolutional neural network are used to design the hybrid architecture. The Swin Transformer pays attention to multi-scale global features and learns local features through the convolutional neural network. Integrated features take into account both global and local context information. In the experiment, remote sensing images from UAVs were used to test three deep learning models including DE-UNet. DE-UNet achieved the highest classification accuracy, and the average overall accuracy was 0.28% and 4.81% higher than UNet and UNet++, respectively. It shows that the introduction of a Transformer enhances the model fitting ability. |
format | Online Article Text |
id | pubmed-10256031 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-102560312023-06-10 Land Cover Classification of UAV Remote Sensing Based on Transformer–CNN Hybrid Architecture Lu, Tingyu Wan, Luhe Qi, Shaoqun Gao, Meixiang Sensors (Basel) Article High-precision land cover maps of remote sensing images based on an intelligent extraction method are an important research field for many scholars. In recent years, deep learning represented by convolutional neural networks has been introduced into the field of land cover remote sensing mapping. In view of the problem that a convolution operation is good at extracting local features but has limitations in modeling long-distance dependence relationships, a semantic segmentation network, DE-UNet, with a dual encoder is proposed in this paper. The Swin Transformer and convolutional neural network are used to design the hybrid architecture. The Swin Transformer pays attention to multi-scale global features and learns local features through the convolutional neural network. Integrated features take into account both global and local context information. In the experiment, remote sensing images from UAVs were used to test three deep learning models including DE-UNet. DE-UNet achieved the highest classification accuracy, and the average overall accuracy was 0.28% and 4.81% higher than UNet and UNet++, respectively. It shows that the introduction of a Transformer enhances the model fitting ability. MDPI 2023-06-02 /pmc/articles/PMC10256031/ /pubmed/37300015 http://dx.doi.org/10.3390/s23115288 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Lu, Tingyu Wan, Luhe Qi, Shaoqun Gao, Meixiang Land Cover Classification of UAV Remote Sensing Based on Transformer–CNN Hybrid Architecture |
title | Land Cover Classification of UAV Remote Sensing Based on Transformer–CNN Hybrid Architecture |
title_full | Land Cover Classification of UAV Remote Sensing Based on Transformer–CNN Hybrid Architecture |
title_fullStr | Land Cover Classification of UAV Remote Sensing Based on Transformer–CNN Hybrid Architecture |
title_full_unstemmed | Land Cover Classification of UAV Remote Sensing Based on Transformer–CNN Hybrid Architecture |
title_short | Land Cover Classification of UAV Remote Sensing Based on Transformer–CNN Hybrid Architecture |
title_sort | land cover classification of uav remote sensing based on transformer–cnn hybrid architecture |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10256031/ https://www.ncbi.nlm.nih.gov/pubmed/37300015 http://dx.doi.org/10.3390/s23115288 |
work_keys_str_mv | AT lutingyu landcoverclassificationofuavremotesensingbasedontransformercnnhybridarchitecture AT wanluhe landcoverclassificationofuavremotesensingbasedontransformercnnhybridarchitecture AT qishaoqun landcoverclassificationofuavremotesensingbasedontransformercnnhybridarchitecture AT gaomeixiang landcoverclassificationofuavremotesensingbasedontransformercnnhybridarchitecture |