Cargando…

CAEVT: Convolutional Autoencoder Meets Lightweight Vision Transformer for Hyperspectral Image Classification

Convolutional neural networks (CNNs) have been prominent in most hyperspectral image (HSI) processing applications due to their advantages in extracting local information. Despite their success, the locality of the convolutional layers within CNNs results in heavyweight models and time-consuming def...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Zhiwen, Li, Teng, Tang, Xuebin, Hu, Xiang, Peng, Yuanxi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9146051/
https://www.ncbi.nlm.nih.gov/pubmed/35632310
http://dx.doi.org/10.3390/s22103902
_version_ 1784716465370300416
author Zhang, Zhiwen
Li, Teng
Tang, Xuebin
Hu, Xiang
Peng, Yuanxi
author_facet Zhang, Zhiwen
Li, Teng
Tang, Xuebin
Hu, Xiang
Peng, Yuanxi
author_sort Zhang, Zhiwen
collection PubMed
description Convolutional neural networks (CNNs) have been prominent in most hyperspectral image (HSI) processing applications due to their advantages in extracting local information. Despite their success, the locality of the convolutional layers within CNNs results in heavyweight models and time-consuming defects. In this study, inspired by the excellent performance of transformers that are used for long-range representation learning in computer vision tasks, we built a lightweight vision transformer for HSI classification that can extract local and global information simultaneously, thereby facilitating accurate classification. Moreover, as traditional dimensionality reduction methods are limited in their linear representation ability, a three-dimensional convolutional autoencoder was adopted to capture the nonlinear characteristics between spectral bands. Based on the aforementioned three-dimensional convolutional autoencoder and lightweight vision transformer, we designed an HSI classification network, namely the “convolutional autoencoder meets lightweight vision transformer” (CAEVT). Finally, we validated the performance of the proposed CAEVT network using four widely used hyperspectral datasets. Our approach showed superiority, especially in the absence of sufficient labeled samples, which demonstrates the effectiveness and efficiency of the CAEVT network.
format Online
Article
Text
id pubmed-9146051
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-91460512022-05-29 CAEVT: Convolutional Autoencoder Meets Lightweight Vision Transformer for Hyperspectral Image Classification Zhang, Zhiwen Li, Teng Tang, Xuebin Hu, Xiang Peng, Yuanxi Sensors (Basel) Article Convolutional neural networks (CNNs) have been prominent in most hyperspectral image (HSI) processing applications due to their advantages in extracting local information. Despite their success, the locality of the convolutional layers within CNNs results in heavyweight models and time-consuming defects. In this study, inspired by the excellent performance of transformers that are used for long-range representation learning in computer vision tasks, we built a lightweight vision transformer for HSI classification that can extract local and global information simultaneously, thereby facilitating accurate classification. Moreover, as traditional dimensionality reduction methods are limited in their linear representation ability, a three-dimensional convolutional autoencoder was adopted to capture the nonlinear characteristics between spectral bands. Based on the aforementioned three-dimensional convolutional autoencoder and lightweight vision transformer, we designed an HSI classification network, namely the “convolutional autoencoder meets lightweight vision transformer” (CAEVT). Finally, we validated the performance of the proposed CAEVT network using four widely used hyperspectral datasets. Our approach showed superiority, especially in the absence of sufficient labeled samples, which demonstrates the effectiveness and efficiency of the CAEVT network. MDPI 2022-05-20 /pmc/articles/PMC9146051/ /pubmed/35632310 http://dx.doi.org/10.3390/s22103902 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zhang, Zhiwen
Li, Teng
Tang, Xuebin
Hu, Xiang
Peng, Yuanxi
CAEVT: Convolutional Autoencoder Meets Lightweight Vision Transformer for Hyperspectral Image Classification
title CAEVT: Convolutional Autoencoder Meets Lightweight Vision Transformer for Hyperspectral Image Classification
title_full CAEVT: Convolutional Autoencoder Meets Lightweight Vision Transformer for Hyperspectral Image Classification
title_fullStr CAEVT: Convolutional Autoencoder Meets Lightweight Vision Transformer for Hyperspectral Image Classification
title_full_unstemmed CAEVT: Convolutional Autoencoder Meets Lightweight Vision Transformer for Hyperspectral Image Classification
title_short CAEVT: Convolutional Autoencoder Meets Lightweight Vision Transformer for Hyperspectral Image Classification
title_sort caevt: convolutional autoencoder meets lightweight vision transformer for hyperspectral image classification
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9146051/
https://www.ncbi.nlm.nih.gov/pubmed/35632310
http://dx.doi.org/10.3390/s22103902
work_keys_str_mv AT zhangzhiwen caevtconvolutionalautoencodermeetslightweightvisiontransformerforhyperspectralimageclassification
AT liteng caevtconvolutionalautoencodermeetslightweightvisiontransformerforhyperspectralimageclassification
AT tangxuebin caevtconvolutionalautoencodermeetslightweightvisiontransformerforhyperspectralimageclassification
AT huxiang caevtconvolutionalautoencodermeetslightweightvisiontransformerforhyperspectralimageclassification
AT pengyuanxi caevtconvolutionalautoencodermeetslightweightvisiontransformerforhyperspectralimageclassification