Cargando…

Hyperspectral Image Labeling and Classification Using an Ensemble Semi-Supervised Machine Learning Approach

Hyperspectral remote sensing has tremendous potential for monitoring land cover and water bodies from the rich spatial and spectral information contained in the images. It is a time and resource consuming task to obtain groundtruth data for these images by field sampling. A semi-supervised method fo...

Descripción completa

Detalles Bibliográficos
Autores principales: Manian, Vidya, Alfaro-Mejía, Estefanía, Tokars, Roger P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8877511/
https://www.ncbi.nlm.nih.gov/pubmed/35214523
http://dx.doi.org/10.3390/s22041623
_version_ 1784658438229327872
author Manian, Vidya
Alfaro-Mejía, Estefanía
Tokars, Roger P.
author_facet Manian, Vidya
Alfaro-Mejía, Estefanía
Tokars, Roger P.
author_sort Manian, Vidya
collection PubMed
description Hyperspectral remote sensing has tremendous potential for monitoring land cover and water bodies from the rich spatial and spectral information contained in the images. It is a time and resource consuming task to obtain groundtruth data for these images by field sampling. A semi-supervised method for labeling and classification of hyperspectral images is presented. The unsupervised stage consists of image enhancement by feature extraction, followed by clustering for labeling and generating the groundtruth image. The supervised stage for classification consists of a preprocessing stage involving normalization, computation of principal components, and feature extraction. An ensemble of machine learning models takes the extracted features and groundtruth data from the unsupervised stage as input and a decision block then combines the output of the machines to label the image based on majority voting. The ensemble of machine learning methods includes support vector machines, gradient boosting, Gaussian classifier, and linear perceptron. Overall, the gradient boosting method gives the best performance for supervised classification of hyperspectral images. The presented ensemble method is useful for generating labeled data for hyperspectral images that do not have groundtruth information. It gives an overall accuracy of 93.74% for the Jasper hyperspectral image, 100% accuracy for the HSI2 Lake Erie images, and 99.92% for the classification of cyanobacteria or harmful algal blooms and surface scum. The method distinguishes well between blue green algae and surface scum. The full pipeline ensemble method for classifying Lake Erie images in a cloud server runs 24 times faster than a workstation.
format Online
Article
Text
id pubmed-8877511
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-88775112022-02-26 Hyperspectral Image Labeling and Classification Using an Ensemble Semi-Supervised Machine Learning Approach Manian, Vidya Alfaro-Mejía, Estefanía Tokars, Roger P. Sensors (Basel) Article Hyperspectral remote sensing has tremendous potential for monitoring land cover and water bodies from the rich spatial and spectral information contained in the images. It is a time and resource consuming task to obtain groundtruth data for these images by field sampling. A semi-supervised method for labeling and classification of hyperspectral images is presented. The unsupervised stage consists of image enhancement by feature extraction, followed by clustering for labeling and generating the groundtruth image. The supervised stage for classification consists of a preprocessing stage involving normalization, computation of principal components, and feature extraction. An ensemble of machine learning models takes the extracted features and groundtruth data from the unsupervised stage as input and a decision block then combines the output of the machines to label the image based on majority voting. The ensemble of machine learning methods includes support vector machines, gradient boosting, Gaussian classifier, and linear perceptron. Overall, the gradient boosting method gives the best performance for supervised classification of hyperspectral images. The presented ensemble method is useful for generating labeled data for hyperspectral images that do not have groundtruth information. It gives an overall accuracy of 93.74% for the Jasper hyperspectral image, 100% accuracy for the HSI2 Lake Erie images, and 99.92% for the classification of cyanobacteria or harmful algal blooms and surface scum. The method distinguishes well between blue green algae and surface scum. The full pipeline ensemble method for classifying Lake Erie images in a cloud server runs 24 times faster than a workstation. MDPI 2022-02-18 /pmc/articles/PMC8877511/ /pubmed/35214523 http://dx.doi.org/10.3390/s22041623 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Manian, Vidya
Alfaro-Mejía, Estefanía
Tokars, Roger P.
Hyperspectral Image Labeling and Classification Using an Ensemble Semi-Supervised Machine Learning Approach
title Hyperspectral Image Labeling and Classification Using an Ensemble Semi-Supervised Machine Learning Approach
title_full Hyperspectral Image Labeling and Classification Using an Ensemble Semi-Supervised Machine Learning Approach
title_fullStr Hyperspectral Image Labeling and Classification Using an Ensemble Semi-Supervised Machine Learning Approach
title_full_unstemmed Hyperspectral Image Labeling and Classification Using an Ensemble Semi-Supervised Machine Learning Approach
title_short Hyperspectral Image Labeling and Classification Using an Ensemble Semi-Supervised Machine Learning Approach
title_sort hyperspectral image labeling and classification using an ensemble semi-supervised machine learning approach
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8877511/
https://www.ncbi.nlm.nih.gov/pubmed/35214523
http://dx.doi.org/10.3390/s22041623
work_keys_str_mv AT manianvidya hyperspectralimagelabelingandclassificationusinganensemblesemisupervisedmachinelearningapproach
AT alfaromejiaestefania hyperspectralimagelabelingandclassificationusinganensemblesemisupervisedmachinelearningapproach
AT tokarsrogerp hyperspectralimagelabelingandclassificationusinganensemblesemisupervisedmachinelearningapproach