Cargando…

Improved multivariate modeling for soil organic matter content estimation using hyperspectral indexes and characteristic bands

Soil organic matter (SOM) is a key index of soil fertility. Calculating spectral index and screening characteristic band reduce redundancy information of hyperspectral data, and improve the accuracy of SOM prediction. This study aimed to compare the improvement of model accuracy by spectral index an...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Ming-Song, Wang, Tao, Lu, Yuanyuan, Wang, Shihang, Wu, Yunjin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10266645/
https://www.ncbi.nlm.nih.gov/pubmed/37315071
http://dx.doi.org/10.1371/journal.pone.0286825
_version_ 1785058782799200256
author Zhao, Ming-Song
Wang, Tao
Lu, Yuanyuan
Wang, Shihang
Wu, Yunjin
author_facet Zhao, Ming-Song
Wang, Tao
Lu, Yuanyuan
Wang, Shihang
Wu, Yunjin
author_sort Zhao, Ming-Song
collection PubMed
description Soil organic matter (SOM) is a key index of soil fertility. Calculating spectral index and screening characteristic band reduce redundancy information of hyperspectral data, and improve the accuracy of SOM prediction. This study aimed to compare the improvement of model accuracy by spectral index and characteristic band. This study collected 178 samples of topsoil (0–20 cm) in the central plain of Jiangsu, East China. Firstly, visible and near-infrared (VNIR, 350–2500 nm) reflectance spectra were measured using ASD FieldSpec 4 Std-Res spectral radiometer in the laboratory, and inverse-log reflectance (LR), continuum removal (CR), first-order derivative reflectance (FDR) were applied to transform the original reflectance (R). Secondly, optimal spectral indexes (including deviation of arch, difference index, ratio index, and normalized difference index) were calculated from each type of VNIR spectra. Characteristic bands were selected from each type of spectra by the competitive adaptive reweighted sampling (CARS) algorithm, respectively. Thirdly, SOM prediction models were established based on random forest (RF), support vector regression (SVR), deep neural networks (DNN) and partial least squares regression (PLSR) methods using optimal spectral indexes, denoted here as SI-based models. Meanwhile, SOM prediction models were established using characteristic wavelengths, denoted here as CARS-based models. Finally, this research compared and assessed accuracy of SI-based models and CARS-based models, and selected optimal model. Results showed: (1) The correlation between optimal spectral indexes and SOM was enhanced, with absolute value of correlation coefficient between 0.66 and 0.83. The SI-based models predicted SOM content accurately, with the coefficient of determination (R(2)) and root mean square error (RMSE) values ranging from 0.80 to 0.87, 2.40 g/kg to 2.88 g/kg in validation sets, and relative percent deviation (RPD) value between 2.14 and 2.52. (2) The accuracy of CARS-based models differed with models and spectral transformations. For all spectral transformations, PLSR and SVR combined with CARS displayed the best prediction (R(2) and RMSE values ranged from 0.87 to 0.92, 1.91 g/kg to 2.56 g/kg in validation sets, and RPD value ranged from 2.41 to 3.23). For FDR and CR spectra, DNN and RF models achieved more accuracy (R(2) and RMSE values ranged from 0.69 to 0.91, 1.90 g/kg to 3.57 g/kg in validation sets, and RPD value ranged from 1.73 to 3.25) than LR and R spectra (R(2) and RMSE values from 0.20 to 0.35, 5.08 g/kg to 6.44 g/kg in validation sets, and RPD value ranged from 0.96 to 1.21). (3) Overall, the accuracy of SI-based models was slightly lower than that of CARS-based models. But spectral index had a good adaptability to the models, and each SI-based model displayed the similar accuracy. For different spectra, the accuracy of CARS-based model differed from modeling methods. (4) The optimal CARS-based model was model CARS-CR-SVR (R(2) and RMSE: 0.92 and 1.91 g/kg in validation set, RPD: 3.23). The optimal SI-based model was model SI3-SVR (R(2) and RMSE: 0.87 and 2.40 g/kg in validation set, RPD: 2.57) and model SI-SVR (R(2) and RMSE: 0.84 and 2.63 g/kg in validation set, RPD: 2.35).
format Online
Article
Text
id pubmed-10266645
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-102666452023-06-15 Improved multivariate modeling for soil organic matter content estimation using hyperspectral indexes and characteristic bands Zhao, Ming-Song Wang, Tao Lu, Yuanyuan Wang, Shihang Wu, Yunjin PLoS One Research Article Soil organic matter (SOM) is a key index of soil fertility. Calculating spectral index and screening characteristic band reduce redundancy information of hyperspectral data, and improve the accuracy of SOM prediction. This study aimed to compare the improvement of model accuracy by spectral index and characteristic band. This study collected 178 samples of topsoil (0–20 cm) in the central plain of Jiangsu, East China. Firstly, visible and near-infrared (VNIR, 350–2500 nm) reflectance spectra were measured using ASD FieldSpec 4 Std-Res spectral radiometer in the laboratory, and inverse-log reflectance (LR), continuum removal (CR), first-order derivative reflectance (FDR) were applied to transform the original reflectance (R). Secondly, optimal spectral indexes (including deviation of arch, difference index, ratio index, and normalized difference index) were calculated from each type of VNIR spectra. Characteristic bands were selected from each type of spectra by the competitive adaptive reweighted sampling (CARS) algorithm, respectively. Thirdly, SOM prediction models were established based on random forest (RF), support vector regression (SVR), deep neural networks (DNN) and partial least squares regression (PLSR) methods using optimal spectral indexes, denoted here as SI-based models. Meanwhile, SOM prediction models were established using characteristic wavelengths, denoted here as CARS-based models. Finally, this research compared and assessed accuracy of SI-based models and CARS-based models, and selected optimal model. Results showed: (1) The correlation between optimal spectral indexes and SOM was enhanced, with absolute value of correlation coefficient between 0.66 and 0.83. The SI-based models predicted SOM content accurately, with the coefficient of determination (R(2)) and root mean square error (RMSE) values ranging from 0.80 to 0.87, 2.40 g/kg to 2.88 g/kg in validation sets, and relative percent deviation (RPD) value between 2.14 and 2.52. (2) The accuracy of CARS-based models differed with models and spectral transformations. For all spectral transformations, PLSR and SVR combined with CARS displayed the best prediction (R(2) and RMSE values ranged from 0.87 to 0.92, 1.91 g/kg to 2.56 g/kg in validation sets, and RPD value ranged from 2.41 to 3.23). For FDR and CR spectra, DNN and RF models achieved more accuracy (R(2) and RMSE values ranged from 0.69 to 0.91, 1.90 g/kg to 3.57 g/kg in validation sets, and RPD value ranged from 1.73 to 3.25) than LR and R spectra (R(2) and RMSE values from 0.20 to 0.35, 5.08 g/kg to 6.44 g/kg in validation sets, and RPD value ranged from 0.96 to 1.21). (3) Overall, the accuracy of SI-based models was slightly lower than that of CARS-based models. But spectral index had a good adaptability to the models, and each SI-based model displayed the similar accuracy. For different spectra, the accuracy of CARS-based model differed from modeling methods. (4) The optimal CARS-based model was model CARS-CR-SVR (R(2) and RMSE: 0.92 and 1.91 g/kg in validation set, RPD: 3.23). The optimal SI-based model was model SI3-SVR (R(2) and RMSE: 0.87 and 2.40 g/kg in validation set, RPD: 2.57) and model SI-SVR (R(2) and RMSE: 0.84 and 2.63 g/kg in validation set, RPD: 2.35). Public Library of Science 2023-06-14 /pmc/articles/PMC10266645/ /pubmed/37315071 http://dx.doi.org/10.1371/journal.pone.0286825 Text en © 2023 Zhao et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Zhao, Ming-Song
Wang, Tao
Lu, Yuanyuan
Wang, Shihang
Wu, Yunjin
Improved multivariate modeling for soil organic matter content estimation using hyperspectral indexes and characteristic bands
title Improved multivariate modeling for soil organic matter content estimation using hyperspectral indexes and characteristic bands
title_full Improved multivariate modeling for soil organic matter content estimation using hyperspectral indexes and characteristic bands
title_fullStr Improved multivariate modeling for soil organic matter content estimation using hyperspectral indexes and characteristic bands
title_full_unstemmed Improved multivariate modeling for soil organic matter content estimation using hyperspectral indexes and characteristic bands
title_short Improved multivariate modeling for soil organic matter content estimation using hyperspectral indexes and characteristic bands
title_sort improved multivariate modeling for soil organic matter content estimation using hyperspectral indexes and characteristic bands
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10266645/
https://www.ncbi.nlm.nih.gov/pubmed/37315071
http://dx.doi.org/10.1371/journal.pone.0286825
work_keys_str_mv AT zhaomingsong improvedmultivariatemodelingforsoilorganicmattercontentestimationusinghyperspectralindexesandcharacteristicbands
AT wangtao improvedmultivariatemodelingforsoilorganicmattercontentestimationusinghyperspectralindexesandcharacteristicbands
AT luyuanyuan improvedmultivariatemodelingforsoilorganicmattercontentestimationusinghyperspectralindexesandcharacteristicbands
AT wangshihang improvedmultivariatemodelingforsoilorganicmattercontentestimationusinghyperspectralindexesandcharacteristicbands
AT wuyunjin improvedmultivariatemodelingforsoilorganicmattercontentestimationusinghyperspectralindexesandcharacteristicbands