Cargando…

An ensemble variable selection method for vibrational spectroscopic data analysis

Wavelength selection is a critical factor for pattern recognition of vibrational spectroscopic data. Not only does it alleviate the effect of dimensionality on an algorithm's generalization performance, but it also enhances the understanding and interpretability of multivariate classification m...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Jixiong, Yan, Hong, Xiong, Yanmei, Li, Qianqian, Min, Shungeng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society of Chemistry 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9087301/
https://www.ncbi.nlm.nih.gov/pubmed/35548689
http://dx.doi.org/10.1039/c8ra08754g
_version_ 1784704174150123520
author Zhang, Jixiong
Yan, Hong
Xiong, Yanmei
Li, Qianqian
Min, Shungeng
author_facet Zhang, Jixiong
Yan, Hong
Xiong, Yanmei
Li, Qianqian
Min, Shungeng
author_sort Zhang, Jixiong
collection PubMed
description Wavelength selection is a critical factor for pattern recognition of vibrational spectroscopic data. Not only does it alleviate the effect of dimensionality on an algorithm's generalization performance, but it also enhances the understanding and interpretability of multivariate classification models. In this study, a novel partial least squares discriminant analysis (PLSDA)-based wavelength selection algorithm, termed ensemble of bootstrapping space shrinkage (EBSS), has been devised for vibrational spectroscopic data analysis. In the algorithm, a set of subsets are generated from a data set using random sampling. For an individual subset, a feature space is determined by maximizing the expected 10-fold cross-validation accuracy with a weighted bootstrap sampling strategy. Then an ensemble strategy and a sequential forward selection method are applied to the feature spaces to select characteristic variables. Experimental results obtained from analysis of real vibrational spectroscopic data sets demonstrate that the ensemble wavelength selection algorithm can reserve stable and informative variables for the final modeling and improve predictive ability for multivariate classification models.
format Online
Article
Text
id pubmed-9087301
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher The Royal Society of Chemistry
record_format MEDLINE/PubMed
spelling pubmed-90873012022-05-10 An ensemble variable selection method for vibrational spectroscopic data analysis Zhang, Jixiong Yan, Hong Xiong, Yanmei Li, Qianqian Min, Shungeng RSC Adv Chemistry Wavelength selection is a critical factor for pattern recognition of vibrational spectroscopic data. Not only does it alleviate the effect of dimensionality on an algorithm's generalization performance, but it also enhances the understanding and interpretability of multivariate classification models. In this study, a novel partial least squares discriminant analysis (PLSDA)-based wavelength selection algorithm, termed ensemble of bootstrapping space shrinkage (EBSS), has been devised for vibrational spectroscopic data analysis. In the algorithm, a set of subsets are generated from a data set using random sampling. For an individual subset, a feature space is determined by maximizing the expected 10-fold cross-validation accuracy with a weighted bootstrap sampling strategy. Then an ensemble strategy and a sequential forward selection method are applied to the feature spaces to select characteristic variables. Experimental results obtained from analysis of real vibrational spectroscopic data sets demonstrate that the ensemble wavelength selection algorithm can reserve stable and informative variables for the final modeling and improve predictive ability for multivariate classification models. The Royal Society of Chemistry 2019-02-26 /pmc/articles/PMC9087301/ /pubmed/35548689 http://dx.doi.org/10.1039/c8ra08754g Text en This journal is © The Royal Society of Chemistry https://creativecommons.org/licenses/by-nc/3.0/
spellingShingle Chemistry
Zhang, Jixiong
Yan, Hong
Xiong, Yanmei
Li, Qianqian
Min, Shungeng
An ensemble variable selection method for vibrational spectroscopic data analysis
title An ensemble variable selection method for vibrational spectroscopic data analysis
title_full An ensemble variable selection method for vibrational spectroscopic data analysis
title_fullStr An ensemble variable selection method for vibrational spectroscopic data analysis
title_full_unstemmed An ensemble variable selection method for vibrational spectroscopic data analysis
title_short An ensemble variable selection method for vibrational spectroscopic data analysis
title_sort ensemble variable selection method for vibrational spectroscopic data analysis
topic Chemistry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9087301/
https://www.ncbi.nlm.nih.gov/pubmed/35548689
http://dx.doi.org/10.1039/c8ra08754g
work_keys_str_mv AT zhangjixiong anensemblevariableselectionmethodforvibrationalspectroscopicdataanalysis
AT yanhong anensemblevariableselectionmethodforvibrationalspectroscopicdataanalysis
AT xiongyanmei anensemblevariableselectionmethodforvibrationalspectroscopicdataanalysis
AT liqianqian anensemblevariableselectionmethodforvibrationalspectroscopicdataanalysis
AT minshungeng anensemblevariableselectionmethodforvibrationalspectroscopicdataanalysis
AT zhangjixiong ensemblevariableselectionmethodforvibrationalspectroscopicdataanalysis
AT yanhong ensemblevariableselectionmethodforvibrationalspectroscopicdataanalysis
AT xiongyanmei ensemblevariableselectionmethodforvibrationalspectroscopicdataanalysis
AT liqianqian ensemblevariableselectionmethodforvibrationalspectroscopicdataanalysis
AT minshungeng ensemblevariableselectionmethodforvibrationalspectroscopicdataanalysis