Cargando…
An ensemble variable selection method for vibrational spectroscopic data analysis
Wavelength selection is a critical factor for pattern recognition of vibrational spectroscopic data. Not only does it alleviate the effect of dimensionality on an algorithm's generalization performance, but it also enhances the understanding and interpretability of multivariate classification m...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The Royal Society of Chemistry
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9087301/ https://www.ncbi.nlm.nih.gov/pubmed/35548689 http://dx.doi.org/10.1039/c8ra08754g |
_version_ | 1784704174150123520 |
---|---|
author | Zhang, Jixiong Yan, Hong Xiong, Yanmei Li, Qianqian Min, Shungeng |
author_facet | Zhang, Jixiong Yan, Hong Xiong, Yanmei Li, Qianqian Min, Shungeng |
author_sort | Zhang, Jixiong |
collection | PubMed |
description | Wavelength selection is a critical factor for pattern recognition of vibrational spectroscopic data. Not only does it alleviate the effect of dimensionality on an algorithm's generalization performance, but it also enhances the understanding and interpretability of multivariate classification models. In this study, a novel partial least squares discriminant analysis (PLSDA)-based wavelength selection algorithm, termed ensemble of bootstrapping space shrinkage (EBSS), has been devised for vibrational spectroscopic data analysis. In the algorithm, a set of subsets are generated from a data set using random sampling. For an individual subset, a feature space is determined by maximizing the expected 10-fold cross-validation accuracy with a weighted bootstrap sampling strategy. Then an ensemble strategy and a sequential forward selection method are applied to the feature spaces to select characteristic variables. Experimental results obtained from analysis of real vibrational spectroscopic data sets demonstrate that the ensemble wavelength selection algorithm can reserve stable and informative variables for the final modeling and improve predictive ability for multivariate classification models. |
format | Online Article Text |
id | pubmed-9087301 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | The Royal Society of Chemistry |
record_format | MEDLINE/PubMed |
spelling | pubmed-90873012022-05-10 An ensemble variable selection method for vibrational spectroscopic data analysis Zhang, Jixiong Yan, Hong Xiong, Yanmei Li, Qianqian Min, Shungeng RSC Adv Chemistry Wavelength selection is a critical factor for pattern recognition of vibrational spectroscopic data. Not only does it alleviate the effect of dimensionality on an algorithm's generalization performance, but it also enhances the understanding and interpretability of multivariate classification models. In this study, a novel partial least squares discriminant analysis (PLSDA)-based wavelength selection algorithm, termed ensemble of bootstrapping space shrinkage (EBSS), has been devised for vibrational spectroscopic data analysis. In the algorithm, a set of subsets are generated from a data set using random sampling. For an individual subset, a feature space is determined by maximizing the expected 10-fold cross-validation accuracy with a weighted bootstrap sampling strategy. Then an ensemble strategy and a sequential forward selection method are applied to the feature spaces to select characteristic variables. Experimental results obtained from analysis of real vibrational spectroscopic data sets demonstrate that the ensemble wavelength selection algorithm can reserve stable and informative variables for the final modeling and improve predictive ability for multivariate classification models. The Royal Society of Chemistry 2019-02-26 /pmc/articles/PMC9087301/ /pubmed/35548689 http://dx.doi.org/10.1039/c8ra08754g Text en This journal is © The Royal Society of Chemistry https://creativecommons.org/licenses/by-nc/3.0/ |
spellingShingle | Chemistry Zhang, Jixiong Yan, Hong Xiong, Yanmei Li, Qianqian Min, Shungeng An ensemble variable selection method for vibrational spectroscopic data analysis |
title | An ensemble variable selection method for vibrational spectroscopic data analysis |
title_full | An ensemble variable selection method for vibrational spectroscopic data analysis |
title_fullStr | An ensemble variable selection method for vibrational spectroscopic data analysis |
title_full_unstemmed | An ensemble variable selection method for vibrational spectroscopic data analysis |
title_short | An ensemble variable selection method for vibrational spectroscopic data analysis |
title_sort | ensemble variable selection method for vibrational spectroscopic data analysis |
topic | Chemistry |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9087301/ https://www.ncbi.nlm.nih.gov/pubmed/35548689 http://dx.doi.org/10.1039/c8ra08754g |
work_keys_str_mv | AT zhangjixiong anensemblevariableselectionmethodforvibrationalspectroscopicdataanalysis AT yanhong anensemblevariableselectionmethodforvibrationalspectroscopicdataanalysis AT xiongyanmei anensemblevariableselectionmethodforvibrationalspectroscopicdataanalysis AT liqianqian anensemblevariableselectionmethodforvibrationalspectroscopicdataanalysis AT minshungeng anensemblevariableselectionmethodforvibrationalspectroscopicdataanalysis AT zhangjixiong ensemblevariableselectionmethodforvibrationalspectroscopicdataanalysis AT yanhong ensemblevariableselectionmethodforvibrationalspectroscopicdataanalysis AT xiongyanmei ensemblevariableselectionmethodforvibrationalspectroscopicdataanalysis AT liqianqian ensemblevariableselectionmethodforvibrationalspectroscopicdataanalysis AT minshungeng ensemblevariableselectionmethodforvibrationalspectroscopicdataanalysis |