Cargando…

Feature Selection Method Based on Partial Least Squares and Analysis of Traditional Chinese Medicine Data

The partial least squares method has many advantages in multivariable linear regression, but it does not include the function of feature selection. This method cannot screen for the best feature subset (referred to in this study as the “Gold Standard”) or optimize the model, although contrarily usin...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Canyi, Du, Jianqiang, Nie, Bin, Yu, Riyue, Xiong, Wangping, Zeng, Qingxia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6636449/
https://www.ncbi.nlm.nih.gov/pubmed/31354860
http://dx.doi.org/10.1155/2019/9580126
_version_ 1783436064679002112
author Huang, Canyi
Du, Jianqiang
Nie, Bin
Yu, Riyue
Xiong, Wangping
Zeng, Qingxia
author_facet Huang, Canyi
Du, Jianqiang
Nie, Bin
Yu, Riyue
Xiong, Wangping
Zeng, Qingxia
author_sort Huang, Canyi
collection PubMed
description The partial least squares method has many advantages in multivariable linear regression, but it does not include the function of feature selection. This method cannot screen for the best feature subset (referred to in this study as the “Gold Standard”) or optimize the model, although contrarily using the L1 norm can achieve the sparse representation of parameters, leading to feature selection. In this study, a feature selection method based on partial least squares is proposed. In the new method, exploiting partial least squares allows extraction of the latent variables required for performing multivariable linear regression, and this method applies the L1 regular term constraint to the sum of the absolute values of the regression coefficients. This technique is then combined with the coordinate descent method to perform multiple iterations to select a better feature subset. Analyzing traditional Chinese medicine data and University of California, Irvine (UCI), datasets with the model, the experimental results show that the feature selection method based on partial least squares exhibits preferable adaptability for traditional Chinese medicine data and UCI datasets.
format Online
Article
Text
id pubmed-6636449
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-66364492019-07-28 Feature Selection Method Based on Partial Least Squares and Analysis of Traditional Chinese Medicine Data Huang, Canyi Du, Jianqiang Nie, Bin Yu, Riyue Xiong, Wangping Zeng, Qingxia Comput Math Methods Med Research Article The partial least squares method has many advantages in multivariable linear regression, but it does not include the function of feature selection. This method cannot screen for the best feature subset (referred to in this study as the “Gold Standard”) or optimize the model, although contrarily using the L1 norm can achieve the sparse representation of parameters, leading to feature selection. In this study, a feature selection method based on partial least squares is proposed. In the new method, exploiting partial least squares allows extraction of the latent variables required for performing multivariable linear regression, and this method applies the L1 regular term constraint to the sum of the absolute values of the regression coefficients. This technique is then combined with the coordinate descent method to perform multiple iterations to select a better feature subset. Analyzing traditional Chinese medicine data and University of California, Irvine (UCI), datasets with the model, the experimental results show that the feature selection method based on partial least squares exhibits preferable adaptability for traditional Chinese medicine data and UCI datasets. Hindawi 2019-07-01 /pmc/articles/PMC6636449/ /pubmed/31354860 http://dx.doi.org/10.1155/2019/9580126 Text en Copyright © 2019 Canyi Huang et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Huang, Canyi
Du, Jianqiang
Nie, Bin
Yu, Riyue
Xiong, Wangping
Zeng, Qingxia
Feature Selection Method Based on Partial Least Squares and Analysis of Traditional Chinese Medicine Data
title Feature Selection Method Based on Partial Least Squares and Analysis of Traditional Chinese Medicine Data
title_full Feature Selection Method Based on Partial Least Squares and Analysis of Traditional Chinese Medicine Data
title_fullStr Feature Selection Method Based on Partial Least Squares and Analysis of Traditional Chinese Medicine Data
title_full_unstemmed Feature Selection Method Based on Partial Least Squares and Analysis of Traditional Chinese Medicine Data
title_short Feature Selection Method Based on Partial Least Squares and Analysis of Traditional Chinese Medicine Data
title_sort feature selection method based on partial least squares and analysis of traditional chinese medicine data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6636449/
https://www.ncbi.nlm.nih.gov/pubmed/31354860
http://dx.doi.org/10.1155/2019/9580126
work_keys_str_mv AT huangcanyi featureselectionmethodbasedonpartialleastsquaresandanalysisoftraditionalchinesemedicinedata
AT dujianqiang featureselectionmethodbasedonpartialleastsquaresandanalysisoftraditionalchinesemedicinedata
AT niebin featureselectionmethodbasedonpartialleastsquaresandanalysisoftraditionalchinesemedicinedata
AT yuriyue featureselectionmethodbasedonpartialleastsquaresandanalysisoftraditionalchinesemedicinedata
AT xiongwangping featureselectionmethodbasedonpartialleastsquaresandanalysisoftraditionalchinesemedicinedata
AT zengqingxia featureselectionmethodbasedonpartialleastsquaresandanalysisoftraditionalchinesemedicinedata