Cargando…

Research on Hybrid Feature Selection Method Based on Iterative Approximation Markov Blanket

The basic experimental data of traditional Chinese medicine are generally obtained by high-performance liquid chromatography and mass spectrometry. The data often show the characteristics of high dimensionality and few samples, and there are many irrelevant features and redundant features in the dat...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Canyi, Li, Keding, Du, Jianqiang, Nie, Bin, Xu, Guoliang, Xiong, Wangping, Luo, Jigen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7166270/
https://www.ncbi.nlm.nih.gov/pubmed/32328156
http://dx.doi.org/10.1155/2020/8308173
_version_ 1783523525060984832
author Huang, Canyi
Li, Keding
Du, Jianqiang
Nie, Bin
Xu, Guoliang
Xiong, Wangping
Luo, Jigen
author_facet Huang, Canyi
Li, Keding
Du, Jianqiang
Nie, Bin
Xu, Guoliang
Xiong, Wangping
Luo, Jigen
author_sort Huang, Canyi
collection PubMed
description The basic experimental data of traditional Chinese medicine are generally obtained by high-performance liquid chromatography and mass spectrometry. The data often show the characteristics of high dimensionality and few samples, and there are many irrelevant features and redundant features in the data, which bring challenges to the in-depth exploration of Chinese medicine material information. A hybrid feature selection method based on iterative approximate Markov blanket (CI_AMB) is proposed in the paper. The method uses the maximum information coefficient to measure the correlation between features and target variables and achieves the purpose of filtering irrelevant features according to the evaluation criteria, firstly. The iterative approximation Markov blanket strategy analyzes the redundancy between features and implements the elimination of redundant features and then selects an effective feature subset finally. Comparative experiments using traditional Chinese medicine material basic experimental data and UCI's multiple public datasets show that the new method has a better advantage to select a small number of highly explanatory features, compared with Lasso, XGBoost, and the classic approximate Markov blanket method.
format Online
Article
Text
id pubmed-7166270
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-71662702020-04-23 Research on Hybrid Feature Selection Method Based on Iterative Approximation Markov Blanket Huang, Canyi Li, Keding Du, Jianqiang Nie, Bin Xu, Guoliang Xiong, Wangping Luo, Jigen Comput Math Methods Med Research Article The basic experimental data of traditional Chinese medicine are generally obtained by high-performance liquid chromatography and mass spectrometry. The data often show the characteristics of high dimensionality and few samples, and there are many irrelevant features and redundant features in the data, which bring challenges to the in-depth exploration of Chinese medicine material information. A hybrid feature selection method based on iterative approximate Markov blanket (CI_AMB) is proposed in the paper. The method uses the maximum information coefficient to measure the correlation between features and target variables and achieves the purpose of filtering irrelevant features according to the evaluation criteria, firstly. The iterative approximation Markov blanket strategy analyzes the redundancy between features and implements the elimination of redundant features and then selects an effective feature subset finally. Comparative experiments using traditional Chinese medicine material basic experimental data and UCI's multiple public datasets show that the new method has a better advantage to select a small number of highly explanatory features, compared with Lasso, XGBoost, and the classic approximate Markov blanket method. Hindawi 2020-04-07 /pmc/articles/PMC7166270/ /pubmed/32328156 http://dx.doi.org/10.1155/2020/8308173 Text en Copyright © 2020 Canyi Huang et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Huang, Canyi
Li, Keding
Du, Jianqiang
Nie, Bin
Xu, Guoliang
Xiong, Wangping
Luo, Jigen
Research on Hybrid Feature Selection Method Based on Iterative Approximation Markov Blanket
title Research on Hybrid Feature Selection Method Based on Iterative Approximation Markov Blanket
title_full Research on Hybrid Feature Selection Method Based on Iterative Approximation Markov Blanket
title_fullStr Research on Hybrid Feature Selection Method Based on Iterative Approximation Markov Blanket
title_full_unstemmed Research on Hybrid Feature Selection Method Based on Iterative Approximation Markov Blanket
title_short Research on Hybrid Feature Selection Method Based on Iterative Approximation Markov Blanket
title_sort research on hybrid feature selection method based on iterative approximation markov blanket
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7166270/
https://www.ncbi.nlm.nih.gov/pubmed/32328156
http://dx.doi.org/10.1155/2020/8308173
work_keys_str_mv AT huangcanyi researchonhybridfeatureselectionmethodbasedoniterativeapproximationmarkovblanket
AT likeding researchonhybridfeatureselectionmethodbasedoniterativeapproximationmarkovblanket
AT dujianqiang researchonhybridfeatureselectionmethodbasedoniterativeapproximationmarkovblanket
AT niebin researchonhybridfeatureselectionmethodbasedoniterativeapproximationmarkovblanket
AT xuguoliang researchonhybridfeatureselectionmethodbasedoniterativeapproximationmarkovblanket
AT xiongwangping researchonhybridfeatureselectionmethodbasedoniterativeapproximationmarkovblanket
AT luojigen researchonhybridfeatureselectionmethodbasedoniterativeapproximationmarkovblanket