Cargando…

Estimation of Discriminative Feature Subset Using Community Modularity

Feature selection (FS) is an important preprocessing step in machine learning and data mining. In this paper, a new feature subset evaluation method is proposed by constructing a sample graph (SG) in different k-features and applying community modularity to select highly informative features as a gr...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Guodong, Liu, Sanming
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4848544/
https://www.ncbi.nlm.nih.gov/pubmed/27121171
http://dx.doi.org/10.1038/srep25040
_version_ 1782429364569243648
author Zhao, Guodong
Liu, Sanming
author_facet Zhao, Guodong
Liu, Sanming
author_sort Zhao, Guodong
collection PubMed
description Feature selection (FS) is an important preprocessing step in machine learning and data mining. In this paper, a new feature subset evaluation method is proposed by constructing a sample graph (SG) in different k-features and applying community modularity to select highly informative features as a group. However, these features may not be relevant as an individual. Furthermore, relevant in-dependency rather than irrelevant redundancy among the selected features is effectively measured with the community modularity Q value of the sample graph in the k-features. An efficient FS method called k-features sample graph feature selection is presented. A key property of this approach is that the discriminative cues of a feature subset with the maximum relevant in-dependency among features can be accurately determined. This community modularity-based method is then verified with the theory of k-means cluster. Compared with other state-of-the-art methods, the proposed approach is more effective, as verified by the results of several experiments.
format Online
Article
Text
id pubmed-4848544
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-48485442016-05-05 Estimation of Discriminative Feature Subset Using Community Modularity Zhao, Guodong Liu, Sanming Sci Rep Article Feature selection (FS) is an important preprocessing step in machine learning and data mining. In this paper, a new feature subset evaluation method is proposed by constructing a sample graph (SG) in different k-features and applying community modularity to select highly informative features as a group. However, these features may not be relevant as an individual. Furthermore, relevant in-dependency rather than irrelevant redundancy among the selected features is effectively measured with the community modularity Q value of the sample graph in the k-features. An efficient FS method called k-features sample graph feature selection is presented. A key property of this approach is that the discriminative cues of a feature subset with the maximum relevant in-dependency among features can be accurately determined. This community modularity-based method is then verified with the theory of k-means cluster. Compared with other state-of-the-art methods, the proposed approach is more effective, as verified by the results of several experiments. Nature Publishing Group 2016-04-28 /pmc/articles/PMC4848544/ /pubmed/27121171 http://dx.doi.org/10.1038/srep25040 Text en Copyright © 2016, Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Zhao, Guodong
Liu, Sanming
Estimation of Discriminative Feature Subset Using Community Modularity
title Estimation of Discriminative Feature Subset Using Community Modularity
title_full Estimation of Discriminative Feature Subset Using Community Modularity
title_fullStr Estimation of Discriminative Feature Subset Using Community Modularity
title_full_unstemmed Estimation of Discriminative Feature Subset Using Community Modularity
title_short Estimation of Discriminative Feature Subset Using Community Modularity
title_sort estimation of discriminative feature subset using community modularity
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4848544/
https://www.ncbi.nlm.nih.gov/pubmed/27121171
http://dx.doi.org/10.1038/srep25040
work_keys_str_mv AT zhaoguodong estimationofdiscriminativefeaturesubsetusingcommunitymodularity
AT liusanming estimationofdiscriminativefeaturesubsetusingcommunitymodularity