Cargando…
Estimation of Discriminative Feature Subset Using Community Modularity
Feature selection (FS) is an important preprocessing step in machine learning and data mining. In this paper, a new feature subset evaluation method is proposed by constructing a sample graph (SG) in different k-features and applying community modularity to select highly informative features as a gr...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4848544/ https://www.ncbi.nlm.nih.gov/pubmed/27121171 http://dx.doi.org/10.1038/srep25040 |
_version_ | 1782429364569243648 |
---|---|
author | Zhao, Guodong Liu, Sanming |
author_facet | Zhao, Guodong Liu, Sanming |
author_sort | Zhao, Guodong |
collection | PubMed |
description | Feature selection (FS) is an important preprocessing step in machine learning and data mining. In this paper, a new feature subset evaluation method is proposed by constructing a sample graph (SG) in different k-features and applying community modularity to select highly informative features as a group. However, these features may not be relevant as an individual. Furthermore, relevant in-dependency rather than irrelevant redundancy among the selected features is effectively measured with the community modularity Q value of the sample graph in the k-features. An efficient FS method called k-features sample graph feature selection is presented. A key property of this approach is that the discriminative cues of a feature subset with the maximum relevant in-dependency among features can be accurately determined. This community modularity-based method is then verified with the theory of k-means cluster. Compared with other state-of-the-art methods, the proposed approach is more effective, as verified by the results of several experiments. |
format | Online Article Text |
id | pubmed-4848544 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-48485442016-05-05 Estimation of Discriminative Feature Subset Using Community Modularity Zhao, Guodong Liu, Sanming Sci Rep Article Feature selection (FS) is an important preprocessing step in machine learning and data mining. In this paper, a new feature subset evaluation method is proposed by constructing a sample graph (SG) in different k-features and applying community modularity to select highly informative features as a group. However, these features may not be relevant as an individual. Furthermore, relevant in-dependency rather than irrelevant redundancy among the selected features is effectively measured with the community modularity Q value of the sample graph in the k-features. An efficient FS method called k-features sample graph feature selection is presented. A key property of this approach is that the discriminative cues of a feature subset with the maximum relevant in-dependency among features can be accurately determined. This community modularity-based method is then verified with the theory of k-means cluster. Compared with other state-of-the-art methods, the proposed approach is more effective, as verified by the results of several experiments. Nature Publishing Group 2016-04-28 /pmc/articles/PMC4848544/ /pubmed/27121171 http://dx.doi.org/10.1038/srep25040 Text en Copyright © 2016, Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Zhao, Guodong Liu, Sanming Estimation of Discriminative Feature Subset Using Community Modularity |
title | Estimation of Discriminative Feature Subset Using Community Modularity |
title_full | Estimation of Discriminative Feature Subset Using Community Modularity |
title_fullStr | Estimation of Discriminative Feature Subset Using Community Modularity |
title_full_unstemmed | Estimation of Discriminative Feature Subset Using Community Modularity |
title_short | Estimation of Discriminative Feature Subset Using Community Modularity |
title_sort | estimation of discriminative feature subset using community modularity |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4848544/ https://www.ncbi.nlm.nih.gov/pubmed/27121171 http://dx.doi.org/10.1038/srep25040 |
work_keys_str_mv | AT zhaoguodong estimationofdiscriminativefeaturesubsetusingcommunitymodularity AT liusanming estimationofdiscriminativefeaturesubsetusingcommunitymodularity |