Cargando…

Seeding and Harvest: A Framework for Unsupervised Feature Selection Problems

Feature selection, also known as attribute selection, is the technique of selecting a subset of relevant features for building robust object models. It is becoming more and more important for large-scale sensors applications with AI capabilities. The core idea of this paper is derived from a straigh...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Gang, Cai, Yuanli, Shi, Juan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3574678/
https://www.ncbi.nlm.nih.gov/pubmed/23271599
http://dx.doi.org/10.3390/s130100292
_version_ 1782259613034348544
author Chen, Gang
Cai, Yuanli
Shi, Juan
author_facet Chen, Gang
Cai, Yuanli
Shi, Juan
author_sort Chen, Gang
collection PubMed
description Feature selection, also known as attribute selection, is the technique of selecting a subset of relevant features for building robust object models. It is becoming more and more important for large-scale sensors applications with AI capabilities. The core idea of this paper is derived from a straightforward and intuitive principle saying that, if a feature subset (pattern) has more representativeness, it should be more self-organized, and as a result it should be more insensitive to artificially seeded noise points. In the light of this heuristic finding, we established the whole set of theoretical principles, based on which we proposed a two-stage framework to evaluate the relative importance of feature subsets, called seeding and harvest (S&H for short). At the first stage, we inject a number of artificial noise points into the original dataset; then at the second stage, we resort to an outlier detector to identify them under various feature patterns. The more precisely the seeded points can be extracted under a particular feature pattern, the more valuable and important the corresponding feature pattern should be. Besides, we compared our method with several state-of-the-art feature selection methods on a number of real-life datasets. The experiment results significantly confirm that our method can accomplish feature reduction tasks with high accuracy as well as low computing complexity.
format Online
Article
Text
id pubmed-3574678
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-35746782013-02-25 Seeding and Harvest: A Framework for Unsupervised Feature Selection Problems Chen, Gang Cai, Yuanli Shi, Juan Sensors (Basel) Article Feature selection, also known as attribute selection, is the technique of selecting a subset of relevant features for building robust object models. It is becoming more and more important for large-scale sensors applications with AI capabilities. The core idea of this paper is derived from a straightforward and intuitive principle saying that, if a feature subset (pattern) has more representativeness, it should be more self-organized, and as a result it should be more insensitive to artificially seeded noise points. In the light of this heuristic finding, we established the whole set of theoretical principles, based on which we proposed a two-stage framework to evaluate the relative importance of feature subsets, called seeding and harvest (S&H for short). At the first stage, we inject a number of artificial noise points into the original dataset; then at the second stage, we resort to an outlier detector to identify them under various feature patterns. The more precisely the seeded points can be extracted under a particular feature pattern, the more valuable and important the corresponding feature pattern should be. Besides, we compared our method with several state-of-the-art feature selection methods on a number of real-life datasets. The experiment results significantly confirm that our method can accomplish feature reduction tasks with high accuracy as well as low computing complexity. MDPI 2012-12-27 /pmc/articles/PMC3574678/ /pubmed/23271599 http://dx.doi.org/10.3390/s130100292 Text en © 2013 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).
spellingShingle Article
Chen, Gang
Cai, Yuanli
Shi, Juan
Seeding and Harvest: A Framework for Unsupervised Feature Selection Problems
title Seeding and Harvest: A Framework for Unsupervised Feature Selection Problems
title_full Seeding and Harvest: A Framework for Unsupervised Feature Selection Problems
title_fullStr Seeding and Harvest: A Framework for Unsupervised Feature Selection Problems
title_full_unstemmed Seeding and Harvest: A Framework for Unsupervised Feature Selection Problems
title_short Seeding and Harvest: A Framework for Unsupervised Feature Selection Problems
title_sort seeding and harvest: a framework for unsupervised feature selection problems
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3574678/
https://www.ncbi.nlm.nih.gov/pubmed/23271599
http://dx.doi.org/10.3390/s130100292
work_keys_str_mv AT chengang seedingandharvestaframeworkforunsupervisedfeatureselectionproblems
AT caiyuanli seedingandharvestaframeworkforunsupervisedfeatureselectionproblems
AT shijuan seedingandharvestaframeworkforunsupervisedfeatureselectionproblems