Cargando…

CBFS: High Performance Feature Selection Algorithm Based on Feature Clearness

BACKGROUND: The goal of feature selection is to select useful features and simultaneously exclude garbage features from a given dataset for classification purposes. This is expected to bring reduction of processing time and improvement of classification accuracy. METHODOLOGY: In this study, we devis...

Descripción completa

Detalles Bibliográficos
Autores principales: Seo, Minseok, Oh, Sejong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3391246/
https://www.ncbi.nlm.nih.gov/pubmed/22792310
http://dx.doi.org/10.1371/journal.pone.0040419
_version_ 1782237502534320128
author Seo, Minseok
Oh, Sejong
author_facet Seo, Minseok
Oh, Sejong
author_sort Seo, Minseok
collection PubMed
description BACKGROUND: The goal of feature selection is to select useful features and simultaneously exclude garbage features from a given dataset for classification purposes. This is expected to bring reduction of processing time and improvement of classification accuracy. METHODOLOGY: In this study, we devised a new feature selection algorithm (CBFS) based on clearness of features. Feature clearness expresses separability among classes in a feature. Highly clear features contribute towards obtaining high classification accuracy. CScore is a measure to score clearness of each feature and is based on clustered samples to centroid of classes in a feature. We also suggest combining CBFS and other algorithms to improve classification accuracy. CONCLUSIONS/SIGNIFICANCE: From the experiment we confirm that CBFS is more excellent than up-to-date feature selection algorithms including FeaLect. CBFS can be applied to microarray gene selection, text categorization, and image classification.
format Online
Article
Text
id pubmed-3391246
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-33912462012-07-12 CBFS: High Performance Feature Selection Algorithm Based on Feature Clearness Seo, Minseok Oh, Sejong PLoS One Research Article BACKGROUND: The goal of feature selection is to select useful features and simultaneously exclude garbage features from a given dataset for classification purposes. This is expected to bring reduction of processing time and improvement of classification accuracy. METHODOLOGY: In this study, we devised a new feature selection algorithm (CBFS) based on clearness of features. Feature clearness expresses separability among classes in a feature. Highly clear features contribute towards obtaining high classification accuracy. CScore is a measure to score clearness of each feature and is based on clustered samples to centroid of classes in a feature. We also suggest combining CBFS and other algorithms to improve classification accuracy. CONCLUSIONS/SIGNIFICANCE: From the experiment we confirm that CBFS is more excellent than up-to-date feature selection algorithms including FeaLect. CBFS can be applied to microarray gene selection, text categorization, and image classification. Public Library of Science 2012-07-06 /pmc/articles/PMC3391246/ /pubmed/22792310 http://dx.doi.org/10.1371/journal.pone.0040419 Text en Seo, Oh. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Seo, Minseok
Oh, Sejong
CBFS: High Performance Feature Selection Algorithm Based on Feature Clearness
title CBFS: High Performance Feature Selection Algorithm Based on Feature Clearness
title_full CBFS: High Performance Feature Selection Algorithm Based on Feature Clearness
title_fullStr CBFS: High Performance Feature Selection Algorithm Based on Feature Clearness
title_full_unstemmed CBFS: High Performance Feature Selection Algorithm Based on Feature Clearness
title_short CBFS: High Performance Feature Selection Algorithm Based on Feature Clearness
title_sort cbfs: high performance feature selection algorithm based on feature clearness
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3391246/
https://www.ncbi.nlm.nih.gov/pubmed/22792310
http://dx.doi.org/10.1371/journal.pone.0040419
work_keys_str_mv AT seominseok cbfshighperformancefeatureselectionalgorithmbasedonfeatureclearness
AT ohsejong cbfshighperformancefeatureselectionalgorithmbasedonfeatureclearness