Cargando…

A Feature Selection Approach Based on Interclass and Intraclass Relative Contributions of Terms

Feature selection plays a critical role in text categorization. During feature selecting, high-frequency terms and the interclass and intraclass relative contributions of terms all have significant effects on classification results. So we put forward a feature selection approach, IIRCT, based on int...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Hongfang, Guo, Jie, Wang, Yinghui, Zhao, Minghua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4992800/
https://www.ncbi.nlm.nih.gov/pubmed/27579032
http://dx.doi.org/10.1155/2016/1715780
_version_ 1782449055199133696
author Zhou, Hongfang
Guo, Jie
Wang, Yinghui
Zhao, Minghua
author_facet Zhou, Hongfang
Guo, Jie
Wang, Yinghui
Zhao, Minghua
author_sort Zhou, Hongfang
collection PubMed
description Feature selection plays a critical role in text categorization. During feature selecting, high-frequency terms and the interclass and intraclass relative contributions of terms all have significant effects on classification results. So we put forward a feature selection approach, IIRCT, based on interclass and intraclass relative contributions of terms in the paper. In our proposed algorithm, three critical factors, which are term frequency and the interclass relative contribution and the intraclass relative contribution of terms, are all considered synthetically. Finally, experiments are made with the help of kNN classifier. And the corresponding results on 20 NewsGroup and SougouCS corpora show that IIRCT algorithm achieves better performance than DF, t-Test, and CMFS algorithms.
format Online
Article
Text
id pubmed-4992800
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-49928002016-08-30 A Feature Selection Approach Based on Interclass and Intraclass Relative Contributions of Terms Zhou, Hongfang Guo, Jie Wang, Yinghui Zhao, Minghua Comput Intell Neurosci Research Article Feature selection plays a critical role in text categorization. During feature selecting, high-frequency terms and the interclass and intraclass relative contributions of terms all have significant effects on classification results. So we put forward a feature selection approach, IIRCT, based on interclass and intraclass relative contributions of terms in the paper. In our proposed algorithm, three critical factors, which are term frequency and the interclass relative contribution and the intraclass relative contribution of terms, are all considered synthetically. Finally, experiments are made with the help of kNN classifier. And the corresponding results on 20 NewsGroup and SougouCS corpora show that IIRCT algorithm achieves better performance than DF, t-Test, and CMFS algorithms. Hindawi Publishing Corporation 2016 2016-08-08 /pmc/articles/PMC4992800/ /pubmed/27579032 http://dx.doi.org/10.1155/2016/1715780 Text en Copyright © 2016 Hongfang Zhou et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Zhou, Hongfang
Guo, Jie
Wang, Yinghui
Zhao, Minghua
A Feature Selection Approach Based on Interclass and Intraclass Relative Contributions of Terms
title A Feature Selection Approach Based on Interclass and Intraclass Relative Contributions of Terms
title_full A Feature Selection Approach Based on Interclass and Intraclass Relative Contributions of Terms
title_fullStr A Feature Selection Approach Based on Interclass and Intraclass Relative Contributions of Terms
title_full_unstemmed A Feature Selection Approach Based on Interclass and Intraclass Relative Contributions of Terms
title_short A Feature Selection Approach Based on Interclass and Intraclass Relative Contributions of Terms
title_sort feature selection approach based on interclass and intraclass relative contributions of terms
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4992800/
https://www.ncbi.nlm.nih.gov/pubmed/27579032
http://dx.doi.org/10.1155/2016/1715780
work_keys_str_mv AT zhouhongfang afeatureselectionapproachbasedoninterclassandintraclassrelativecontributionsofterms
AT guojie afeatureselectionapproachbasedoninterclassandintraclassrelativecontributionsofterms
AT wangyinghui afeatureselectionapproachbasedoninterclassandintraclassrelativecontributionsofterms
AT zhaominghua afeatureselectionapproachbasedoninterclassandintraclassrelativecontributionsofterms
AT zhouhongfang featureselectionapproachbasedoninterclassandintraclassrelativecontributionsofterms
AT guojie featureselectionapproachbasedoninterclassandintraclassrelativecontributionsofterms
AT wangyinghui featureselectionapproachbasedoninterclassandintraclassrelativecontributionsofterms
AT zhaominghua featureselectionapproachbasedoninterclassandintraclassrelativecontributionsofterms