Cargando…
A Feature Selection Approach Based on Interclass and Intraclass Relative Contributions of Terms
Feature selection plays a critical role in text categorization. During feature selecting, high-frequency terms and the interclass and intraclass relative contributions of terms all have significant effects on classification results. So we put forward a feature selection approach, IIRCT, based on int...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi Publishing Corporation
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4992800/ https://www.ncbi.nlm.nih.gov/pubmed/27579032 http://dx.doi.org/10.1155/2016/1715780 |
_version_ | 1782449055199133696 |
---|---|
author | Zhou, Hongfang Guo, Jie Wang, Yinghui Zhao, Minghua |
author_facet | Zhou, Hongfang Guo, Jie Wang, Yinghui Zhao, Minghua |
author_sort | Zhou, Hongfang |
collection | PubMed |
description | Feature selection plays a critical role in text categorization. During feature selecting, high-frequency terms and the interclass and intraclass relative contributions of terms all have significant effects on classification results. So we put forward a feature selection approach, IIRCT, based on interclass and intraclass relative contributions of terms in the paper. In our proposed algorithm, three critical factors, which are term frequency and the interclass relative contribution and the intraclass relative contribution of terms, are all considered synthetically. Finally, experiments are made with the help of kNN classifier. And the corresponding results on 20 NewsGroup and SougouCS corpora show that IIRCT algorithm achieves better performance than DF, t-Test, and CMFS algorithms. |
format | Online Article Text |
id | pubmed-4992800 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-49928002016-08-30 A Feature Selection Approach Based on Interclass and Intraclass Relative Contributions of Terms Zhou, Hongfang Guo, Jie Wang, Yinghui Zhao, Minghua Comput Intell Neurosci Research Article Feature selection plays a critical role in text categorization. During feature selecting, high-frequency terms and the interclass and intraclass relative contributions of terms all have significant effects on classification results. So we put forward a feature selection approach, IIRCT, based on interclass and intraclass relative contributions of terms in the paper. In our proposed algorithm, three critical factors, which are term frequency and the interclass relative contribution and the intraclass relative contribution of terms, are all considered synthetically. Finally, experiments are made with the help of kNN classifier. And the corresponding results on 20 NewsGroup and SougouCS corpora show that IIRCT algorithm achieves better performance than DF, t-Test, and CMFS algorithms. Hindawi Publishing Corporation 2016 2016-08-08 /pmc/articles/PMC4992800/ /pubmed/27579032 http://dx.doi.org/10.1155/2016/1715780 Text en Copyright © 2016 Hongfang Zhou et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Zhou, Hongfang Guo, Jie Wang, Yinghui Zhao, Minghua A Feature Selection Approach Based on Interclass and Intraclass Relative Contributions of Terms |
title | A Feature Selection Approach Based on Interclass and Intraclass Relative Contributions of Terms |
title_full | A Feature Selection Approach Based on Interclass and Intraclass Relative Contributions of Terms |
title_fullStr | A Feature Selection Approach Based on Interclass and Intraclass Relative Contributions of Terms |
title_full_unstemmed | A Feature Selection Approach Based on Interclass and Intraclass Relative Contributions of Terms |
title_short | A Feature Selection Approach Based on Interclass and Intraclass Relative Contributions of Terms |
title_sort | feature selection approach based on interclass and intraclass relative contributions of terms |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4992800/ https://www.ncbi.nlm.nih.gov/pubmed/27579032 http://dx.doi.org/10.1155/2016/1715780 |
work_keys_str_mv | AT zhouhongfang afeatureselectionapproachbasedoninterclassandintraclassrelativecontributionsofterms AT guojie afeatureselectionapproachbasedoninterclassandintraclassrelativecontributionsofterms AT wangyinghui afeatureselectionapproachbasedoninterclassandintraclassrelativecontributionsofterms AT zhaominghua afeatureselectionapproachbasedoninterclassandintraclassrelativecontributionsofterms AT zhouhongfang featureselectionapproachbasedoninterclassandintraclassrelativecontributionsofterms AT guojie featureselectionapproachbasedoninterclassandintraclassrelativecontributionsofterms AT wangyinghui featureselectionapproachbasedoninterclassandintraclassrelativecontributionsofterms AT zhaominghua featureselectionapproachbasedoninterclassandintraclassrelativecontributionsofterms |