Approach for Text Classification Based on the Similarity Measurement between Normal Cloud Models
Similarity between objects is a core research area of data mining. To reduce the interference caused by the uncertainty of natural language, a similarity measurement between normal cloud models is applied to text classification. On this basis, a novel text classifier based on cloud c...
Main Authors: | Dai, Jin, Liu, Xin |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Hindawi Publishing Corporation 2014 |
Subjects: | |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3953649/ https://www.ncbi.nlm.nih.gov/pubmed/24711737 http://dx.doi.org/10.1155/2014/784392 |
_version_ | 1782307395214508032 |
---|---|
author | Dai, Jin Liu, Xin |
author_facet | Dai, Jin Liu, Xin |
author_sort | Dai, Jin |
collection | PubMed |
description | Similarity between objects is a core research area of data mining. To reduce the interference caused by the uncertainty of natural language, a similarity measurement between normal cloud models is applied to text classification. On this basis, a novel text classifier based on cloud concept jumping up (CCJU-TC) is proposed. It can efficiently convert between qualitative concepts and quantitative data. The text set is first converted into a text information table based on the vector space model (VSM); the qualitative text concepts extracted from the same category are then jumped up into a single category-level concept. The test text is assigned to the category whose concept is most similar to it under the cloud similarity measure. Comparison with other text classifiers across different feature selection sets indicates that CCJU-TC adapts well to different text features and also achieves better classification performance than traditional classifiers. |
format | Online Article Text |
id | pubmed-3953649 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-39536492014-04-07 Approach for Text Classification Based on the Similarity Measurement between Normal Cloud Models Dai, Jin Liu, Xin ScientificWorldJournal Research Article Similarity between objects is a core research area of data mining. To reduce the interference caused by the uncertainty of natural language, a similarity measurement between normal cloud models is applied to text classification. On this basis, a novel text classifier based on cloud concept jumping up (CCJU-TC) is proposed. It can efficiently convert between qualitative concepts and quantitative data. The text set is first converted into a text information table based on the vector space model (VSM); the qualitative text concepts extracted from the same category are then jumped up into a single category-level concept. The test text is assigned to the category whose concept is most similar to it under the cloud similarity measure. Comparison with other text classifiers across different feature selection sets indicates that CCJU-TC adapts well to different text features and also achieves better classification performance than traditional classifiers. Hindawi Publishing Corporation 2014-02-23 /pmc/articles/PMC3953649/ /pubmed/24711737 http://dx.doi.org/10.1155/2014/784392 Text en Copyright © 2014 J. Dai and X. Liu. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Dai, Jin Liu, Xin Approach for Text Classification Based on the Similarity Measurement between Normal Cloud Models |
title | Approach for Text Classification Based on the Similarity Measurement between Normal Cloud Models |
title_full | Approach for Text Classification Based on the Similarity Measurement between Normal Cloud Models |
title_fullStr | Approach for Text Classification Based on the Similarity Measurement between Normal Cloud Models |
title_full_unstemmed | Approach for Text Classification Based on the Similarity Measurement between Normal Cloud Models |
title_short | Approach for Text Classification Based on the Similarity Measurement between Normal Cloud Models |
title_sort | approach for text classification based on the similarity measurement between normal cloud models |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3953649/ https://www.ncbi.nlm.nih.gov/pubmed/24711737 http://dx.doi.org/10.1155/2014/784392 |
work_keys_str_mv | AT daijin approachfortextclassificationbasedonthesimilaritymeasurementbetweennormalcloudmodels AT liuxin approachfortextclassificationbasedonthesimilaritymeasurementbetweennormalcloudmodels |
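
The description in this record outlines a pipeline: represent each text as a normal cloud, merge the texts of one category into a category-level cloud, and assign a test text to the most similar category. The following is a minimal Python sketch of that idea under stated assumptions, not the authors' implementation. The record does not give the paper's similarity formula or its concept jumping-up procedure, so the sketch substitutes two stand-ins: a cosine similarity over the (Ex, En, He) triples, and simple pooling of term weights in place of jumping up. All function names, the category labels, and the toy weights are hypothetical.

```python
import math
from statistics import mean


def backward_cloud(samples):
    """Estimate the (Ex, En, He) parameters of a normal cloud from samples.

    Uses the common moment-based backward cloud generator:
    Ex = sample mean, En from the mean absolute deviation,
    He from the gap between the sample variance and En^2.
    """
    ex = mean(samples)
    en = math.sqrt(math.pi / 2) * mean(abs(x - ex) for x in samples)
    variance = sum((x - ex) ** 2 for x in samples) / (len(samples) - 1)
    he = math.sqrt(abs(variance - en ** 2))
    return (ex, en, he)


def cloud_similarity(a, b):
    """Cosine of two (Ex, En, He) vectors.

    Assumption: a stand-in measure, not the formula used in the paper.
    """
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def category_cloud(texts):
    """Pool the term weights of all texts in a category into one cloud.

    Assumption: pooling is a simplification of concept jumping up.
    """
    pooled = [w for text in texts for w in text]
    return backward_cloud(pooled)


def classify(test_weights, training):
    """Return the label of the most similar category cloud and all scores."""
    test_cloud = backward_cloud(test_weights)
    scores = {
        label: cloud_similarity(test_cloud, category_cloud(texts))
        for label, texts in training.items()
    }
    return max(scores, key=scores.get), scores


# Toy VSM term-weight vectors (hypothetical data, two texts per category).
training = {
    "narrow": [[0.5, 0.5, 0.5, 0.5], [0.4, 0.6, 0.5, 0.5]],
    "wide": [[0.0, 1.0, 0.0, 1.0], [0.1, 0.9, 0.0, 1.0]],
}
label, scores = classify([0.0, 1.0, 0.1, 0.9], training)
print(label, scores)
```

Because each text is reduced to three numbers, this toy version ignores which term carries which weight; it only illustrates the flow from quantitative weights to a qualitative concept and back to a category decision.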