Approach for Text Classification Based on the Similarity Measurement between Normal Cloud Models

Measuring the similarity between objects is a core research area of data mining. To reduce the interference caused by the uncertainty of natural language, a similarity measurement between normal cloud models is applied to text classification. On this basis, a novel text classifier based on cloud c...

Bibliographic Details
Main Authors: Dai, Jin; Liu, Xin
Format: Online Article Text
Language: English
Published: Hindawi Publishing Corporation 2014
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3953649/
https://www.ncbi.nlm.nih.gov/pubmed/24711737
http://dx.doi.org/10.1155/2014/784392
_version_ 1782307395214508032
author Dai, Jin
Liu, Xin
author_facet Dai, Jin
Liu, Xin
author_sort Dai, Jin
collection PubMed
description Measuring the similarity between objects is a core research area of data mining. To reduce the interference caused by the uncertainty of natural language, a similarity measurement between normal cloud models is applied to text classification. On this basis, a novel text classifier based on cloud concept jumping up (CCJU-TC) is proposed. It can efficiently convert between qualitative concepts and quantitative data. The text set is first converted into a text information table based on the vector space model (VSM); the qualitative text concepts extracted from the same category are then jumped up into a single concept for the whole category. The test text is assigned to the category whose concept has the highest cloud similarity to it. Comparisons among different text classifiers on different feature selection sets show that CCJU-TC adapts well to different text features and that its classification performance is better than that of the traditional classifiers.
format Online
Article
Text
id pubmed-3953649
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-3953649 2014-04-07 Approach for Text Classification Based on the Similarity Measurement between Normal Cloud Models Dai, Jin Liu, Xin ScientificWorldJournal Research Article Measuring the similarity between objects is a core research area of data mining. To reduce the interference caused by the uncertainty of natural language, a similarity measurement between normal cloud models is applied to text classification. On this basis, a novel text classifier based on cloud concept jumping up (CCJU-TC) is proposed. It can efficiently convert between qualitative concepts and quantitative data. The text set is first converted into a text information table based on the vector space model (VSM); the qualitative text concepts extracted from the same category are then jumped up into a single concept for the whole category. The test text is assigned to the category whose concept has the highest cloud similarity to it. Comparisons among different text classifiers on different feature selection sets show that CCJU-TC adapts well to different text features and that its classification performance is better than that of the traditional classifiers. Hindawi Publishing Corporation 2014-02-23 /pmc/articles/PMC3953649/ /pubmed/24711737 http://dx.doi.org/10.1155/2014/784392 Text en Copyright © 2014 J. Dai and X. Liu. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Dai, Jin
Liu, Xin
Approach for Text Classification Based on the Similarity Measurement between Normal Cloud Models
title Approach for Text Classification Based on the Similarity Measurement between Normal Cloud Models
title_full Approach for Text Classification Based on the Similarity Measurement between Normal Cloud Models
title_fullStr Approach for Text Classification Based on the Similarity Measurement between Normal Cloud Models
title_full_unstemmed Approach for Text Classification Based on the Similarity Measurement between Normal Cloud Models
title_short Approach for Text Classification Based on the Similarity Measurement between Normal Cloud Models
title_sort approach for text classification based on the similarity measurement between normal cloud models
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3953649/
https://www.ncbi.nlm.nih.gov/pubmed/24711737
http://dx.doi.org/10.1155/2014/784392
work_keys_str_mv AT daijin approachfortextclassificationbasedonthesimilaritymeasurementbetweennormalcloudmodels
AT liuxin approachfortextclassificationbasedonthesimilaritymeasurementbetweennormalcloudmodels
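
Illustrative note on the description field: the abstract says a test text is assigned to the category whose concept is most similar under a cloud similarity measure, but this record does not give the paper's similarity formula or its concept-jumping procedure. The Python sketch below is therefore only a rough illustration under stated assumptions, not the authors' CCJU-TC method. It assumes each text is a list of VSM feature weights, summarizes a list of weights as a normal cloud (Ex, En, He) with the commonly used backward cloud generator without certainty degrees, stands in for "jumping up" by pooling the weights of all texts in a category, and substitutes cosine similarity between the (Ex, En, He) vectors for the paper's measure. All function names and the toy data are hypothetical.

```python
import math
from statistics import fmean


def backward_cloud(samples):
    """Estimate (Ex, En, He) of a normal cloud from 1-D samples.

    Uses the backward cloud generator without certainty degrees:
    Ex = mean, En = sqrt(pi/2) * mean absolute deviation,
    He = sqrt(|sample variance - En^2|).
    """
    values = list(samples)
    if len(values) < 2:
        raise ValueError("need at least two samples")
    ex = fmean(values)
    en = math.sqrt(math.pi / 2) * fmean([abs(x - ex) for x in values])
    variance = sum((x - ex) ** 2 for x in values) / (len(values) - 1)
    he = math.sqrt(abs(variance - en ** 2))
    return (ex, en, he)


def cloud_similarity(cloud_a, cloud_b):
    """Cosine similarity of two (Ex, En, He) vectors.

    ASSUMPTION: a stand-in measure, not the one defined in the paper.
    """
    dot = sum(a * b for a, b in zip(cloud_a, cloud_b))
    norm_a = math.sqrt(sum(a * a for a in cloud_a))
    norm_b = math.sqrt(sum(b * b for b in cloud_b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def category_concept(texts):
    """Pool the weights of all texts in one category into a single cloud.

    ASSUMPTION: pooling is a simplified substitute for concept jumping up.
    """
    pooled = [weight for text in texts for weight in text]
    return backward_cloud(pooled)


def classify(test_text, training_sets):
    """Return the label of the category concept most similar to the test text."""
    test_cloud = backward_cloud(test_text)
    concepts = {label: category_concept(texts)
                for label, texts in training_sets.items()}
    return max(concepts,
               key=lambda label: cloud_similarity(test_cloud, concepts[label]))


if __name__ == "__main__":
    # Hypothetical toy data: two categories with differently spread weights.
    training = {
        "A": [[0.1, 0.2, 0.1, 0.2], [0.2, 0.1, 0.2, 0.1]],
        "B": [[0.0, 0.9, 0.0, 0.9], [0.9, 0.0, 0.9, 0.0]],
    }
    print(classify([0.15, 0.2, 0.1, 0.15], training))
    print(classify([0.8, 0.0, 0.9, 0.1], training))
```

Because cosine similarity compares only the direction of the (Ex, En, He) vectors, clouds with nonnegative parameters tend to score close to one another; a measure that also accounts for the overlap of the two clouds would discriminate more sharply.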