Cargando…

Research on Hotspot Discovery in Internet Public Opinions Based on Improved K-Means

How to discover hotspot in the Internet public opinions effectively is a hot research field for the researchers related which plays a key role for governments and corporations to find useful information from mass data in the Internet. An improved K-means algorithm for hotspot discovery in internet p...

Descripción completa

Detalles Bibliográficos
Autor principal: Wang, Gensheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3782808/
https://www.ncbi.nlm.nih.gov/pubmed/24106496
http://dx.doi.org/10.1155/2013/230946
_version_ 1782285610211344384
author Wang, Gensheng
author_facet Wang, Gensheng
author_sort Wang, Gensheng
collection PubMed
description How to discover hotspot in the Internet public opinions effectively is a hot research field for the researchers related which plays a key role for governments and corporations to find useful information from mass data in the Internet. An improved K-means algorithm for hotspot discovery in internet public opinions is presented based on the analysis of existing defects and calculation principle of original K-means algorithm. First, some new methods are designed to preprocess website texts, select and express the characteristics of website texts, and define the similarity between two website texts, respectively. Second, clustering principle and the method of initial classification centers selection are analyzed and improved in order to overcome the limitations of original K-means algorithm. Finally, the experimental results verify that the improved algorithm can improve the clustering stability and classification accuracy of hotspot discovery in internet public opinions when used in practice.
format Online
Article
Text
id pubmed-3782808
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-37828082013-10-08 Research on Hotspot Discovery in Internet Public Opinions Based on Improved K-Means Wang, Gensheng Comput Intell Neurosci Research Article How to discover hotspot in the Internet public opinions effectively is a hot research field for the researchers related which plays a key role for governments and corporations to find useful information from mass data in the Internet. An improved K-means algorithm for hotspot discovery in internet public opinions is presented based on the analysis of existing defects and calculation principle of original K-means algorithm. First, some new methods are designed to preprocess website texts, select and express the characteristics of website texts, and define the similarity between two website texts, respectively. Second, clustering principle and the method of initial classification centers selection are analyzed and improved in order to overcome the limitations of original K-means algorithm. Finally, the experimental results verify that the improved algorithm can improve the clustering stability and classification accuracy of hotspot discovery in internet public opinions when used in practice. Hindawi Publishing Corporation 2013 2013-09-10 /pmc/articles/PMC3782808/ /pubmed/24106496 http://dx.doi.org/10.1155/2013/230946 Text en Copyright © 2013 Gensheng Wang. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Wang, Gensheng
Research on Hotspot Discovery in Internet Public Opinions Based on Improved K-Means
title Research on Hotspot Discovery in Internet Public Opinions Based on Improved K-Means
title_full Research on Hotspot Discovery in Internet Public Opinions Based on Improved K-Means
title_fullStr Research on Hotspot Discovery in Internet Public Opinions Based on Improved K-Means
title_full_unstemmed Research on Hotspot Discovery in Internet Public Opinions Based on Improved K-Means
title_short Research on Hotspot Discovery in Internet Public Opinions Based on Improved K-Means
title_sort research on hotspot discovery in internet public opinions based on improved k-means
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3782808/
https://www.ncbi.nlm.nih.gov/pubmed/24106496
http://dx.doi.org/10.1155/2013/230946
work_keys_str_mv AT wanggensheng researchonhotspotdiscoveryininternetpublicopinionsbasedonimprovedkmeans