Cargando…

Research on Literature Clustering Algorithm for Massive Scientific and Technical Literature Query Service

Traditional science and technology literature search mainly provides users with reliable and detailed information materials and services through technical means, data resources, and service strategies. With the development of network technology, computer technology, and information technology, digit...

Descripción completa

Detalles Bibliográficos
Autor principal: Zhang, Chen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9420566/
https://www.ncbi.nlm.nih.gov/pubmed/36045966
http://dx.doi.org/10.1155/2022/3392489
_version_ 1784777418909679616
author Zhang, Chen
author_facet Zhang, Chen
author_sort Zhang, Chen
collection PubMed
description Traditional science and technology literature search mainly provides users with reliable and detailed information materials and services through technical means, data resources, and service strategies. With the development of network technology, computer technology, and information technology, digital information resources are increasing day by day, which continuously impact the traditional knowledge service mode. Some traditional technical methods and service means can no longer meet the information needs of users under large data sets. This paper proposes a model of large-scale literature search service in the context of big data by studying the technical means and service modes used for scientific and technical literature search in universities in the era of big data. Specifically, this paper proposes a method for fast literature retrieval by combining R-tree indexing for the characteristics of diverse data types and large data volume of science and technology literature. The method uses an improved k-mean clustering algorithm to construct an R-tree clustering model and improve the retrieval efficiency of the system by retrieving scientific and technical literature data through R-tree indexing. Experiments on university science and technology literature datasets show that the method in this paper improves both efficiency and precision when searching literature.
format Online
Article
Text
id pubmed-9420566
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-94205662022-08-30 Research on Literature Clustering Algorithm for Massive Scientific and Technical Literature Query Service Zhang, Chen Comput Intell Neurosci Research Article Traditional science and technology literature search mainly provides users with reliable and detailed information materials and services through technical means, data resources, and service strategies. With the development of network technology, computer technology, and information technology, digital information resources are increasing day by day, which continuously impact the traditional knowledge service mode. Some traditional technical methods and service means can no longer meet the information needs of users under large data sets. This paper proposes a model of large-scale literature search service in the context of big data by studying the technical means and service modes used for scientific and technical literature search in universities in the era of big data. Specifically, this paper proposes a method for fast literature retrieval by combining R-tree indexing for the characteristics of diverse data types and large data volume of science and technology literature. The method uses an improved k-mean clustering algorithm to construct an R-tree clustering model and improve the retrieval efficiency of the system by retrieving scientific and technical literature data through R-tree indexing. Experiments on university science and technology literature datasets show that the method in this paper improves both efficiency and precision when searching literature. Hindawi 2022-08-21 /pmc/articles/PMC9420566/ /pubmed/36045966 http://dx.doi.org/10.1155/2022/3392489 Text en Copyright © 2022 Chen Zhang. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Zhang, Chen
Research on Literature Clustering Algorithm for Massive Scientific and Technical Literature Query Service
title Research on Literature Clustering Algorithm for Massive Scientific and Technical Literature Query Service
title_full Research on Literature Clustering Algorithm for Massive Scientific and Technical Literature Query Service
title_fullStr Research on Literature Clustering Algorithm for Massive Scientific and Technical Literature Query Service
title_full_unstemmed Research on Literature Clustering Algorithm for Massive Scientific and Technical Literature Query Service
title_short Research on Literature Clustering Algorithm for Massive Scientific and Technical Literature Query Service
title_sort research on literature clustering algorithm for massive scientific and technical literature query service
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9420566/
https://www.ncbi.nlm.nih.gov/pubmed/36045966
http://dx.doi.org/10.1155/2022/3392489
work_keys_str_mv AT zhangchen researchonliteratureclusteringalgorithmformassivescientificandtechnicalliteraturequeryservice