Cargando…
Research on Literature Clustering Algorithm for Massive Scientific and Technical Literature Query Service
Traditional science and technology literature search mainly provides users with reliable and detailed information materials and services through technical means, data resources, and service strategies. With the development of network technology, computer technology, and information technology, digit...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9420566/ https://www.ncbi.nlm.nih.gov/pubmed/36045966 http://dx.doi.org/10.1155/2022/3392489 |
_version_ | 1784777418909679616 |
---|---|
author | Zhang, Chen |
author_facet | Zhang, Chen |
author_sort | Zhang, Chen |
collection | PubMed |
description | Traditional science and technology literature search mainly provides users with reliable and detailed information materials and services through technical means, data resources, and service strategies. With the development of network technology, computer technology, and information technology, digital information resources are increasing day by day, which continuously impact the traditional knowledge service mode. Some traditional technical methods and service means can no longer meet the information needs of users under large data sets. This paper proposes a model of large-scale literature search service in the context of big data by studying the technical means and service modes used for scientific and technical literature search in universities in the era of big data. Specifically, this paper proposes a method for fast literature retrieval by combining R-tree indexing for the characteristics of diverse data types and large data volume of science and technology literature. The method uses an improved k-mean clustering algorithm to construct an R-tree clustering model and improve the retrieval efficiency of the system by retrieving scientific and technical literature data through R-tree indexing. Experiments on university science and technology literature datasets show that the method in this paper improves both efficiency and precision when searching literature. |
format | Online Article Text |
id | pubmed-9420566 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-94205662022-08-30 Research on Literature Clustering Algorithm for Massive Scientific and Technical Literature Query Service Zhang, Chen Comput Intell Neurosci Research Article Traditional science and technology literature search mainly provides users with reliable and detailed information materials and services through technical means, data resources, and service strategies. With the development of network technology, computer technology, and information technology, digital information resources are increasing day by day, which continuously impact the traditional knowledge service mode. Some traditional technical methods and service means can no longer meet the information needs of users under large data sets. This paper proposes a model of large-scale literature search service in the context of big data by studying the technical means and service modes used for scientific and technical literature search in universities in the era of big data. Specifically, this paper proposes a method for fast literature retrieval by combining R-tree indexing for the characteristics of diverse data types and large data volume of science and technology literature. The method uses an improved k-mean clustering algorithm to construct an R-tree clustering model and improve the retrieval efficiency of the system by retrieving scientific and technical literature data through R-tree indexing. Experiments on university science and technology literature datasets show that the method in this paper improves both efficiency and precision when searching literature. Hindawi 2022-08-21 /pmc/articles/PMC9420566/ /pubmed/36045966 http://dx.doi.org/10.1155/2022/3392489 Text en Copyright © 2022 Chen Zhang. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Zhang, Chen Research on Literature Clustering Algorithm for Massive Scientific and Technical Literature Query Service |
title | Research on Literature Clustering Algorithm for Massive Scientific and Technical Literature Query Service |
title_full | Research on Literature Clustering Algorithm for Massive Scientific and Technical Literature Query Service |
title_fullStr | Research on Literature Clustering Algorithm for Massive Scientific and Technical Literature Query Service |
title_full_unstemmed | Research on Literature Clustering Algorithm for Massive Scientific and Technical Literature Query Service |
title_short | Research on Literature Clustering Algorithm for Massive Scientific and Technical Literature Query Service |
title_sort | research on literature clustering algorithm for massive scientific and technical literature query service |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9420566/ https://www.ncbi.nlm.nih.gov/pubmed/36045966 http://dx.doi.org/10.1155/2022/3392489 |
work_keys_str_mv | AT zhangchen researchonliteratureclusteringalgorithmformassivescientificandtechnicalliteraturequeryservice |