Cargando…
In-text citation’s frequencies-based recommendations of relevant research papers
From the past half of a century, identification of the relevant documents is deemed an active area of research due to the rapid increase of data on the web. The traditional models to retrieve relevant documents are based on bibliographic information such as Bibliographic coupling, Co-citations, and...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8189020/ https://www.ncbi.nlm.nih.gov/pubmed/34150995 http://dx.doi.org/10.7717/peerj-cs.524 |
_version_ | 1783705437301899264 |
---|---|
author | Shahid, Abdul Afzal, Muhammad Tanvir Alharbi, Abdullah Aljuaid, Hanan Al-Otaibi, Shaha |
author_facet | Shahid, Abdul Afzal, Muhammad Tanvir Alharbi, Abdullah Aljuaid, Hanan Al-Otaibi, Shaha |
author_sort | Shahid, Abdul |
collection | PubMed |
description | From the past half of a century, identification of the relevant documents is deemed an active area of research due to the rapid increase of data on the web. The traditional models to retrieve relevant documents are based on bibliographic information such as Bibliographic coupling, Co-citations, and Direct citations. However, in the recent past, the scientific community has started to employ textual features to improve existing models’ accuracy. In our previous study, we found that analysis of citations at a deep level (i.e., content level) can play a paramount role in finding more relevant documents than surface level (i.e., just bibliography details). We found that cited and citing papers have a high degree of relevancy when in-text citations frequency of the cited paper is more than five times in the citing paper’s text. This paper is an extension of our previous study in terms of its evaluation of a comprehensive dataset. Moreover, the study results are also compared with other state-of-the-art approaches i.e., content, metadata, and bibliography. For evaluation, a user study is conducted on selected papers from 1,200 documents (comprise about 16,000 references) of an online journal, Journal of Computer Science (J.UCS). The evaluation results indicate that in-text citation frequency has attained higher precision in finding relevant papers than other state-of-the-art techniques such as content, bibliographic coupling, and metadata-based techniques. The use of in-text citation may help in enhancing the quality of existing information systems and digital libraries. Further, more sophisticated measure may be redefined be considering the use of in-text citations. |
format | Online Article Text |
id | pubmed-8189020 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-81890202021-06-17 In-text citation’s frequencies-based recommendations of relevant research papers Shahid, Abdul Afzal, Muhammad Tanvir Alharbi, Abdullah Aljuaid, Hanan Al-Otaibi, Shaha PeerJ Comput Sci Algorithms and Analysis of Algorithms From the past half of a century, identification of the relevant documents is deemed an active area of research due to the rapid increase of data on the web. The traditional models to retrieve relevant documents are based on bibliographic information such as Bibliographic coupling, Co-citations, and Direct citations. However, in the recent past, the scientific community has started to employ textual features to improve existing models’ accuracy. In our previous study, we found that analysis of citations at a deep level (i.e., content level) can play a paramount role in finding more relevant documents than surface level (i.e., just bibliography details). We found that cited and citing papers have a high degree of relevancy when in-text citations frequency of the cited paper is more than five times in the citing paper’s text. This paper is an extension of our previous study in terms of its evaluation of a comprehensive dataset. Moreover, the study results are also compared with other state-of-the-art approaches i.e., content, metadata, and bibliography. For evaluation, a user study is conducted on selected papers from 1,200 documents (comprise about 16,000 references) of an online journal, Journal of Computer Science (J.UCS). The evaluation results indicate that in-text citation frequency has attained higher precision in finding relevant papers than other state-of-the-art techniques such as content, bibliographic coupling, and metadata-based techniques. The use of in-text citation may help in enhancing the quality of existing information systems and digital libraries. Further, more sophisticated measure may be redefined be considering the use of in-text citations. PeerJ Inc. 2021-06-04 /pmc/articles/PMC8189020/ /pubmed/34150995 http://dx.doi.org/10.7717/peerj-cs.524 Text en ©2021 Shahid et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited. |
spellingShingle | Algorithms and Analysis of Algorithms Shahid, Abdul Afzal, Muhammad Tanvir Alharbi, Abdullah Aljuaid, Hanan Al-Otaibi, Shaha In-text citation’s frequencies-based recommendations of relevant research papers |
title | In-text citation’s frequencies-based recommendations of relevant research papers |
title_full | In-text citation’s frequencies-based recommendations of relevant research papers |
title_fullStr | In-text citation’s frequencies-based recommendations of relevant research papers |
title_full_unstemmed | In-text citation’s frequencies-based recommendations of relevant research papers |
title_short | In-text citation’s frequencies-based recommendations of relevant research papers |
title_sort | in-text citation’s frequencies-based recommendations of relevant research papers |
topic | Algorithms and Analysis of Algorithms |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8189020/ https://www.ncbi.nlm.nih.gov/pubmed/34150995 http://dx.doi.org/10.7717/peerj-cs.524 |
work_keys_str_mv | AT shahidabdul intextcitationsfrequenciesbasedrecommendationsofrelevantresearchpapers AT afzalmuhammadtanvir intextcitationsfrequenciesbasedrecommendationsofrelevantresearchpapers AT alharbiabdullah intextcitationsfrequenciesbasedrecommendationsofrelevantresearchpapers AT aljuaidhanan intextcitationsfrequenciesbasedrecommendationsofrelevantresearchpapers AT alotaibishaha intextcitationsfrequenciesbasedrecommendationsofrelevantresearchpapers |