Cargando…

Important citation identification by exploiting content and section-wise in-text citation count

A citation is deemed as a potential parameter to determine linkage between research articles. The parameter has extensively been employed to form multifarious academic aspects like calculating the impact factor of journals, h-Index of researchers, allocate different research grants, find the latest...

Descripción completa

Detalles Bibliográficos
Autores principales: Nazir, Shahzad, Asif, Muhammad, Ahmad, Shahbaz, Bukhari, Faisal, Afzal, Muhammad Tanvir, Aljuaid, Hanan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7058319/
https://www.ncbi.nlm.nih.gov/pubmed/32134940
http://dx.doi.org/10.1371/journal.pone.0228885
_version_ 1783503839129763840
author Nazir, Shahzad
Asif, Muhammad
Ahmad, Shahbaz
Bukhari, Faisal
Afzal, Muhammad Tanvir
Aljuaid, Hanan
author_facet Nazir, Shahzad
Asif, Muhammad
Ahmad, Shahbaz
Bukhari, Faisal
Afzal, Muhammad Tanvir
Aljuaid, Hanan
author_sort Nazir, Shahzad
collection PubMed
description A citation is deemed as a potential parameter to determine linkage between research articles. The parameter has extensively been employed to form multifarious academic aspects like calculating the impact factor of journals, h-Index of researchers, allocate different research grants, find the latest research trends, etc. The current state-of-the-art contends that all citations are not of equal importance. Based on this argument, the current trend in citation classification community categorizes citations into important and non-important reasons. The community has proposed different approaches to extract important citations such as citation count, context-based, metadata, and textual based approaches. The contemporary state-of-the-art in citation classification community ignores significantly potential features that can play a vital role in citation classification. This research presents a novel approach for binary citation classification by exploiting section-wise in-text citation frequencies, similarity score, and overall citation count-based features. The study also introduces machine learning algorithms based novel approach for assigning appropriate weights to the logical sections of research papers. The weights are allocated to the citations with respect to their sections. To perform the classification, we used three classification techniques, Support Vector Machine, Kernel Linear Regression, and Random Forest. The experiment was performed on two annotated benchmark datasets that contain 465 and 311 citation pairs of research articles respectively. The results revealed that the proposed approach attained an improved value of precision (i.e., 0.84 vs 0.72) from contemporary state-of-the-art approach.
format Online
Article
Text
id pubmed-7058319
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-70583192020-03-12 Important citation identification by exploiting content and section-wise in-text citation count Nazir, Shahzad Asif, Muhammad Ahmad, Shahbaz Bukhari, Faisal Afzal, Muhammad Tanvir Aljuaid, Hanan PLoS One Research Article A citation is deemed as a potential parameter to determine linkage between research articles. The parameter has extensively been employed to form multifarious academic aspects like calculating the impact factor of journals, h-Index of researchers, allocate different research grants, find the latest research trends, etc. The current state-of-the-art contends that all citations are not of equal importance. Based on this argument, the current trend in citation classification community categorizes citations into important and non-important reasons. The community has proposed different approaches to extract important citations such as citation count, context-based, metadata, and textual based approaches. The contemporary state-of-the-art in citation classification community ignores significantly potential features that can play a vital role in citation classification. This research presents a novel approach for binary citation classification by exploiting section-wise in-text citation frequencies, similarity score, and overall citation count-based features. The study also introduces machine learning algorithms based novel approach for assigning appropriate weights to the logical sections of research papers. The weights are allocated to the citations with respect to their sections. To perform the classification, we used three classification techniques, Support Vector Machine, Kernel Linear Regression, and Random Forest. The experiment was performed on two annotated benchmark datasets that contain 465 and 311 citation pairs of research articles respectively. The results revealed that the proposed approach attained an improved value of precision (i.e., 0.84 vs 0.72) from contemporary state-of-the-art approach. Public Library of Science 2020-03-05 /pmc/articles/PMC7058319/ /pubmed/32134940 http://dx.doi.org/10.1371/journal.pone.0228885 Text en © 2020 Nazir et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Nazir, Shahzad
Asif, Muhammad
Ahmad, Shahbaz
Bukhari, Faisal
Afzal, Muhammad Tanvir
Aljuaid, Hanan
Important citation identification by exploiting content and section-wise in-text citation count
title Important citation identification by exploiting content and section-wise in-text citation count
title_full Important citation identification by exploiting content and section-wise in-text citation count
title_fullStr Important citation identification by exploiting content and section-wise in-text citation count
title_full_unstemmed Important citation identification by exploiting content and section-wise in-text citation count
title_short Important citation identification by exploiting content and section-wise in-text citation count
title_sort important citation identification by exploiting content and section-wise in-text citation count
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7058319/
https://www.ncbi.nlm.nih.gov/pubmed/32134940
http://dx.doi.org/10.1371/journal.pone.0228885
work_keys_str_mv AT nazirshahzad importantcitationidentificationbyexploitingcontentandsectionwiseintextcitationcount
AT asifmuhammad importantcitationidentificationbyexploitingcontentandsectionwiseintextcitationcount
AT ahmadshahbaz importantcitationidentificationbyexploitingcontentandsectionwiseintextcitationcount
AT bukharifaisal importantcitationidentificationbyexploitingcontentandsectionwiseintextcitationcount
AT afzalmuhammadtanvir importantcitationidentificationbyexploitingcontentandsectionwiseintextcitationcount
AT aljuaidhanan importantcitationidentificationbyexploitingcontentandsectionwiseintextcitationcount