Cargando…

Comprehension of polarity of articles by citation sentiment analysis using TF-IDF and ML classifiers

Sentiment analysis has been researched extensively during the last few years, however, the sentiment analysis of citations in a research article is an unexplored research area. Sentiment analysis of citations can provide new applications in bibliometrics and provide insights for a better understandi...

Descripción completa

Detalles Bibliográficos
Autores principales: Karim, Musarat, Saad Missen, Malik Muhammad, Umer, Muhammad, Fida, Alisha, Eshmawi, Ala’ Abdulmajid, Mohamed, Abdullah, Ashraf, Imran
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280177/
https://www.ncbi.nlm.nih.gov/pubmed/37346319
http://dx.doi.org/10.7717/peerj-cs.1107
_version_ 1785060745651683328
author Karim, Musarat
Saad Missen, Malik Muhammad
Umer, Muhammad
Fida, Alisha
Eshmawi, Ala’ Abdulmajid
Mohamed, Abdullah
Ashraf, Imran
author_facet Karim, Musarat
Saad Missen, Malik Muhammad
Umer, Muhammad
Fida, Alisha
Eshmawi, Ala’ Abdulmajid
Mohamed, Abdullah
Ashraf, Imran
author_sort Karim, Musarat
collection PubMed
description Sentiment analysis has been researched extensively during the last few years, however, the sentiment analysis of citations in a research article is an unexplored research area. Sentiment analysis of citations can provide new applications in bibliometrics and provide insights for a better understanding of scientific knowledge. Citation count, as it is used today to measure the quality of a paper, does not portray the quality of a scientific article, as the article may be cited to indicate its weakness. So determining the polarity of a citation is an important task to quantify the quality of the cited article and ascertain its impact and ranking. This article presents an approach to determine the polarity of the cited article using term frequency-inverse document frequency and machine learning classifiers. To analyze the influence of an imbalanced dataset, several experiments are performed with and without the synthetic minority oversampling technique (SMOTE) and uni-gram and bi-gram term frequency-inverse document frequency (TF-IDF). Results indicate that the proposed methodology achieves high accuracy of 99.0% with the extra tree classifier when trained on SMOTE oversampled dataset and bi-gram features.
format Online
Article
Text
id pubmed-10280177
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-102801772023-06-21 Comprehension of polarity of articles by citation sentiment analysis using TF-IDF and ML classifiers Karim, Musarat Saad Missen, Malik Muhammad Umer, Muhammad Fida, Alisha Eshmawi, Ala’ Abdulmajid Mohamed, Abdullah Ashraf, Imran PeerJ Comput Sci Computational Linguistics Sentiment analysis has been researched extensively during the last few years, however, the sentiment analysis of citations in a research article is an unexplored research area. Sentiment analysis of citations can provide new applications in bibliometrics and provide insights for a better understanding of scientific knowledge. Citation count, as it is used today to measure the quality of a paper, does not portray the quality of a scientific article, as the article may be cited to indicate its weakness. So determining the polarity of a citation is an important task to quantify the quality of the cited article and ascertain its impact and ranking. This article presents an approach to determine the polarity of the cited article using term frequency-inverse document frequency and machine learning classifiers. To analyze the influence of an imbalanced dataset, several experiments are performed with and without the synthetic minority oversampling technique (SMOTE) and uni-gram and bi-gram term frequency-inverse document frequency (TF-IDF). Results indicate that the proposed methodology achieves high accuracy of 99.0% with the extra tree classifier when trained on SMOTE oversampled dataset and bi-gram features. PeerJ Inc. 2022-12-13 /pmc/articles/PMC10280177/ /pubmed/37346319 http://dx.doi.org/10.7717/peerj-cs.1107 Text en ©Karim 2022 et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle Computational Linguistics
Karim, Musarat
Saad Missen, Malik Muhammad
Umer, Muhammad
Fida, Alisha
Eshmawi, Ala’ Abdulmajid
Mohamed, Abdullah
Ashraf, Imran
Comprehension of polarity of articles by citation sentiment analysis using TF-IDF and ML classifiers
title Comprehension of polarity of articles by citation sentiment analysis using TF-IDF and ML classifiers
title_full Comprehension of polarity of articles by citation sentiment analysis using TF-IDF and ML classifiers
title_fullStr Comprehension of polarity of articles by citation sentiment analysis using TF-IDF and ML classifiers
title_full_unstemmed Comprehension of polarity of articles by citation sentiment analysis using TF-IDF and ML classifiers
title_short Comprehension of polarity of articles by citation sentiment analysis using TF-IDF and ML classifiers
title_sort comprehension of polarity of articles by citation sentiment analysis using tf-idf and ml classifiers
topic Computational Linguistics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280177/
https://www.ncbi.nlm.nih.gov/pubmed/37346319
http://dx.doi.org/10.7717/peerj-cs.1107
work_keys_str_mv AT karimmusarat comprehensionofpolarityofarticlesbycitationsentimentanalysisusingtfidfandmlclassifiers
AT saadmissenmalikmuhammad comprehensionofpolarityofarticlesbycitationsentimentanalysisusingtfidfandmlclassifiers
AT umermuhammad comprehensionofpolarityofarticlesbycitationsentimentanalysisusingtfidfandmlclassifiers
AT fidaalisha comprehensionofpolarityofarticlesbycitationsentimentanalysisusingtfidfandmlclassifiers
AT eshmawialaabdulmajid comprehensionofpolarityofarticlesbycitationsentimentanalysisusingtfidfandmlclassifiers
AT mohamedabdullah comprehensionofpolarityofarticlesbycitationsentimentanalysisusingtfidfandmlclassifiers
AT ashrafimran comprehensionofpolarityofarticlesbycitationsentimentanalysisusingtfidfandmlclassifiers