Innovativeness Analysis of Scholarly Publications by Age Prediction Using Ordinal Regression

Bibliographic Details
Main authors: Savov, Pavel; Jatowt, Adam; Nielek, Radoslaw
Format: Online Article Text
Language: English
Published: 2020
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7302826/
http://dx.doi.org/10.1007/978-3-030-50417-5_48
_version_ 1783547929942818816
author Savov, Pavel
Jatowt, Adam
Nielek, Radoslaw
collection PubMed
description In this paper we refine our method of measuring the innovativeness of scientific papers. Given a diachronic corpus of papers from a particular field of study, published over several years, we extract latent topics and train an ordinal regression model to predict publication years from topic distributions. From the prediction error we calculate a real-valued innovation score, which may be used to complement citation analysis in identifying potential breakthrough publications. The innovation score we proposed previously could not be compared across papers published in different years. The main contribution of this work is adjusting the innovation score to account for the publication year, which makes the scores of papers published in different years directly comparable. We have also improved prediction accuracy by replacing multiclass classification with ordinal regression and Latent Dirichlet Allocation models with Correlated Topic Models. These changes also allow for a better understanding of the evolution of research topics. We demonstrate our method on two corpora: 3,577 papers published at the International World Wide Web Conference (WWW) between 1994 and 2019, and 835 articles published in the Journal of Artificial Societies and Social Simulation (JASSS) from 1998 to 2019.
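The scoring idea in the description can be illustrated with a minimal sketch. It covers only the last step and assumes that predicted publication years are already available from some model (the topic extraction and the ordinal regression are not shown). The raw score here is the predicted year minus the actual year. The year adjustment is a hypothetical choice made for illustration, namely standardizing the raw scores within each publication year; the record does not state the authors' actual formula. The function names and the toy data are likewise assumptions.

```python
from collections import defaultdict
from statistics import mean, pstdev


def raw_scores(actual_years, predicted_years):
    """Raw score: predicted year minus actual year (positive means 'looks newer')."""
    return [p - a for a, p in zip(actual_years, predicted_years)]


def year_adjusted_scores(actual_years, predicted_years):
    """Standardize raw scores within each publication year.

    ASSUMPTION: this z-score adjustment is illustrative only and is not
    taken from the paper described in this record.
    """
    raw = raw_scores(actual_years, predicted_years)

    # Group the raw scores by the year in which the paper was published.
    by_year = defaultdict(list)
    for year, score in zip(actual_years, raw):
        by_year[year].append(score)

    # Per-year mean and population standard deviation of the raw scores.
    stats = {year: (mean(vals), pstdev(vals)) for year, vals in by_year.items()}

    adjusted = []
    for year, score in zip(actual_years, raw):
        mu, sigma = stats[year]
        # If all papers of a year share one score, no paper stands out.
        adjusted.append((score - mu) / sigma if sigma > 0 else 0.0)
    return adjusted


# Toy data (hypothetical): three papers from the first year of a corpus
# and three from the last year.
actual = [1994, 1994, 1994, 2019, 2019, 2019]
predicted = [1994, 1996, 2001, 2010, 2017, 2019]

raw = raw_scores(actual, predicted)
adjusted = year_adjusted_scores(actual, predicted)
for a, r, s in zip(actual, raw, adjusted):
    print(a, r, round(s, 3))
```

In the toy data, a paper from the first year cannot be predicted earlier than the start of the corpus, so its raw score is never negative, while a paper from the last year can never receive a positive raw score. Comparing each paper only with papers from the same year removes that asymmetry, which is one plausible reading of why a year adjustment makes scores comparable.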
format Online
Article
Text
id pubmed-7302826
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-7302826 2020-06-19 Innovativeness Analysis of Scholarly Publications by Age Prediction Using Ordinal Regression. Savov, Pavel; Jatowt, Adam; Nielek, Radoslaw. Computational Science – ICCS 2020. Article. 2020-06-15. /pmc/articles/PMC7302826/ http://dx.doi.org/10.1007/978-3-030-50417-5_48 Text en © Springer Nature Switzerland AG 2020. This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
title Innovativeness Analysis of Scholarly Publications by Age Prediction Using Ordinal Regression
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7302826/
http://dx.doi.org/10.1007/978-3-030-50417-5_48