Cargando…
Innovativeness Analysis of Scholarly Publications by Age Prediction Using Ordinal Regression
In this paper we refine our method of measuring the innovativeness of scientific papers. Given a diachronic corpus of papers from a particular field of study, published over a period of a number of years, we extract latent topics and train an ordinal regression model to predict publication years bas...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7302826/ http://dx.doi.org/10.1007/978-3-030-50417-5_48 |
_version_ | 1783547929942818816 |
---|---|
author | Savov, Pavel Jatowt, Adam Nielek, Radoslaw |
author_facet | Savov, Pavel Jatowt, Adam Nielek, Radoslaw |
author_sort | Savov, Pavel |
collection | PubMed |
description | In this paper we refine our method of measuring the innovativeness of scientific papers. Given a diachronic corpus of papers from a particular field of study, published over a period of a number of years, we extract latent topics and train an ordinal regression model to predict publication years based on topic distributions. Using the prediction error we calculate a real-number based innovation score, which may be used to complement citation analysis in identifying potential breakthrough publications. The innovation score we had proposed previously could not be compared for papers published in different years. The main contribution we make in this work is adjusting the innovation score to account for the publication year, making the scores of papers published in different years directly comparable. We have also improved the prediction accuracy by replacing multiclass classification with ordinal regression and Latent Dirichlet Allocation models with Correlated Topic Models. This also allows for better understanding of the evolution of research topics. We demonstrate our method on two corpora: 3,577 papers published at the International World Wide Web Conference (WWW) between the years 1994 and 2019, and 835 articles published in the Journal of Artificial Societies and Social Simulation (JASSS) from 1998 to 2019. |
format | Online Article Text |
id | pubmed-7302826 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
record_format | MEDLINE/PubMed |
spelling | pubmed-73028262020-06-19 Innovativeness Analysis of Scholarly Publications by Age Prediction Using Ordinal Regression Savov, Pavel Jatowt, Adam Nielek, Radoslaw Computational Science – ICCS 2020 Article In this paper we refine our method of measuring the innovativeness of scientific papers. Given a diachronic corpus of papers from a particular field of study, published over a period of a number of years, we extract latent topics and train an ordinal regression model to predict publication years based on topic distributions. Using the prediction error we calculate a real-number based innovation score, which may be used to complement citation analysis in identifying potential breakthrough publications. The innovation score we had proposed previously could not be compared for papers published in different years. The main contribution we make in this work is adjusting the innovation score to account for the publication year, making the scores of papers published in different years directly comparable. We have also improved the prediction accuracy by replacing multiclass classification with ordinal regression and Latent Dirichlet Allocation models with Correlated Topic Models. This also allows for better understanding of the evolution of research topics. We demonstrate our method on two corpora: 3,577 papers published at the International World Wide Web Conference (WWW) between the years 1994 and 2019, and 835 articles published in the Journal of Artificial Societies and Social Simulation (JASSS) from 1998 to 2019. 2020-06-15 /pmc/articles/PMC7302826/ http://dx.doi.org/10.1007/978-3-030-50417-5_48 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Article Savov, Pavel Jatowt, Adam Nielek, Radoslaw Innovativeness Analysis of Scholarly Publications by Age Prediction Using Ordinal Regression |
title | Innovativeness Analysis of Scholarly Publications by Age Prediction Using Ordinal Regression |
title_full | Innovativeness Analysis of Scholarly Publications by Age Prediction Using Ordinal Regression |
title_fullStr | Innovativeness Analysis of Scholarly Publications by Age Prediction Using Ordinal Regression |
title_full_unstemmed | Innovativeness Analysis of Scholarly Publications by Age Prediction Using Ordinal Regression |
title_short | Innovativeness Analysis of Scholarly Publications by Age Prediction Using Ordinal Regression |
title_sort | innovativeness analysis of scholarly publications by age prediction using ordinal regression |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7302826/ http://dx.doi.org/10.1007/978-3-030-50417-5_48 |
work_keys_str_mv | AT savovpavel innovativenessanalysisofscholarlypublicationsbyagepredictionusingordinalregression AT jatowtadam innovativenessanalysisofscholarlypublicationsbyagepredictionusingordinalregression AT nielekradoslaw innovativenessanalysisofscholarlypublicationsbyagepredictionusingordinalregression |