Cargando…

A stochastic generative model for citation networks among academic papers

We propose a stochastic generative model to represent a directed graph constructed by citations among academic papers, where nodes and directed edges represent papers with discrete publication time and citations respectively. The proposed model assumes that a citation between two papers occurs with...

Descripción completa

Detalles Bibliográficos
Autores principales: Yasui, Yuichiro, Nakano, Junji
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9242511/
https://www.ncbi.nlm.nih.gov/pubmed/35767539
http://dx.doi.org/10.1371/journal.pone.0269845
_version_ 1784738076283633664
author Yasui, Yuichiro
Nakano, Junji
author_facet Yasui, Yuichiro
Nakano, Junji
author_sort Yasui, Yuichiro
collection PubMed
description We propose a stochastic generative model to represent a directed graph constructed by citations among academic papers, where nodes and directed edges represent papers with discrete publication time and citations respectively. The proposed model assumes that a citation between two papers occurs with a probability based on the type of the citing paper, the importance of cited paper, and the difference between their publication times, like the existing models. We consider the out-degrees of citing paper as its type, because, for example, survey paper cites many papers. We approximate the importance of a cited paper by its in-degrees. In our model, we adopt three functions: a logistic function for illustrating the numbers of papers published in discrete time, an inverse Gaussian probability distribution function to express the aging effect based on the difference between publication times, and an exponential distribution (or a generalized Pareto distribution) for describing the out-degree distribution. We consider that our model is a more reasonable and appropriate stochastic model than other existing models and can perform complete simulations without using original data. In this paper, we first use the Web of Science database and see the features used in our model. By using the proposed model, we can generate simulated graphs and demonstrate that they are similar to the original data concerning the in- and out-degree distributions, and node triangle participation. In addition, we analyze two other citation networks derived from physics papers in the arXiv database and verify the effectiveness of the model.
format Online
Article
Text
id pubmed-9242511
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-92425112022-06-30 A stochastic generative model for citation networks among academic papers Yasui, Yuichiro Nakano, Junji PLoS One Research Article We propose a stochastic generative model to represent a directed graph constructed by citations among academic papers, where nodes and directed edges represent papers with discrete publication time and citations respectively. The proposed model assumes that a citation between two papers occurs with a probability based on the type of the citing paper, the importance of cited paper, and the difference between their publication times, like the existing models. We consider the out-degrees of citing paper as its type, because, for example, survey paper cites many papers. We approximate the importance of a cited paper by its in-degrees. In our model, we adopt three functions: a logistic function for illustrating the numbers of papers published in discrete time, an inverse Gaussian probability distribution function to express the aging effect based on the difference between publication times, and an exponential distribution (or a generalized Pareto distribution) for describing the out-degree distribution. We consider that our model is a more reasonable and appropriate stochastic model than other existing models and can perform complete simulations without using original data. In this paper, we first use the Web of Science database and see the features used in our model. By using the proposed model, we can generate simulated graphs and demonstrate that they are similar to the original data concerning the in- and out-degree distributions, and node triangle participation. In addition, we analyze two other citation networks derived from physics papers in the arXiv database and verify the effectiveness of the model. Public Library of Science 2022-06-29 /pmc/articles/PMC9242511/ /pubmed/35767539 http://dx.doi.org/10.1371/journal.pone.0269845 Text en © 2022 Yasui, Nakano https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Yasui, Yuichiro
Nakano, Junji
A stochastic generative model for citation networks among academic papers
title A stochastic generative model for citation networks among academic papers
title_full A stochastic generative model for citation networks among academic papers
title_fullStr A stochastic generative model for citation networks among academic papers
title_full_unstemmed A stochastic generative model for citation networks among academic papers
title_short A stochastic generative model for citation networks among academic papers
title_sort stochastic generative model for citation networks among academic papers
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9242511/
https://www.ncbi.nlm.nih.gov/pubmed/35767539
http://dx.doi.org/10.1371/journal.pone.0269845
work_keys_str_mv AT yasuiyuichiro astochasticgenerativemodelforcitationnetworksamongacademicpapers
AT nakanojunji astochasticgenerativemodelforcitationnetworksamongacademicpapers
AT yasuiyuichiro stochasticgenerativemodelforcitationnetworksamongacademicpapers
AT nakanojunji stochasticgenerativemodelforcitationnetworksamongacademicpapers