Cargando…

The Spread of Scientific Information: Insights from the Web Usage Statistics in PLoS Article-Level Metrics

The presence of web-based communities is a distinctive signature of Web 2.0. The web-based feature means that information propagation within each community is highly facilitated, promoting complex collective dynamics in view of information exchange. In this work, we focus on a community of scientist...

Descripción completa

Detalles Bibliográficos
Autores principales: Yan, Koon-Kiu, Gerstein, Mark
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3095621/
https://www.ncbi.nlm.nih.gov/pubmed/21603617
http://dx.doi.org/10.1371/journal.pone.0019917
_version_ 1782203675781890048
author Yan, Koon-Kiu
Gerstein, Mark
author_facet Yan, Koon-Kiu
Gerstein, Mark
author_sort Yan, Koon-Kiu
collection PubMed
description The presence of web-based communities is a distinctive signature of Web 2.0. The web-based feature means that information propagation within each community is highly facilitated, promoting complex collective dynamics in view of information exchange. In this work, we focus on a community of scientists and study, in particular, how the awareness of a scientific paper is spread. Our work is based on the web usage statistics obtained from the PLoS Article Level Metrics dataset compiled by PLoS. The cumulative number of HTML views was found to follow a long tail distribution which is reasonably well-fitted by a lognormal one. We modeled the diffusion of information by a random multiplicative process, and thus extracted the rates of information spread at different stages after the publication of a paper. We found that the spread of information displays two distinct decay regimes: a rapid downfall in the first month after publication, and a gradual power law decay afterwards. We identified these two regimes with two distinct driving processes: a short-term behavior driven by the fame of a paper, and a long-term behavior consistent with citation statistics. The patterns of information spread were found to be remarkably similar in data from different journals, but there are intrinsic differences for different types of web usage (HTML views and PDF downloads versus XML). These similarities and differences shed light on the theoretical understanding of different complex systems, as well as a better design of the corresponding web applications that is of high potential marketing impact.
format Text
id pubmed-3095621
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-30956212011-05-19 The Spread of Scientific Information: Insights from the Web Usage Statistics in PLoS Article-Level Metrics Yan, Koon-Kiu Gerstein, Mark PLoS One Research Article The presence of web-based communities is a distinctive signature of Web 2.0. The web-based feature means that information propagation within each community is highly facilitated, promoting complex collective dynamics in view of information exchange. In this work, we focus on a community of scientists and study, in particular, how the awareness of a scientific paper is spread. Our work is based on the web usage statistics obtained from the PLoS Article Level Metrics dataset compiled by PLoS. The cumulative number of HTML views was found to follow a long tail distribution which is reasonably well-fitted by a lognormal one. We modeled the diffusion of information by a random multiplicative process, and thus extracted the rates of information spread at different stages after the publication of a paper. We found that the spread of information displays two distinct decay regimes: a rapid downfall in the first month after publication, and a gradual power law decay afterwards. We identified these two regimes with two distinct driving processes: a short-term behavior driven by the fame of a paper, and a long-term behavior consistent with citation statistics. The patterns of information spread were found to be remarkably similar in data from different journals, but there are intrinsic differences for different types of web usage (HTML views and PDF downloads versus XML). These similarities and differences shed light on the theoretical understanding of different complex systems, as well as a better design of the corresponding web applications that is of high potential marketing impact. Public Library of Science 2011-05-16 /pmc/articles/PMC3095621/ /pubmed/21603617 http://dx.doi.org/10.1371/journal.pone.0019917 Text en Yan, Gerstein. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Yan, Koon-Kiu
Gerstein, Mark
The Spread of Scientific Information: Insights from the Web Usage Statistics in PLoS Article-Level Metrics
title The Spread of Scientific Information: Insights from the Web Usage Statistics in PLoS Article-Level Metrics
title_full The Spread of Scientific Information: Insights from the Web Usage Statistics in PLoS Article-Level Metrics
title_fullStr The Spread of Scientific Information: Insights from the Web Usage Statistics in PLoS Article-Level Metrics
title_full_unstemmed The Spread of Scientific Information: Insights from the Web Usage Statistics in PLoS Article-Level Metrics
title_short The Spread of Scientific Information: Insights from the Web Usage Statistics in PLoS Article-Level Metrics
title_sort spread of scientific information: insights from the web usage statistics in plos article-level metrics
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3095621/
https://www.ncbi.nlm.nih.gov/pubmed/21603617
http://dx.doi.org/10.1371/journal.pone.0019917
work_keys_str_mv AT yankoonkiu thespreadofscientificinformationinsightsfromthewebusagestatisticsinplosarticlelevelmetrics
AT gersteinmark thespreadofscientificinformationinsightsfromthewebusagestatisticsinplosarticlelevelmetrics
AT yankoonkiu spreadofscientificinformationinsightsfromthewebusagestatisticsinplosarticlelevelmetrics
AT gersteinmark spreadofscientificinformationinsightsfromthewebusagestatisticsinplosarticlelevelmetrics