Cargando…

Three Journal Similarity Metrics and Their Application to Biomedical Journals

In the present paper, we have created several novel journal similarity metrics. The MeSH odds ratio measures the topical similarity of any pair of journals, based on the major MeSH headings assigned to articles in MEDLINE. The second metric employed the 2009 Author-ity author name disambiguation dat...

Descripción completa

Detalles Bibliográficos
Autores principales: D′Souza, Jennifer L., Smalheiser, Neil R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4275247/
https://www.ncbi.nlm.nih.gov/pubmed/25536326
http://dx.doi.org/10.1371/journal.pone.0115681
_version_ 1782350106200113152
author D′Souza, Jennifer L.
Smalheiser, Neil R.
author_facet D′Souza, Jennifer L.
Smalheiser, Neil R.
author_sort D′Souza, Jennifer L.
collection PubMed
description In the present paper, we have created several novel journal similarity metrics. The MeSH odds ratio measures the topical similarity of any pair of journals, based on the major MeSH headings assigned to articles in MEDLINE. The second metric employed the 2009 Author-ity author name disambiguation dataset as a gold standard for estimating the author odds ratio. This gives a straightforward, intuitive answer to the question: Given two articles in PubMed that share the same author name (lastname, first initial), how does knowing only the identity of the journals (in which the articles were published) predict the relative likelihood that they are written by the same person vs. different persons? The article pair odds ratio detects the tendency of authors to publish repeatedly in the same journal, as well as in specific pairs of journals. The metrics can be applied not only to estimate the similarity of a pair of journals, but to provide novel profiles of individual journals as well. For example, for each journal, one can define the MeSH cloud as the number of other journals that are topically more similar to it than expected by chance, and the author cloud as the number of other journals that share more authors than expected by chance. These metrics for journal pairs and individual journals have been provided in the form of public datasets that can be readily studied and utilized by others.
format Online
Article
Text
id pubmed-4275247
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-42752472014-12-31 Three Journal Similarity Metrics and Their Application to Biomedical Journals D′Souza, Jennifer L. Smalheiser, Neil R. PLoS One Research Article In the present paper, we have created several novel journal similarity metrics. The MeSH odds ratio measures the topical similarity of any pair of journals, based on the major MeSH headings assigned to articles in MEDLINE. The second metric employed the 2009 Author-ity author name disambiguation dataset as a gold standard for estimating the author odds ratio. This gives a straightforward, intuitive answer to the question: Given two articles in PubMed that share the same author name (lastname, first initial), how does knowing only the identity of the journals (in which the articles were published) predict the relative likelihood that they are written by the same person vs. different persons? The article pair odds ratio detects the tendency of authors to publish repeatedly in the same journal, as well as in specific pairs of journals. The metrics can be applied not only to estimate the similarity of a pair of journals, but to provide novel profiles of individual journals as well. For example, for each journal, one can define the MeSH cloud as the number of other journals that are topically more similar to it than expected by chance, and the author cloud as the number of other journals that share more authors than expected by chance. These metrics for journal pairs and individual journals have been provided in the form of public datasets that can be readily studied and utilized by others. Public Library of Science 2014-12-23 /pmc/articles/PMC4275247/ /pubmed/25536326 http://dx.doi.org/10.1371/journal.pone.0115681 Text en © 2014 D′Souza, Smalheiser http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
D′Souza, Jennifer L.
Smalheiser, Neil R.
Three Journal Similarity Metrics and Their Application to Biomedical Journals
title Three Journal Similarity Metrics and Their Application to Biomedical Journals
title_full Three Journal Similarity Metrics and Their Application to Biomedical Journals
title_fullStr Three Journal Similarity Metrics and Their Application to Biomedical Journals
title_full_unstemmed Three Journal Similarity Metrics and Their Application to Biomedical Journals
title_short Three Journal Similarity Metrics and Their Application to Biomedical Journals
title_sort three journal similarity metrics and their application to biomedical journals
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4275247/
https://www.ncbi.nlm.nih.gov/pubmed/25536326
http://dx.doi.org/10.1371/journal.pone.0115681
work_keys_str_mv AT dsouzajenniferl threejournalsimilaritymetricsandtheirapplicationtobiomedicaljournals
AT smalheiserneilr threejournalsimilaritymetricsandtheirapplicationtobiomedicaljournals