Cargando…
Three Journal Similarity Metrics and Their Application to Biomedical Journals
In the present paper, we have created several novel journal similarity metrics. The MeSH odds ratio measures the topical similarity of any pair of journals, based on the major MeSH headings assigned to articles in MEDLINE. The second metric employed the 2009 Author-ity author name disambiguation dat...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4275247/ https://www.ncbi.nlm.nih.gov/pubmed/25536326 http://dx.doi.org/10.1371/journal.pone.0115681 |
_version_ | 1782350106200113152 |
---|---|
author | D′Souza, Jennifer L. Smalheiser, Neil R. |
author_facet | D′Souza, Jennifer L. Smalheiser, Neil R. |
author_sort | D′Souza, Jennifer L. |
collection | PubMed |
description | In the present paper, we have created several novel journal similarity metrics. The MeSH odds ratio measures the topical similarity of any pair of journals, based on the major MeSH headings assigned to articles in MEDLINE. The second metric employed the 2009 Author-ity author name disambiguation dataset as a gold standard for estimating the author odds ratio. This gives a straightforward, intuitive answer to the question: Given two articles in PubMed that share the same author name (lastname, first initial), how does knowing only the identity of the journals (in which the articles were published) predict the relative likelihood that they are written by the same person vs. different persons? The article pair odds ratio detects the tendency of authors to publish repeatedly in the same journal, as well as in specific pairs of journals. The metrics can be applied not only to estimate the similarity of a pair of journals, but to provide novel profiles of individual journals as well. For example, for each journal, one can define the MeSH cloud as the number of other journals that are topically more similar to it than expected by chance, and the author cloud as the number of other journals that share more authors than expected by chance. These metrics for journal pairs and individual journals have been provided in the form of public datasets that can be readily studied and utilized by others. |
format | Online Article Text |
id | pubmed-4275247 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-42752472014-12-31 Three Journal Similarity Metrics and Their Application to Biomedical Journals D′Souza, Jennifer L. Smalheiser, Neil R. PLoS One Research Article In the present paper, we have created several novel journal similarity metrics. The MeSH odds ratio measures the topical similarity of any pair of journals, based on the major MeSH headings assigned to articles in MEDLINE. The second metric employed the 2009 Author-ity author name disambiguation dataset as a gold standard for estimating the author odds ratio. This gives a straightforward, intuitive answer to the question: Given two articles in PubMed that share the same author name (lastname, first initial), how does knowing only the identity of the journals (in which the articles were published) predict the relative likelihood that they are written by the same person vs. different persons? The article pair odds ratio detects the tendency of authors to publish repeatedly in the same journal, as well as in specific pairs of journals. The metrics can be applied not only to estimate the similarity of a pair of journals, but to provide novel profiles of individual journals as well. For example, for each journal, one can define the MeSH cloud as the number of other journals that are topically more similar to it than expected by chance, and the author cloud as the number of other journals that share more authors than expected by chance. These metrics for journal pairs and individual journals have been provided in the form of public datasets that can be readily studied and utilized by others. Public Library of Science 2014-12-23 /pmc/articles/PMC4275247/ /pubmed/25536326 http://dx.doi.org/10.1371/journal.pone.0115681 Text en © 2014 D′Souza, Smalheiser http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article D′Souza, Jennifer L. Smalheiser, Neil R. Three Journal Similarity Metrics and Their Application to Biomedical Journals |
title | Three Journal Similarity Metrics and Their Application to Biomedical Journals |
title_full | Three Journal Similarity Metrics and Their Application to Biomedical Journals |
title_fullStr | Three Journal Similarity Metrics and Their Application to Biomedical Journals |
title_full_unstemmed | Three Journal Similarity Metrics and Their Application to Biomedical Journals |
title_short | Three Journal Similarity Metrics and Their Application to Biomedical Journals |
title_sort | three journal similarity metrics and their application to biomedical journals |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4275247/ https://www.ncbi.nlm.nih.gov/pubmed/25536326 http://dx.doi.org/10.1371/journal.pone.0115681 |
work_keys_str_mv | AT dsouzajenniferl threejournalsimilaritymetricsandtheirapplicationtobiomedicaljournals AT smalheiserneilr threejournalsimilaritymetricsandtheirapplicationtobiomedicaljournals |