Cargando…
A systematic metadata harvesting workflow for analysing scientific networks
One of the disciplines behind the science of science is the study of scientific networks. This work focuses on scientific networks as a social network having different nodes and connections. Nodes can be represented by authors, articles or journals while connections by citation, co-citation or co-au...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7959659/ https://www.ncbi.nlm.nih.gov/pubmed/33817056 http://dx.doi.org/10.7717/peerj-cs.421 |
_version_ | 1783664997069488128 |
---|---|
author | Butt, Bilal H. Rafi, Muhammad Sabih, Muhammad |
author_facet | Butt, Bilal H. Rafi, Muhammad Sabih, Muhammad |
author_sort | Butt, Bilal H. |
collection | PubMed |
description | One of the disciplines behind the science of science is the study of scientific networks. This work focuses on scientific networks as a social network having different nodes and connections. Nodes can be represented by authors, articles or journals while connections by citation, co-citation or co-authorship. One of the challenges in creating scientific networks is the lack of publicly available comprehensive data set. It limits the variety of analyses on the same set of nodes of different scientific networks. To supplement such analyses we have worked on publicly available citation metadata from Crossref and OpenCitatons. Using this data a workflow is developed to create scientific networks. Analysis of these networks gives insights into academic research and scholarship. Different techniques of social network analysis have been applied in the literature to study these networks. It includes centrality analysis, community detection, and clustering coefficient. We have used metadata of Scientometrics journal, as a case study, to present our workflow. We did a sample run of the proposed workflow to identify prominent authors using centrality analysis. This work is not a bibliometric study of any field rather it presents replicable Python scripts to perform network analysis. With an increase in the popularity of open access and open metadata, we hypothesise that this workflow shall provide an avenue for understanding scientific scholarship in multiple dimensions. |
format | Online Article Text |
id | pubmed-7959659 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-79596592021-04-02 A systematic metadata harvesting workflow for analysing scientific networks Butt, Bilal H. Rafi, Muhammad Sabih, Muhammad PeerJ Comput Sci Digital Libraries One of the disciplines behind the science of science is the study of scientific networks. This work focuses on scientific networks as a social network having different nodes and connections. Nodes can be represented by authors, articles or journals while connections by citation, co-citation or co-authorship. One of the challenges in creating scientific networks is the lack of publicly available comprehensive data set. It limits the variety of analyses on the same set of nodes of different scientific networks. To supplement such analyses we have worked on publicly available citation metadata from Crossref and OpenCitatons. Using this data a workflow is developed to create scientific networks. Analysis of these networks gives insights into academic research and scholarship. Different techniques of social network analysis have been applied in the literature to study these networks. It includes centrality analysis, community detection, and clustering coefficient. We have used metadata of Scientometrics journal, as a case study, to present our workflow. We did a sample run of the proposed workflow to identify prominent authors using centrality analysis. This work is not a bibliometric study of any field rather it presents replicable Python scripts to perform network analysis. With an increase in the popularity of open access and open metadata, we hypothesise that this workflow shall provide an avenue for understanding scientific scholarship in multiple dimensions. PeerJ Inc. 2021-03-10 /pmc/articles/PMC7959659/ /pubmed/33817056 http://dx.doi.org/10.7717/peerj-cs.421 Text en © 2021 Butt et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited. |
spellingShingle | Digital Libraries Butt, Bilal H. Rafi, Muhammad Sabih, Muhammad A systematic metadata harvesting workflow for analysing scientific networks |
title | A systematic metadata harvesting workflow for analysing scientific networks |
title_full | A systematic metadata harvesting workflow for analysing scientific networks |
title_fullStr | A systematic metadata harvesting workflow for analysing scientific networks |
title_full_unstemmed | A systematic metadata harvesting workflow for analysing scientific networks |
title_short | A systematic metadata harvesting workflow for analysing scientific networks |
title_sort | systematic metadata harvesting workflow for analysing scientific networks |
topic | Digital Libraries |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7959659/ https://www.ncbi.nlm.nih.gov/pubmed/33817056 http://dx.doi.org/10.7717/peerj-cs.421 |
work_keys_str_mv | AT buttbilalh asystematicmetadataharvestingworkflowforanalysingscientificnetworks AT rafimuhammad asystematicmetadataharvestingworkflowforanalysingscientificnetworks AT sabihmuhammad asystematicmetadataharvestingworkflowforanalysingscientificnetworks AT buttbilalh systematicmetadataharvestingworkflowforanalysingscientificnetworks AT rafimuhammad systematicmetadataharvestingworkflowforanalysingscientificnetworks AT sabihmuhammad systematicmetadataharvestingworkflowforanalysingscientificnetworks |