Identifying Overlapping and Hierarchical Thematic Structures in Networks of Scholarly Papers: A Comparison of Three Approaches

The aim of this paper is to introduce and assess three algorithms for the identification of overlapping thematic structures in networks of papers. We implemented three recently proposed approaches to the identification of overlapping and hierarchical substructures in graphs and applied the correspon...

Bibliographic Details

Main authors: Havemann, Frank; Gläser, Jochen; Heinz, Michael; Struck, Alexander
Format: Online Article Text
Language: English
Published: Public Library of Science 2012
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3314014/
https://www.ncbi.nlm.nih.gov/pubmed/22479376
http://dx.doi.org/10.1371/journal.pone.0033255
_version_ 1782228073720053760
author Havemann, Frank
Gläser, Jochen
Heinz, Michael
Struck, Alexander
author_sort Havemann, Frank
collection PubMed
description The aim of this paper is to introduce and assess three algorithms for the identification of overlapping thematic structures in networks of papers. We implemented three recently proposed approaches to the identification of overlapping and hierarchical substructures in graphs and applied the corresponding algorithms to a network of 492 information-science papers coupled via their cited sources. The thematic substructures obtained and overlaps produced by the three hierarchical cluster algorithms were compared to a content-based categorisation, which we based on the interpretation of titles, abstracts, and keywords. We defined sets of papers dealing with three topics located on different levels of aggregation: h-index, webometrics, and bibliometrics. We identified these topics with branches in the dendrograms produced by the three cluster algorithms and compared the overlapping topics they detected with one another and with the three predefined paper sets. We discuss the advantages and drawbacks of applying the three approaches to paper networks in research fields.
format Online
Article
Text
id pubmed-3314014
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-3314014 2012-04-04 Identifying Overlapping and Hierarchical Thematic Structures in Networks of Scholarly Papers: A Comparison of Three Approaches. Havemann, Frank; Gläser, Jochen; Heinz, Michael; Struck, Alexander. PLoS One. Research Article. Public Library of Science 2012-03-27 /pmc/articles/PMC3314014/ /pubmed/22479376 http://dx.doi.org/10.1371/journal.pone.0033255 Text en Havemann et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
title Identifying Overlapping and Hierarchical Thematic Structures in Networks of Scholarly Papers: A Comparison of Three Approaches
topic Research Article