Cargando…
Domain‐topic models with chained dimensions: Charting an emergent domain of a major oncology conference
This paper presents a contribution to the study of bibliographic corpora through science mapping. From a graph representation of documents and their textual dimension, stochastic block models can provide a simultaneous clustering of documents and words that we call a domain‐topic model. Previous wor...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley & Sons, Inc.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9299004/ https://www.ncbi.nlm.nih.gov/pubmed/35873705 http://dx.doi.org/10.1002/asi.24606 |
_version_ | 1784750844713893888 |
---|---|
author | Hannud Abdo, Alexandre Cointet, Jean‐Philippe Bourret, Pascale Cambrosio, Alberto |
author_facet | Hannud Abdo, Alexandre Cointet, Jean‐Philippe Bourret, Pascale Cambrosio, Alberto |
author_sort | Hannud Abdo, Alexandre |
collection | PubMed |
description | This paper presents a contribution to the study of bibliographic corpora through science mapping. From a graph representation of documents and their textual dimension, stochastic block models can provide a simultaneous clustering of documents and words that we call a domain‐topic model. Previous work investigated the resulting topics, or word clusters, while ours focuses on the study of the document clusters we call domains. To enable the description and interactive navigation of domains, we introduce measures and interfaces that consider the structure of the model to relate both types of clusters. We then present a procedure that extends the block model to cluster metadata attributes of documents, which we call a domain‐chained model, noting that our measures and interfaces transpose to metadata clusters. We provide an example application to a corpus relevant to current science, technology and society (STS) research and an interesting case for our approach: the abstracts presented between 1995 and 2017 at the American Society of Clinical Oncology Annual Meeting, the major oncology research conference. Through a sequence of domain‐topic and domain‐chained models, we identify and describe a group of domains that have notably grown through the last decades and which we relate to the establishment of “oncopolicy” as a major concern in oncology. |
format | Online Article Text |
id | pubmed-9299004 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | John Wiley & Sons, Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-92990042022-07-21 Domain‐topic models with chained dimensions: Charting an emergent domain of a major oncology conference Hannud Abdo, Alexandre Cointet, Jean‐Philippe Bourret, Pascale Cambrosio, Alberto J Assoc Inf Sci Technol Research Articles This paper presents a contribution to the study of bibliographic corpora through science mapping. From a graph representation of documents and their textual dimension, stochastic block models can provide a simultaneous clustering of documents and words that we call a domain‐topic model. Previous work investigated the resulting topics, or word clusters, while ours focuses on the study of the document clusters we call domains. To enable the description and interactive navigation of domains, we introduce measures and interfaces that consider the structure of the model to relate both types of clusters. We then present a procedure that extends the block model to cluster metadata attributes of documents, which we call a domain‐chained model, noting that our measures and interfaces transpose to metadata clusters. We provide an example application to a corpus relevant to current science, technology and society (STS) research and an interesting case for our approach: the abstracts presented between 1995 and 2017 at the American Society of Clinical Oncology Annual Meeting, the major oncology research conference. Through a sequence of domain‐topic and domain‐chained models, we identify and describe a group of domains that have notably grown through the last decades and which we relate to the establishment of “oncopolicy” as a major concern in oncology. John Wiley & Sons, Inc. 2021-11-24 2022-07 /pmc/articles/PMC9299004/ /pubmed/35873705 http://dx.doi.org/10.1002/asi.24606 Text en © 2021 The Authors. Journal of the Association for Information Science and Technology published by Wiley Periodicals LLC on behalf of Association for Information Science and Technology. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Articles Hannud Abdo, Alexandre Cointet, Jean‐Philippe Bourret, Pascale Cambrosio, Alberto Domain‐topic models with chained dimensions: Charting an emergent domain of a major oncology conference |
title | Domain‐topic models with chained dimensions: Charting an emergent domain of a major oncology conference |
title_full | Domain‐topic models with chained dimensions: Charting an emergent domain of a major oncology conference |
title_fullStr | Domain‐topic models with chained dimensions: Charting an emergent domain of a major oncology conference |
title_full_unstemmed | Domain‐topic models with chained dimensions: Charting an emergent domain of a major oncology conference |
title_short | Domain‐topic models with chained dimensions: Charting an emergent domain of a major oncology conference |
title_sort | domain‐topic models with chained dimensions: charting an emergent domain of a major oncology conference |
topic | Research Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9299004/ https://www.ncbi.nlm.nih.gov/pubmed/35873705 http://dx.doi.org/10.1002/asi.24606 |
work_keys_str_mv | AT hannudabdoalexandre domaintopicmodelswithchaineddimensionschartinganemergentdomainofamajoroncologyconference AT cointetjeanphilippe domaintopicmodelswithchaineddimensionschartinganemergentdomainofamajoroncologyconference AT bourretpascale domaintopicmodelswithchaineddimensionschartinganemergentdomainofamajoroncologyconference AT cambrosioalberto domaintopicmodelswithchaineddimensionschartinganemergentdomainofamajoroncologyconference |