Cargando…

Computational Analyses Reveal Fundamental Properties of the Hemophilia Literature in the Last 6 Decades

Hemophilia is an inherited blood coagulation disorder caused by mutations on the coagulation factors VIII or IX genes. Although it is a relatively rare disease, the research community is actively working on this topic, producing almost 6000 manuscripts in the last 5 years. Given that the scientific...

Descripción completa

Detalles Bibliográficos
Autores principales: Lopes, Tiago JS, Rios, Ricardo, Nogueira, Tatiane
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9511296/
https://www.ncbi.nlm.nih.gov/pubmed/36172338
http://dx.doi.org/10.1177/11779322221125604
_version_ 1784797623848271872
author Lopes, Tiago JS
Rios, Ricardo
Nogueira, Tatiane
author_facet Lopes, Tiago JS
Rios, Ricardo
Nogueira, Tatiane
author_sort Lopes, Tiago JS
collection PubMed
description Hemophilia is an inherited blood coagulation disorder caused by mutations on the coagulation factors VIII or IX genes. Although it is a relatively rare disease, the research community is actively working on this topic, producing almost 6000 manuscripts in the last 5 years. Given that the scientific literature is increasing so rapidly, even the most avid reader will find it difficult to follow it closely. In this study, we used sophisticated computational techniques to map the hemophilia literature of the last 60 years. We created a network structure to represent authorship collaborations, where the nodes are the researchers and 2 nodes are connected if they co-authored a manuscript. We accurately identified author clusters, namely, researchers who have collaborated systematically for several years, and used text mining techniques to automatically synthesize their research specialties. Overall, this study serves as a historical appreciation of the effort of thousands of hemophilia researchers and demonstrates that a computational framework is able to automatically identify collaboration networks and their research specialties. Importantly, we made all datasets and source code available for the community, and we anticipate that the methods introduced here will pave the way for the development of systems that generate compelling hypothesis based on patterns that are imperceptible to human researchers.
format Online
Article
Text
id pubmed-9511296
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-95112962022-09-27 Computational Analyses Reveal Fundamental Properties of the Hemophilia Literature in the Last 6 Decades Lopes, Tiago JS Rios, Ricardo Nogueira, Tatiane Bioinform Biol Insights Original Research Article Hemophilia is an inherited blood coagulation disorder caused by mutations on the coagulation factors VIII or IX genes. Although it is a relatively rare disease, the research community is actively working on this topic, producing almost 6000 manuscripts in the last 5 years. Given that the scientific literature is increasing so rapidly, even the most avid reader will find it difficult to follow it closely. In this study, we used sophisticated computational techniques to map the hemophilia literature of the last 60 years. We created a network structure to represent authorship collaborations, where the nodes are the researchers and 2 nodes are connected if they co-authored a manuscript. We accurately identified author clusters, namely, researchers who have collaborated systematically for several years, and used text mining techniques to automatically synthesize their research specialties. Overall, this study serves as a historical appreciation of the effort of thousands of hemophilia researchers and demonstrates that a computational framework is able to automatically identify collaboration networks and their research specialties. Importantly, we made all datasets and source code available for the community, and we anticipate that the methods introduced here will pave the way for the development of systems that generate compelling hypothesis based on patterns that are imperceptible to human researchers. SAGE Publications 2022-09-22 /pmc/articles/PMC9511296/ /pubmed/36172338 http://dx.doi.org/10.1177/11779322221125604 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by-nc/4.0/This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Original Research Article
Lopes, Tiago JS
Rios, Ricardo
Nogueira, Tatiane
Computational Analyses Reveal Fundamental Properties of the Hemophilia Literature in the Last 6 Decades
title Computational Analyses Reveal Fundamental Properties of the Hemophilia Literature in the Last 6 Decades
title_full Computational Analyses Reveal Fundamental Properties of the Hemophilia Literature in the Last 6 Decades
title_fullStr Computational Analyses Reveal Fundamental Properties of the Hemophilia Literature in the Last 6 Decades
title_full_unstemmed Computational Analyses Reveal Fundamental Properties of the Hemophilia Literature in the Last 6 Decades
title_short Computational Analyses Reveal Fundamental Properties of the Hemophilia Literature in the Last 6 Decades
title_sort computational analyses reveal fundamental properties of the hemophilia literature in the last 6 decades
topic Original Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9511296/
https://www.ncbi.nlm.nih.gov/pubmed/36172338
http://dx.doi.org/10.1177/11779322221125604
work_keys_str_mv AT lopestiagojs computationalanalysesrevealfundamentalpropertiesofthehemophilialiteratureinthelast6decades
AT riosricardo computationalanalysesrevealfundamentalpropertiesofthehemophilialiteratureinthelast6decades
AT nogueiratatiane computationalanalysesrevealfundamentalpropertiesofthehemophilialiteratureinthelast6decades