Cargando…

Hierarchical network analysis of co-occurring bioentities in literature

Biomedical databases grow by more than a thousand new publications every day. The large volume of biomedical literature that is being published at an unprecedented rate hinders the discovery of relevant knowledge from keywords of interest to gather new insights and form hypotheses. A text-mining too...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Heejung, Lee, Namgil, Park, Beomjun, Park, Jinyoung, Lee, Jiho, Jang, Hyeon Seok, Yoo, Hojin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9098521/
https://www.ncbi.nlm.nih.gov/pubmed/35550589
http://dx.doi.org/10.1038/s41598-022-12093-9
_version_ 1784706402489466880
author Yang, Heejung
Lee, Namgil
Park, Beomjun
Park, Jinyoung
Lee, Jiho
Jang, Hyeon Seok
Yoo, Hojin
author_facet Yang, Heejung
Lee, Namgil
Park, Beomjun
Park, Jinyoung
Lee, Jiho
Jang, Hyeon Seok
Yoo, Hojin
author_sort Yang, Heejung
collection PubMed
description Biomedical databases grow by more than a thousand new publications every day. The large volume of biomedical literature that is being published at an unprecedented rate hinders the discovery of relevant knowledge from keywords of interest to gather new insights and form hypotheses. A text-mining tool, PubTator, helps to automatically annotate bioentities, such as species, chemicals, genes, and diseases, from PubMed abstracts and full-text articles. However, the manual re-organization and analysis of bioentities is a non-trivial and highly time-consuming task. ChexMix was designed to extract the unique identifiers of bioentities from query results. Herein, ChexMix was used to construct a taxonomic tree with allied species among Korean native plants and to extract the medical subject headings unique identifier of the bioentities, which co-occurred with the keywords in the same literature. ChexMix discovered the allied species related to a keyword of interest and experimentally proved its usefulness for multi-species analysis.
format Online
Article
Text
id pubmed-9098521
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-90985212022-05-14 Hierarchical network analysis of co-occurring bioentities in literature Yang, Heejung Lee, Namgil Park, Beomjun Park, Jinyoung Lee, Jiho Jang, Hyeon Seok Yoo, Hojin Sci Rep Article Biomedical databases grow by more than a thousand new publications every day. The large volume of biomedical literature that is being published at an unprecedented rate hinders the discovery of relevant knowledge from keywords of interest to gather new insights and form hypotheses. A text-mining tool, PubTator, helps to automatically annotate bioentities, such as species, chemicals, genes, and diseases, from PubMed abstracts and full-text articles. However, the manual re-organization and analysis of bioentities is a non-trivial and highly time-consuming task. ChexMix was designed to extract the unique identifiers of bioentities from query results. Herein, ChexMix was used to construct a taxonomic tree with allied species among Korean native plants and to extract the medical subject headings unique identifier of the bioentities, which co-occurred with the keywords in the same literature. ChexMix discovered the allied species related to a keyword of interest and experimentally proved its usefulness for multi-species analysis. Nature Publishing Group UK 2022-05-12 /pmc/articles/PMC9098521/ /pubmed/35550589 http://dx.doi.org/10.1038/s41598-022-12093-9 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Yang, Heejung
Lee, Namgil
Park, Beomjun
Park, Jinyoung
Lee, Jiho
Jang, Hyeon Seok
Yoo, Hojin
Hierarchical network analysis of co-occurring bioentities in literature
title Hierarchical network analysis of co-occurring bioentities in literature
title_full Hierarchical network analysis of co-occurring bioentities in literature
title_fullStr Hierarchical network analysis of co-occurring bioentities in literature
title_full_unstemmed Hierarchical network analysis of co-occurring bioentities in literature
title_short Hierarchical network analysis of co-occurring bioentities in literature
title_sort hierarchical network analysis of co-occurring bioentities in literature
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9098521/
https://www.ncbi.nlm.nih.gov/pubmed/35550589
http://dx.doi.org/10.1038/s41598-022-12093-9
work_keys_str_mv AT yangheejung hierarchicalnetworkanalysisofcooccurringbioentitiesinliterature
AT leenamgil hierarchicalnetworkanalysisofcooccurringbioentitiesinliterature
AT parkbeomjun hierarchicalnetworkanalysisofcooccurringbioentitiesinliterature
AT parkjinyoung hierarchicalnetworkanalysisofcooccurringbioentitiesinliterature
AT leejiho hierarchicalnetworkanalysisofcooccurringbioentitiesinliterature
AT janghyeonseok hierarchicalnetworkanalysisofcooccurringbioentitiesinliterature
AT yoohojin hierarchicalnetworkanalysisofcooccurringbioentitiesinliterature