Cargando…

An automatic hypothesis generation for plausible linkage between xanthium and diabetes

There has been a significant increase in text mining implementation for biomedical literature in recent years. Previous studies introduced the implementation of text mining and literature-based discovery to generate hypotheses of potential candidates for drug development. By conducting a hypothesis-...

Descripción completa

Detalles Bibliográficos
Autores principales: Syafiandini, Arida Ferti, Song, Gyuri, Ahn, Yuri, Kim, Heeyoung, Song, Min
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9585073/
https://www.ncbi.nlm.nih.gov/pubmed/36266295
http://dx.doi.org/10.1038/s41598-022-20752-0
_version_ 1784813417361571840
author Syafiandini, Arida Ferti
Song, Gyuri
Ahn, Yuri
Kim, Heeyoung
Song, Min
author_facet Syafiandini, Arida Ferti
Song, Gyuri
Ahn, Yuri
Kim, Heeyoung
Song, Min
author_sort Syafiandini, Arida Ferti
collection PubMed
description There has been a significant increase in text mining implementation for biomedical literature in recent years. Previous studies introduced the implementation of text mining and literature-based discovery to generate hypotheses of potential candidates for drug development. By conducting a hypothesis-generation step and using evidence from published journal articles or proceedings, previous studies have managed to reduce experimental time and costs. First, we applied the closed discovery approach from Swanson’s ABC model to collect publications related to 36 Xanthium compounds or diabetes. Second, we extracted biomedical entities and relations using a knowledge extraction engine, the Public Knowledge Discovery Engine for Java or PKDE4J. Third, we built a knowledge graph using the obtained bio entities and relations and then generated paths with Xanthium compounds as source nodes and diabetes as the target node. Lastly, we employed graph embeddings to rank each path and evaluated the results based on domain experts’ opinions and literature. Among 36 Xanthium compounds, 35 had direct paths to five diabetes-related nodes. We ranked 2,740,314 paths in total between 35 Xanthium compounds and three diabetes-related phrases: type 1 diabetes, type 2 diabetes, and diabetes mellitus. Based on the top five percentile paths, we concluded that adenosine, choline, beta-sitosterol, rhamnose, and scopoletin were potential candidates for diabetes drug development using natural products. Our framework for hypothesis generation employs a closed discovery from Swanson’s ABC model that has proven very helpful in discovering biological linkages between bio entities. The PKDE4J tools we used to capture bio entities from our document collection could label entities into five categories: genes, compounds, phenotypes, biological processes, and molecular functions. Using the BioPREP model, we managed to interpret the semantic relatedness between two nodes and provided paths containing valuable hypotheses. Lastly, using a graph-embedding algorithm in our path-ranking analysis, we exploited the semantic relatedness while preserving the graph structure properties.
format Online
Article
Text
id pubmed-9585073
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-95850732022-10-22 An automatic hypothesis generation for plausible linkage between xanthium and diabetes Syafiandini, Arida Ferti Song, Gyuri Ahn, Yuri Kim, Heeyoung Song, Min Sci Rep Article There has been a significant increase in text mining implementation for biomedical literature in recent years. Previous studies introduced the implementation of text mining and literature-based discovery to generate hypotheses of potential candidates for drug development. By conducting a hypothesis-generation step and using evidence from published journal articles or proceedings, previous studies have managed to reduce experimental time and costs. First, we applied the closed discovery approach from Swanson’s ABC model to collect publications related to 36 Xanthium compounds or diabetes. Second, we extracted biomedical entities and relations using a knowledge extraction engine, the Public Knowledge Discovery Engine for Java or PKDE4J. Third, we built a knowledge graph using the obtained bio entities and relations and then generated paths with Xanthium compounds as source nodes and diabetes as the target node. Lastly, we employed graph embeddings to rank each path and evaluated the results based on domain experts’ opinions and literature. Among 36 Xanthium compounds, 35 had direct paths to five diabetes-related nodes. We ranked 2,740,314 paths in total between 35 Xanthium compounds and three diabetes-related phrases: type 1 diabetes, type 2 diabetes, and diabetes mellitus. Based on the top five percentile paths, we concluded that adenosine, choline, beta-sitosterol, rhamnose, and scopoletin were potential candidates for diabetes drug development using natural products. Our framework for hypothesis generation employs a closed discovery from Swanson’s ABC model that has proven very helpful in discovering biological linkages between bio entities. The PKDE4J tools we used to capture bio entities from our document collection could label entities into five categories: genes, compounds, phenotypes, biological processes, and molecular functions. Using the BioPREP model, we managed to interpret the semantic relatedness between two nodes and provided paths containing valuable hypotheses. Lastly, using a graph-embedding algorithm in our path-ranking analysis, we exploited the semantic relatedness while preserving the graph structure properties. Nature Publishing Group UK 2022-10-20 /pmc/articles/PMC9585073/ /pubmed/36266295 http://dx.doi.org/10.1038/s41598-022-20752-0 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Syafiandini, Arida Ferti
Song, Gyuri
Ahn, Yuri
Kim, Heeyoung
Song, Min
An automatic hypothesis generation for plausible linkage between xanthium and diabetes
title An automatic hypothesis generation for plausible linkage between xanthium and diabetes
title_full An automatic hypothesis generation for plausible linkage between xanthium and diabetes
title_fullStr An automatic hypothesis generation for plausible linkage between xanthium and diabetes
title_full_unstemmed An automatic hypothesis generation for plausible linkage between xanthium and diabetes
title_short An automatic hypothesis generation for plausible linkage between xanthium and diabetes
title_sort automatic hypothesis generation for plausible linkage between xanthium and diabetes
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9585073/
https://www.ncbi.nlm.nih.gov/pubmed/36266295
http://dx.doi.org/10.1038/s41598-022-20752-0
work_keys_str_mv AT syafiandiniaridaferti anautomatichypothesisgenerationforplausiblelinkagebetweenxanthiumanddiabetes
AT songgyuri anautomatichypothesisgenerationforplausiblelinkagebetweenxanthiumanddiabetes
AT ahnyuri anautomatichypothesisgenerationforplausiblelinkagebetweenxanthiumanddiabetes
AT kimheeyoung anautomatichypothesisgenerationforplausiblelinkagebetweenxanthiumanddiabetes
AT songmin anautomatichypothesisgenerationforplausiblelinkagebetweenxanthiumanddiabetes
AT syafiandiniaridaferti automatichypothesisgenerationforplausiblelinkagebetweenxanthiumanddiabetes
AT songgyuri automatichypothesisgenerationforplausiblelinkagebetweenxanthiumanddiabetes
AT ahnyuri automatichypothesisgenerationforplausiblelinkagebetweenxanthiumanddiabetes
AT kimheeyoung automatichypothesisgenerationforplausiblelinkagebetweenxanthiumanddiabetes
AT songmin automatichypothesisgenerationforplausiblelinkagebetweenxanthiumanddiabetes