Cargando…
An automatic hypothesis generation for plausible linkage between xanthium and diabetes
There has been a significant increase in text mining implementation for biomedical literature in recent years. Previous studies introduced the implementation of text mining and literature-based discovery to generate hypotheses of potential candidates for drug development. By conducting a hypothesis-...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9585073/ https://www.ncbi.nlm.nih.gov/pubmed/36266295 http://dx.doi.org/10.1038/s41598-022-20752-0 |
_version_ | 1784813417361571840 |
---|---|
author | Syafiandini, Arida Ferti Song, Gyuri Ahn, Yuri Kim, Heeyoung Song, Min |
author_facet | Syafiandini, Arida Ferti Song, Gyuri Ahn, Yuri Kim, Heeyoung Song, Min |
author_sort | Syafiandini, Arida Ferti |
collection | PubMed |
description | There has been a significant increase in text mining implementation for biomedical literature in recent years. Previous studies introduced the implementation of text mining and literature-based discovery to generate hypotheses of potential candidates for drug development. By conducting a hypothesis-generation step and using evidence from published journal articles or proceedings, previous studies have managed to reduce experimental time and costs. First, we applied the closed discovery approach from Swanson’s ABC model to collect publications related to 36 Xanthium compounds or diabetes. Second, we extracted biomedical entities and relations using a knowledge extraction engine, the Public Knowledge Discovery Engine for Java or PKDE4J. Third, we built a knowledge graph using the obtained bio entities and relations and then generated paths with Xanthium compounds as source nodes and diabetes as the target node. Lastly, we employed graph embeddings to rank each path and evaluated the results based on domain experts’ opinions and literature. Among 36 Xanthium compounds, 35 had direct paths to five diabetes-related nodes. We ranked 2,740,314 paths in total between 35 Xanthium compounds and three diabetes-related phrases: type 1 diabetes, type 2 diabetes, and diabetes mellitus. Based on the top five percentile paths, we concluded that adenosine, choline, beta-sitosterol, rhamnose, and scopoletin were potential candidates for diabetes drug development using natural products. Our framework for hypothesis generation employs a closed discovery from Swanson’s ABC model that has proven very helpful in discovering biological linkages between bio entities. The PKDE4J tools we used to capture bio entities from our document collection could label entities into five categories: genes, compounds, phenotypes, biological processes, and molecular functions. Using the BioPREP model, we managed to interpret the semantic relatedness between two nodes and provided paths containing valuable hypotheses. Lastly, using a graph-embedding algorithm in our path-ranking analysis, we exploited the semantic relatedness while preserving the graph structure properties. |
format | Online Article Text |
id | pubmed-9585073 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-95850732022-10-22 An automatic hypothesis generation for plausible linkage between xanthium and diabetes Syafiandini, Arida Ferti Song, Gyuri Ahn, Yuri Kim, Heeyoung Song, Min Sci Rep Article There has been a significant increase in text mining implementation for biomedical literature in recent years. Previous studies introduced the implementation of text mining and literature-based discovery to generate hypotheses of potential candidates for drug development. By conducting a hypothesis-generation step and using evidence from published journal articles or proceedings, previous studies have managed to reduce experimental time and costs. First, we applied the closed discovery approach from Swanson’s ABC model to collect publications related to 36 Xanthium compounds or diabetes. Second, we extracted biomedical entities and relations using a knowledge extraction engine, the Public Knowledge Discovery Engine for Java or PKDE4J. Third, we built a knowledge graph using the obtained bio entities and relations and then generated paths with Xanthium compounds as source nodes and diabetes as the target node. Lastly, we employed graph embeddings to rank each path and evaluated the results based on domain experts’ opinions and literature. Among 36 Xanthium compounds, 35 had direct paths to five diabetes-related nodes. We ranked 2,740,314 paths in total between 35 Xanthium compounds and three diabetes-related phrases: type 1 diabetes, type 2 diabetes, and diabetes mellitus. Based on the top five percentile paths, we concluded that adenosine, choline, beta-sitosterol, rhamnose, and scopoletin were potential candidates for diabetes drug development using natural products. Our framework for hypothesis generation employs a closed discovery from Swanson’s ABC model that has proven very helpful in discovering biological linkages between bio entities. The PKDE4J tools we used to capture bio entities from our document collection could label entities into five categories: genes, compounds, phenotypes, biological processes, and molecular functions. Using the BioPREP model, we managed to interpret the semantic relatedness between two nodes and provided paths containing valuable hypotheses. Lastly, using a graph-embedding algorithm in our path-ranking analysis, we exploited the semantic relatedness while preserving the graph structure properties. Nature Publishing Group UK 2022-10-20 /pmc/articles/PMC9585073/ /pubmed/36266295 http://dx.doi.org/10.1038/s41598-022-20752-0 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Syafiandini, Arida Ferti Song, Gyuri Ahn, Yuri Kim, Heeyoung Song, Min An automatic hypothesis generation for plausible linkage between xanthium and diabetes |
title | An automatic hypothesis generation for plausible linkage between xanthium and diabetes |
title_full | An automatic hypothesis generation for plausible linkage between xanthium and diabetes |
title_fullStr | An automatic hypothesis generation for plausible linkage between xanthium and diabetes |
title_full_unstemmed | An automatic hypothesis generation for plausible linkage between xanthium and diabetes |
title_short | An automatic hypothesis generation for plausible linkage between xanthium and diabetes |
title_sort | automatic hypothesis generation for plausible linkage between xanthium and diabetes |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9585073/ https://www.ncbi.nlm.nih.gov/pubmed/36266295 http://dx.doi.org/10.1038/s41598-022-20752-0 |
work_keys_str_mv | AT syafiandiniaridaferti anautomatichypothesisgenerationforplausiblelinkagebetweenxanthiumanddiabetes AT songgyuri anautomatichypothesisgenerationforplausiblelinkagebetweenxanthiumanddiabetes AT ahnyuri anautomatichypothesisgenerationforplausiblelinkagebetweenxanthiumanddiabetes AT kimheeyoung anautomatichypothesisgenerationforplausiblelinkagebetweenxanthiumanddiabetes AT songmin anautomatichypothesisgenerationforplausiblelinkagebetweenxanthiumanddiabetes AT syafiandiniaridaferti automatichypothesisgenerationforplausiblelinkagebetweenxanthiumanddiabetes AT songgyuri automatichypothesisgenerationforplausiblelinkagebetweenxanthiumanddiabetes AT ahnyuri automatichypothesisgenerationforplausiblelinkagebetweenxanthiumanddiabetes AT kimheeyoung automatichypothesisgenerationforplausiblelinkagebetweenxanthiumanddiabetes AT songmin automatichypothesisgenerationforplausiblelinkagebetweenxanthiumanddiabetes |