Cargando…

Mining on Alzheimer’s diseases related knowledge graph to identity potential AD-related semantic triples for drug repurposing

BACKGROUND: To date, there are no effective treatments for most neurodegenerative diseases. Knowledge graphs can provide comprehensive and semantic representation for heterogeneous data, and have been successfully leveraged in many biomedical applications including drug repurposing. Our objective is...

Descripción completa

Detalles Bibliográficos
Autores principales: Nian, Yi, Hu, Xinyue, Zhang, Rui, Feng, Jingna, Du, Jingcheng, Li, Fang, Bu, Larry, Zhang, Yuji, Chen, Yong, Tao, Cui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9523633/
https://www.ncbi.nlm.nih.gov/pubmed/36180861
http://dx.doi.org/10.1186/s12859-022-04934-1
_version_ 1784800330868850688
author Nian, Yi
Hu, Xinyue
Zhang, Rui
Feng, Jingna
Du, Jingcheng
Li, Fang
Bu, Larry
Zhang, Yuji
Chen, Yong
Tao, Cui
author_facet Nian, Yi
Hu, Xinyue
Zhang, Rui
Feng, Jingna
Du, Jingcheng
Li, Fang
Bu, Larry
Zhang, Yuji
Chen, Yong
Tao, Cui
author_sort Nian, Yi
collection PubMed
description BACKGROUND: To date, there are no effective treatments for most neurodegenerative diseases. Knowledge graphs can provide comprehensive and semantic representation for heterogeneous data, and have been successfully leveraged in many biomedical applications including drug repurposing. Our objective is to construct a knowledge graph from literature to study the relations between Alzheimer’s disease (AD) and chemicals, drugs and dietary supplements in order to identify opportunities to prevent or delay neurodegenerative progression. We collected biomedical annotations and extracted their relations using SemRep via SemMedDB. We used both a BERT-based classifier and rule-based methods during data preprocessing to exclude noise while preserving most AD-related semantic triples. The 1,672,110 filtered triples were used to train with knowledge graph completion algorithms (i.e., TransE, DistMult, and ComplEx) to predict candidates that might be helpful for AD treatment or prevention. RESULTS: Among three knowledge graph completion models, TransE outperformed the other two (MR = 10.53, Hits@1 = 0.28). We leveraged the time-slicing technique to further evaluate the prediction results. We found supporting evidence for most highly ranked candidates predicted by our model which indicates that our approach can inform reliable new knowledge. CONCLUSION: This paper shows that our graph mining model can predict reliable new relationships between AD and other entities (i.e., dietary supplements, chemicals, and drugs). The knowledge graph constructed can facilitate data-driven knowledge discoveries and the generation of novel hypotheses. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s12859-022-04934-1).
format Online
Article
Text
id pubmed-9523633
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-95236332022-09-30 Mining on Alzheimer’s diseases related knowledge graph to identity potential AD-related semantic triples for drug repurposing Nian, Yi Hu, Xinyue Zhang, Rui Feng, Jingna Du, Jingcheng Li, Fang Bu, Larry Zhang, Yuji Chen, Yong Tao, Cui BMC Bioinformatics Research BACKGROUND: To date, there are no effective treatments for most neurodegenerative diseases. Knowledge graphs can provide comprehensive and semantic representation for heterogeneous data, and have been successfully leveraged in many biomedical applications including drug repurposing. Our objective is to construct a knowledge graph from literature to study the relations between Alzheimer’s disease (AD) and chemicals, drugs and dietary supplements in order to identify opportunities to prevent or delay neurodegenerative progression. We collected biomedical annotations and extracted their relations using SemRep via SemMedDB. We used both a BERT-based classifier and rule-based methods during data preprocessing to exclude noise while preserving most AD-related semantic triples. The 1,672,110 filtered triples were used to train with knowledge graph completion algorithms (i.e., TransE, DistMult, and ComplEx) to predict candidates that might be helpful for AD treatment or prevention. RESULTS: Among three knowledge graph completion models, TransE outperformed the other two (MR = 10.53, Hits@1 = 0.28). We leveraged the time-slicing technique to further evaluate the prediction results. We found supporting evidence for most highly ranked candidates predicted by our model which indicates that our approach can inform reliable new knowledge. CONCLUSION: This paper shows that our graph mining model can predict reliable new relationships between AD and other entities (i.e., dietary supplements, chemicals, and drugs). The knowledge graph constructed can facilitate data-driven knowledge discoveries and the generation of novel hypotheses. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s12859-022-04934-1). BioMed Central 2022-09-30 /pmc/articles/PMC9523633/ /pubmed/36180861 http://dx.doi.org/10.1186/s12859-022-04934-1 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Nian, Yi
Hu, Xinyue
Zhang, Rui
Feng, Jingna
Du, Jingcheng
Li, Fang
Bu, Larry
Zhang, Yuji
Chen, Yong
Tao, Cui
Mining on Alzheimer’s diseases related knowledge graph to identity potential AD-related semantic triples for drug repurposing
title Mining on Alzheimer’s diseases related knowledge graph to identity potential AD-related semantic triples for drug repurposing
title_full Mining on Alzheimer’s diseases related knowledge graph to identity potential AD-related semantic triples for drug repurposing
title_fullStr Mining on Alzheimer’s diseases related knowledge graph to identity potential AD-related semantic triples for drug repurposing
title_full_unstemmed Mining on Alzheimer’s diseases related knowledge graph to identity potential AD-related semantic triples for drug repurposing
title_short Mining on Alzheimer’s diseases related knowledge graph to identity potential AD-related semantic triples for drug repurposing
title_sort mining on alzheimer’s diseases related knowledge graph to identity potential ad-related semantic triples for drug repurposing
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9523633/
https://www.ncbi.nlm.nih.gov/pubmed/36180861
http://dx.doi.org/10.1186/s12859-022-04934-1
work_keys_str_mv AT nianyi miningonalzheimersdiseasesrelatedknowledgegraphtoidentitypotentialadrelatedsemantictriplesfordrugrepurposing
AT huxinyue miningonalzheimersdiseasesrelatedknowledgegraphtoidentitypotentialadrelatedsemantictriplesfordrugrepurposing
AT zhangrui miningonalzheimersdiseasesrelatedknowledgegraphtoidentitypotentialadrelatedsemantictriplesfordrugrepurposing
AT fengjingna miningonalzheimersdiseasesrelatedknowledgegraphtoidentitypotentialadrelatedsemantictriplesfordrugrepurposing
AT dujingcheng miningonalzheimersdiseasesrelatedknowledgegraphtoidentitypotentialadrelatedsemantictriplesfordrugrepurposing
AT lifang miningonalzheimersdiseasesrelatedknowledgegraphtoidentitypotentialadrelatedsemantictriplesfordrugrepurposing
AT bularry miningonalzheimersdiseasesrelatedknowledgegraphtoidentitypotentialadrelatedsemantictriplesfordrugrepurposing
AT zhangyuji miningonalzheimersdiseasesrelatedknowledgegraphtoidentitypotentialadrelatedsemantictriplesfordrugrepurposing
AT chenyong miningonalzheimersdiseasesrelatedknowledgegraphtoidentitypotentialadrelatedsemantictriplesfordrugrepurposing
AT taocui miningonalzheimersdiseasesrelatedknowledgegraphtoidentitypotentialadrelatedsemantictriplesfordrugrepurposing