Cargando…
Drug repurposing for COVID-19 via knowledge graph completion
OBJECTIVE: To discover candidate drugs to repurpose for COVID-19 using literature-derived knowledge and knowledge graph completion methods. METHODS: We propose a novel, integrative, and neural network-based literature-based discovery (LBD) approach to identify drug candidates from PubMed and other C...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier Inc.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7869625/ https://www.ncbi.nlm.nih.gov/pubmed/33571675 http://dx.doi.org/10.1016/j.jbi.2021.103696 |
_version_ | 1783648665637748736 |
---|---|
author | Zhang, Rui Hristovski, Dimitar Schutte, Dalton Kastrin, Andrej Fiszman, Marcelo Kilicoglu, Halil |
author_facet | Zhang, Rui Hristovski, Dimitar Schutte, Dalton Kastrin, Andrej Fiszman, Marcelo Kilicoglu, Halil |
author_sort | Zhang, Rui |
collection | PubMed |
description | OBJECTIVE: To discover candidate drugs to repurpose for COVID-19 using literature-derived knowledge and knowledge graph completion methods. METHODS: We propose a novel, integrative, and neural network-based literature-based discovery (LBD) approach to identify drug candidates from PubMed and other COVID-19-focused research literature. Our approach relies on semantic triples extracted using SemRep (via SemMedDB). We identified an informative and accurate subset of semantic triples using filtering rules and an accuracy classifier developed on a BERT variant. We used this subset to construct a knowledge graph, and applied five state-of-the-art, neural knowledge graph completion algorithms (i.e., TransE, RotatE, DistMult, ComplEx, and STELP) to predict drug repurposing candidates. The models were trained and assessed using a time slicing approach and the predicted drugs were compared with a list of drugs reported in the literature and evaluated in clinical trials. These models were complemented by a discovery pattern-based approach. RESULTS: Accuracy classifier based on PubMedBERT achieved the best performance (F(1) = 0.854) in identifying accurate semantic predications. Among five knowledge graph completion models, TransE outperformed others (MR = 0.923, Hits@1 = 0.417). Some known drugs linked to COVID-19 in the literature were identified, as well as others that have not yet been studied. Discovery patterns enabled identification of additional candidate drugs and generation of plausible hypotheses regarding the links between the candidate drugs and COVID-19. Among them, five highly ranked and novel drugs (i.e., paclitaxel, SB 203580, alpha 2-antiplasmin, metoclopramide, and oxymatrine) and the mechanistic explanations for their potential use are further discussed. CONCLUSION: We showed that a LBD approach can be feasible not only for discovering drug candidates for COVID-19, but also for generating mechanistic explanations. Our approach can be generalized to other diseases as well as to other clinical questions. Source code and data are available at https://github.com/kilicogluh/lbd-covid. |
format | Online Article Text |
id | pubmed-7869625 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Elsevier Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-78696252021-02-09 Drug repurposing for COVID-19 via knowledge graph completion Zhang, Rui Hristovski, Dimitar Schutte, Dalton Kastrin, Andrej Fiszman, Marcelo Kilicoglu, Halil J Biomed Inform Original Research OBJECTIVE: To discover candidate drugs to repurpose for COVID-19 using literature-derived knowledge and knowledge graph completion methods. METHODS: We propose a novel, integrative, and neural network-based literature-based discovery (LBD) approach to identify drug candidates from PubMed and other COVID-19-focused research literature. Our approach relies on semantic triples extracted using SemRep (via SemMedDB). We identified an informative and accurate subset of semantic triples using filtering rules and an accuracy classifier developed on a BERT variant. We used this subset to construct a knowledge graph, and applied five state-of-the-art, neural knowledge graph completion algorithms (i.e., TransE, RotatE, DistMult, ComplEx, and STELP) to predict drug repurposing candidates. The models were trained and assessed using a time slicing approach and the predicted drugs were compared with a list of drugs reported in the literature and evaluated in clinical trials. These models were complemented by a discovery pattern-based approach. RESULTS: Accuracy classifier based on PubMedBERT achieved the best performance (F(1) = 0.854) in identifying accurate semantic predications. Among five knowledge graph completion models, TransE outperformed others (MR = 0.923, Hits@1 = 0.417). Some known drugs linked to COVID-19 in the literature were identified, as well as others that have not yet been studied. Discovery patterns enabled identification of additional candidate drugs and generation of plausible hypotheses regarding the links between the candidate drugs and COVID-19. Among them, five highly ranked and novel drugs (i.e., paclitaxel, SB 203580, alpha 2-antiplasmin, metoclopramide, and oxymatrine) and the mechanistic explanations for their potential use are further discussed. CONCLUSION: We showed that a LBD approach can be feasible not only for discovering drug candidates for COVID-19, but also for generating mechanistic explanations. Our approach can be generalized to other diseases as well as to other clinical questions. Source code and data are available at https://github.com/kilicogluh/lbd-covid. Elsevier Inc. 2021-03 2021-02-08 /pmc/articles/PMC7869625/ /pubmed/33571675 http://dx.doi.org/10.1016/j.jbi.2021.103696 Text en © 2021 Elsevier Inc. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Original Research Zhang, Rui Hristovski, Dimitar Schutte, Dalton Kastrin, Andrej Fiszman, Marcelo Kilicoglu, Halil Drug repurposing for COVID-19 via knowledge graph completion |
title | Drug repurposing for COVID-19 via knowledge graph completion |
title_full | Drug repurposing for COVID-19 via knowledge graph completion |
title_fullStr | Drug repurposing for COVID-19 via knowledge graph completion |
title_full_unstemmed | Drug repurposing for COVID-19 via knowledge graph completion |
title_short | Drug repurposing for COVID-19 via knowledge graph completion |
title_sort | drug repurposing for covid-19 via knowledge graph completion |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7869625/ https://www.ncbi.nlm.nih.gov/pubmed/33571675 http://dx.doi.org/10.1016/j.jbi.2021.103696 |
work_keys_str_mv | AT zhangrui drugrepurposingforcovid19viaknowledgegraphcompletion AT hristovskidimitar drugrepurposingforcovid19viaknowledgegraphcompletion AT schuttedalton drugrepurposingforcovid19viaknowledgegraphcompletion AT kastrinandrej drugrepurposingforcovid19viaknowledgegraphcompletion AT fiszmanmarcelo drugrepurposingforcovid19viaknowledgegraphcompletion AT kilicogluhalil drugrepurposingforcovid19viaknowledgegraphcompletion |