Cargando…

Drug repurposing for COVID-19 via knowledge graph completion

OBJECTIVE: To discover candidate drugs to repurpose for COVID-19 using literature-derived knowledge and knowledge graph completion methods. METHODS: We propose a novel, integrative, and neural network-based literature-based discovery (LBD) approach to identify drug candidates from PubMed and other C...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Rui, Hristovski, Dimitar, Schutte, Dalton, Kastrin, Andrej, Fiszman, Marcelo, Kilicoglu, Halil
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7869625/
https://www.ncbi.nlm.nih.gov/pubmed/33571675
http://dx.doi.org/10.1016/j.jbi.2021.103696
_version_ 1783648665637748736
author Zhang, Rui
Hristovski, Dimitar
Schutte, Dalton
Kastrin, Andrej
Fiszman, Marcelo
Kilicoglu, Halil
author_facet Zhang, Rui
Hristovski, Dimitar
Schutte, Dalton
Kastrin, Andrej
Fiszman, Marcelo
Kilicoglu, Halil
author_sort Zhang, Rui
collection PubMed
description OBJECTIVE: To discover candidate drugs to repurpose for COVID-19 using literature-derived knowledge and knowledge graph completion methods. METHODS: We propose a novel, integrative, and neural network-based literature-based discovery (LBD) approach to identify drug candidates from PubMed and other COVID-19-focused research literature. Our approach relies on semantic triples extracted using SemRep (via SemMedDB). We identified an informative and accurate subset of semantic triples using filtering rules and an accuracy classifier developed on a BERT variant. We used this subset to construct a knowledge graph, and applied five state-of-the-art, neural knowledge graph completion algorithms (i.e., TransE, RotatE, DistMult, ComplEx, and STELP) to predict drug repurposing candidates. The models were trained and assessed using a time slicing approach and the predicted drugs were compared with a list of drugs reported in the literature and evaluated in clinical trials. These models were complemented by a discovery pattern-based approach. RESULTS: Accuracy classifier based on PubMedBERT achieved the best performance (F(1) = 0.854) in identifying accurate semantic predications. Among five knowledge graph completion models, TransE outperformed others (MR = 0.923, Hits@1 = 0.417). Some known drugs linked to COVID-19 in the literature were identified, as well as others that have not yet been studied. Discovery patterns enabled identification of additional candidate drugs and generation of plausible hypotheses regarding the links between the candidate drugs and COVID-19. Among them, five highly ranked and novel drugs (i.e., paclitaxel, SB 203580, alpha 2-antiplasmin, metoclopramide, and oxymatrine) and the mechanistic explanations for their potential use are further discussed. CONCLUSION: We showed that a LBD approach can be feasible not only for discovering drug candidates for COVID-19, but also for generating mechanistic explanations. Our approach can be generalized to other diseases as well as to other clinical questions. Source code and data are available at https://github.com/kilicogluh/lbd-covid.
format Online
Article
Text
id pubmed-7869625
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Elsevier Inc.
record_format MEDLINE/PubMed
spelling pubmed-78696252021-02-09 Drug repurposing for COVID-19 via knowledge graph completion Zhang, Rui Hristovski, Dimitar Schutte, Dalton Kastrin, Andrej Fiszman, Marcelo Kilicoglu, Halil J Biomed Inform Original Research OBJECTIVE: To discover candidate drugs to repurpose for COVID-19 using literature-derived knowledge and knowledge graph completion methods. METHODS: We propose a novel, integrative, and neural network-based literature-based discovery (LBD) approach to identify drug candidates from PubMed and other COVID-19-focused research literature. Our approach relies on semantic triples extracted using SemRep (via SemMedDB). We identified an informative and accurate subset of semantic triples using filtering rules and an accuracy classifier developed on a BERT variant. We used this subset to construct a knowledge graph, and applied five state-of-the-art, neural knowledge graph completion algorithms (i.e., TransE, RotatE, DistMult, ComplEx, and STELP) to predict drug repurposing candidates. The models were trained and assessed using a time slicing approach and the predicted drugs were compared with a list of drugs reported in the literature and evaluated in clinical trials. These models were complemented by a discovery pattern-based approach. RESULTS: Accuracy classifier based on PubMedBERT achieved the best performance (F(1) = 0.854) in identifying accurate semantic predications. Among five knowledge graph completion models, TransE outperformed others (MR = 0.923, Hits@1 = 0.417). Some known drugs linked to COVID-19 in the literature were identified, as well as others that have not yet been studied. Discovery patterns enabled identification of additional candidate drugs and generation of plausible hypotheses regarding the links between the candidate drugs and COVID-19. Among them, five highly ranked and novel drugs (i.e., paclitaxel, SB 203580, alpha 2-antiplasmin, metoclopramide, and oxymatrine) and the mechanistic explanations for their potential use are further discussed. CONCLUSION: We showed that a LBD approach can be feasible not only for discovering drug candidates for COVID-19, but also for generating mechanistic explanations. Our approach can be generalized to other diseases as well as to other clinical questions. Source code and data are available at https://github.com/kilicogluh/lbd-covid. Elsevier Inc. 2021-03 2021-02-08 /pmc/articles/PMC7869625/ /pubmed/33571675 http://dx.doi.org/10.1016/j.jbi.2021.103696 Text en © 2021 Elsevier Inc. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Original Research
Zhang, Rui
Hristovski, Dimitar
Schutte, Dalton
Kastrin, Andrej
Fiszman, Marcelo
Kilicoglu, Halil
Drug repurposing for COVID-19 via knowledge graph completion
title Drug repurposing for COVID-19 via knowledge graph completion
title_full Drug repurposing for COVID-19 via knowledge graph completion
title_fullStr Drug repurposing for COVID-19 via knowledge graph completion
title_full_unstemmed Drug repurposing for COVID-19 via knowledge graph completion
title_short Drug repurposing for COVID-19 via knowledge graph completion
title_sort drug repurposing for covid-19 via knowledge graph completion
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7869625/
https://www.ncbi.nlm.nih.gov/pubmed/33571675
http://dx.doi.org/10.1016/j.jbi.2021.103696
work_keys_str_mv AT zhangrui drugrepurposingforcovid19viaknowledgegraphcompletion
AT hristovskidimitar drugrepurposingforcovid19viaknowledgegraphcompletion
AT schuttedalton drugrepurposingforcovid19viaknowledgegraphcompletion
AT kastrinandrej drugrepurposingforcovid19viaknowledgegraphcompletion
AT fiszmanmarcelo drugrepurposingforcovid19viaknowledgegraphcompletion
AT kilicogluhalil drugrepurposingforcovid19viaknowledgegraphcompletion