
COVID-19 Knowledge Extractor (COKE): a tool and a web portal to extract drug - target protein associations from the CORD-19 corpus of scientific publications on COVID-19

OBJECTIVE: The COVID-19 pandemic has catalyzed a widespread effort to identify drug candidates and biological targets relevant to SARS-CoV-2 infection, which has resulted in a large number of publications on this subject. We have built the COVID-19 Knowledge Extractor (COKE), a web application to ext...


Bibliographic Details
Main authors: Korn, Daniel; Pervitsky, Vera; Bobrowski, Tesia; Alves, Vinicius M.; Schmitt, Charles; Bizon, Chris; Baker, Nancy; Chirkova, Rada; Cherkasov, Artem; Muratov, Eugene; Tropsha, Alexander
Format: Online Article Text
Language: English
Published: ChemRxiv 2020
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7709174/
https://www.ncbi.nlm.nih.gov/pubmed/33269341
http://dx.doi.org/10.26434/chemrxiv.13289222
_version_ 1783617696037863424
author Korn, Daniel
Pervitsky, Vera
Bobrowski, Tesia
Alves, Vinicius M.
Schmitt, Charles
Bizon, Chris
Baker, Nancy
Chirkova, Rada
Cherkasov, Artem
Muratov, Eugene
Tropsha, Alexander
author_sort Korn, Daniel
collection PubMed
description OBJECTIVE: The COVID-19 pandemic has catalyzed a widespread effort to identify drug candidates and biological targets relevant to SARS-CoV-2 infection, which has resulted in a large number of publications on this subject. We have built the COVID-19 Knowledge Extractor (COKE), a web application to extract, curate, and annotate essential drug-target relationships from the research literature on COVID-19 to assist drug repurposing efforts. MATERIALS AND METHODS: SciBiteAI ontological tagging of the COVID Open Research Dataset (CORD-19), a repository of COVID-19 scientific publications, was employed to identify drug-target relationships. Entity identifiers were resolved through lookup routines using UniProt and DrugBank. A custom algorithm was used to identify co-occurrences of protein and drug terms, and confidence scores were calculated for each entity pair. RESULTS: COKE processing of the current CORD-19 database identified about 3,000 drug-protein pairs, including 29 unique proteins and 500 investigational, experimental, and approved drugs. Some of these drugs are presently undergoing clinical trials for COVID-19. DISCUSSION: The rapidly evolving COVID-19 pandemic has led to a dramatic growth of publications on this subject in a short period. These circumstances call for methods that can condense the literature into the key concepts and relationships necessary for insights into SARS-CoV-2 drug repurposing. CONCLUSION: The COKE repository and web application deliver key drug-target protein relationships to researchers studying SARS-CoV-2. The COKE portal may provide comprehensive and critical information on studies concerning drug repurposing against COVID-19. COKE is freely available at https://coke.mml.unc.edu/ and the code is available at https://github.com/DnlRKorn/CoKE.
format Online
Article
Text
id pubmed-7709174
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher ChemRxiv
record_format MEDLINE/PubMed
spelling pubmed-7709174 2020-12-03 ChemRxiv 2020-11-26 /pmc/articles/PMC7709174/ /pubmed/33269341 http://dx.doi.org/10.26434/chemrxiv.13289222 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/ This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/), which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.
title COVID-19 Knowledge Extractor (COKE): a tool and a web portal to extract drug - target protein associations from the CORD-19 corpus of scientific publications on COVID-19
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7709174/
https://www.ncbi.nlm.nih.gov/pubmed/33269341
http://dx.doi.org/10.26434/chemrxiv.13289222
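
The abstract in this record mentions that co-occurrences of protein and drug terms were identified and that a confidence score was calculated for each pair, but it does not say how. The following is only a minimal sketch of sentence-level co-occurrence counting, not the COKE algorithm: the term lists `DRUGS` and `PROTEINS`, the function name `cooccurrence_scores`, and the scoring rule (the share of a drug's sentences that also mention the protein) are all assumptions made for illustration.

```python
from collections import Counter
from itertools import product
import re

# Hypothetical, hand-made term lists. COKE itself relies on SciBite tagging
# with UniProt and DrugBank lookups, which this sketch does not reproduce.
DRUGS = {"remdesivir", "chloroquine"}
PROTEINS = {"ace2", "rdrp"}


def cooccurrence_scores(documents):
    """Score drug/protein pairs by sentence-level co-occurrence.

    Assumed score: the number of sentences mentioning both terms divided by
    the number of sentences mentioning the drug.
    """
    pair_counts = Counter()
    drug_counts = Counter()
    for text in documents:
        # Naive sentence split on terminal punctuation followed by whitespace.
        for sentence in re.split(r"(?<=[.!?])\s+", text):
            tokens = set(re.findall(r"[a-z0-9]+", sentence.lower()))
            drugs = tokens & DRUGS
            proteins = tokens & PROTEINS
            drug_counts.update(drugs)
            pair_counts.update(product(sorted(drugs), sorted(proteins)))
    return {
        pair: count / drug_counts[pair[0]]
        for pair, count in pair_counts.items()
    }


if __name__ == "__main__":
    docs = [
        "Remdesivir inhibits RdRp. Remdesivir was tested in patients.",
        "Chloroquine may affect ACE2 glycosylation.",
    ]
    for pair, score in sorted(cooccurrence_scores(docs).items()):
        print(pair, round(score, 2))
```

Exact token matching keeps the sketch short; a real pipeline would need synonym resolution and multi-word entity names, which is where the ontological tagging described in the abstract comes in.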