Cargando…

A novel method of literature mining to identify candidate COVID-19 drugs

SUMMARY: COVID-19 is a serious infectious disease that has recently emerged and continues to spread worldwide. Its spreading rate is too high to expect that new specific drugs will be developed in sufficient time. As an alternative, drugs already developed for other diseases have been tested for use...

Descripción completa

Detalles Bibliográficos
Autores principales: Muramatsu, Tomonari, Tanokura, Masaru
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9710631/
https://www.ncbi.nlm.nih.gov/pubmed/36700092
http://dx.doi.org/10.1093/bioadv/vbab013
_version_ 1784841408048267264
author Muramatsu, Tomonari
Tanokura, Masaru
author_facet Muramatsu, Tomonari
Tanokura, Masaru
author_sort Muramatsu, Tomonari
collection PubMed
description SUMMARY: COVID-19 is a serious infectious disease that has recently emerged and continues to spread worldwide. Its spreading rate is too high to expect that new specific drugs will be developed in sufficient time. As an alternative, drugs already developed for other diseases have been tested for use in the treatment of COVID-19 (drug repositioning). However, to select candidate drugs from a large number of compounds, numerous inhibition assays involving viral infection of cultured cells are required. For efficiency, it would be useful to narrow the list of candidates down using logical considerations prior to performing these assays. We have developed a powerful tool to predict candidate drugs for the treatment of COVID-19 and other diseases. This tool is based on the concatenation of events/substances, each of which is linked to a KEGG (Kyoto Encyclopedia of Genes and Genomes) code based on a relationship obtained from text mining of the vast literature in the PubMed database. By analyzing 21 589 326 records with abstracts from PubMed, 98 556 KEGG codes with NAME/DEFINITION fields were connected. Among them, 9799 KEGG drug codes were connected to COVID-19, of which 7492 codes had no direct connection to COVID-19. Although this report focuses on COVID-19, the program developed here can be applied to other infectious diseases and used to quickly identify drug candidates when new infectious diseases appear in the future. AVAILABILITY AND IMPLEMENTATION: The programs and data underlying this article will be shared on reasonable request to the corresponding authors. CONTACT: atmuramatsu@g.ecc.u-tokyo.ac.jp, amtanok@mail.ecc.u-tokyo.ac.jp SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Advances online.
format Online
Article
Text
id pubmed-9710631
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-97106312023-01-24 A novel method of literature mining to identify candidate COVID-19 drugs Muramatsu, Tomonari Tanokura, Masaru Bioinform Adv Original Article SUMMARY: COVID-19 is a serious infectious disease that has recently emerged and continues to spread worldwide. Its spreading rate is too high to expect that new specific drugs will be developed in sufficient time. As an alternative, drugs already developed for other diseases have been tested for use in the treatment of COVID-19 (drug repositioning). However, to select candidate drugs from a large number of compounds, numerous inhibition assays involving viral infection of cultured cells are required. For efficiency, it would be useful to narrow the list of candidates down using logical considerations prior to performing these assays. We have developed a powerful tool to predict candidate drugs for the treatment of COVID-19 and other diseases. This tool is based on the concatenation of events/substances, each of which is linked to a KEGG (Kyoto Encyclopedia of Genes and Genomes) code based on a relationship obtained from text mining of the vast literature in the PubMed database. By analyzing 21 589 326 records with abstracts from PubMed, 98 556 KEGG codes with NAME/DEFINITION fields were connected. Among them, 9799 KEGG drug codes were connected to COVID-19, of which 7492 codes had no direct connection to COVID-19. Although this report focuses on COVID-19, the program developed here can be applied to other infectious diseases and used to quickly identify drug candidates when new infectious diseases appear in the future. AVAILABILITY AND IMPLEMENTATION: The programs and data underlying this article will be shared on reasonable request to the corresponding authors. CONTACT: atmuramatsu@g.ecc.u-tokyo.ac.jp, amtanok@mail.ecc.u-tokyo.ac.jp SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Advances online. Oxford University Press 2021-07-22 /pmc/articles/PMC9710631/ /pubmed/36700092 http://dx.doi.org/10.1093/bioadv/vbab013 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Muramatsu, Tomonari
Tanokura, Masaru
A novel method of literature mining to identify candidate COVID-19 drugs
title A novel method of literature mining to identify candidate COVID-19 drugs
title_full A novel method of literature mining to identify candidate COVID-19 drugs
title_fullStr A novel method of literature mining to identify candidate COVID-19 drugs
title_full_unstemmed A novel method of literature mining to identify candidate COVID-19 drugs
title_short A novel method of literature mining to identify candidate COVID-19 drugs
title_sort novel method of literature mining to identify candidate covid-19 drugs
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9710631/
https://www.ncbi.nlm.nih.gov/pubmed/36700092
http://dx.doi.org/10.1093/bioadv/vbab013
work_keys_str_mv AT muramatsutomonari anovelmethodofliteratureminingtoidentifycandidatecovid19drugs
AT tanokuramasaru anovelmethodofliteratureminingtoidentifycandidatecovid19drugs
AT muramatsutomonari novelmethodofliteratureminingtoidentifycandidatecovid19drugs
AT tanokuramasaru novelmethodofliteratureminingtoidentifycandidatecovid19drugs