
Cluster-based text mining for extracting drug candidates for the prevention of COVID-19 from the biomedical literature

OBJECTIVE: The coronavirus disease 2019 (COVID-19) health crisis that began at the end of 2019 sent researchers around the world racing to find effective solutions. Related literature exploded and it was inevitable that an automated approach was needed to find useful information, namely text m...


Bibliographic Details
Main Authors:	Supianto, Ahmad Afif; Nurdiansyah, Rizky; Weng, Chia-Wei; Zilvan, Vicky; Yuwana, Raden Sandra; Arisal, Andria; Pardede, Hilman Ferdinandus; Lee, Min-Min; Huang, Chien-Hung; Ng, Ka-Lok
Format:	Online Article Text
Language:	English
Published:	Taibah University 2023
Subjects:	Original Article
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9810500/ https://www.ncbi.nlm.nih.gov/pubmed/36618881 http://dx.doi.org/10.1016/j.jtumed.2022.12.015

collection	PubMed
description	OBJECTIVE: The coronavirus disease 2019 (COVID-19) health crisis that began at the end of 2019 sent researchers around the world racing to find effective solutions. The related literature grew explosively, making an automated approach, namely text mining, necessary for finding useful information to overcome COVID-19, especially for drug candidate discovery. Most text mining methods for finding drug candidates extract bioentity associations from PubMed, but very few of them mine with a clustering approach. The purpose of this study was to demonstrate the effectiveness of our approach in identifying drugs for the prevention of COVID-19 through literature review, cluster analysis, drug docking calculations, and clinical trial data. METHODS: This research was conducted in four main stages. First, the text mining stage used Bidirectional Encoder Representations from Transformers for Biomedical Text Mining (BioBERT) to obtain a vector representation of each word in the sentences of the texts. The second stage generated disease-drug associations from the correlation between disease and drug. Third, the clustering stage grouped these association rules by disease similarity, using Term Frequency-Inverse Document Frequency (TF-IDF) as the feature. Finally, the drug candidate extraction stage drew on the PubChem and DrugBank databases. We further used the docking package AutoDock Vina in the PyRx software to verify the results. RESULTS: Comparative analyses showed that mining with clustering yielded a higher percentage of findings than mining without clustering in all experimental settings. In addition, we suggest that the top three drugs/phytochemicals by drug docking analysis may be effective in preventing COVID-19. CONCLUSIONS: The proposed text mining method with clustering is promising for discovering drug candidates for the prevention of COVID-19 from the biomedical literature.
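The clustering stage described in METHODS (grouping disease-drug associations by disease similarity with TF-IDF features) can be illustrated with a minimal sketch. This is not the authors' implementation: the greedy single-pass grouping, the cosine threshold of 0.3, the smoothed IDF formula, and the names `tfidf_vectors`, `cosine`, `cluster_rules`, and `RULES` are all assumptions made for illustration, and the drug names are placeholders rather than findings of the study.

```python
import math
from collections import Counter

# Hypothetical (disease, drug) association rules; placeholder data only.
RULES = [
    ("viral pneumonia", "drug_a"),
    ("severe viral pneumonia", "drug_b"),
    ("type 2 diabetes", "drug_c"),
    ("type 1 diabetes", "drug_d"),
]


def tfidf_vectors(docs):
    """Return one sparse TF-IDF vector (a dict) per tokenized document."""
    n = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        vectors.append({
            # Relative term frequency times a smoothed inverse document frequency.
            term: (count / len(doc)) * (math.log((1 + n) / (1 + df[term])) + 1)
            for term, count in tf.items()
        })
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def cluster_rules(rules, threshold=0.3):
    """Greedy single-pass clustering of rules by disease-name similarity.

    Each rule joins the first cluster whose seed (first member) is at least
    `threshold` similar to it; otherwise it starts a new cluster.
    """
    docs = [disease.lower().split() for disease, _ in rules]
    vectors = tfidf_vectors(docs)
    clusters = []  # list of (seed_vector, member_rules)
    for rule, vec in zip(rules, vectors):
        for seed, members in clusters:
            if cosine(seed, vec) >= threshold:
                members.append(rule)
                break
        else:
            clusters.append((vec, [rule]))
    return [members for _, members in clusters]


if __name__ == "__main__":
    for i, members in enumerate(cluster_rules(RULES)):
        print(i, members)
```

With these placeholder rules, the two pneumonia associations fall into one group and the two diabetes associations into another, so drugs linked to similar diseases can be reviewed together. A real pipeline would likely use a dedicated clustering algorithm and a far larger vocabulary.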
id	pubmed-9810500
institution	National Center for Biotechnology Information
record_format	MEDLINE/PubMed
journal	J Taibah Univ Med Sci
publication_date	2023-01-04
license	© 2022 [The Author/The Authors]. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).


Similar Items