Cargando…

Expanding Our Understanding of COVID-19 from Biomedical Literature Using Word Embedding

A better understanding of the clinical characteristics of coronavirus disease 2019 (COVID-19) is urgently required to address this health crisis. Numerous researchers and pharmaceutical companies are working on developing vaccines and treatments; however, a clear solution has yet to be found. The cu...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Heyoung, Sohn, Eunsoo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7998313/
https://www.ncbi.nlm.nih.gov/pubmed/33804131
http://dx.doi.org/10.3390/ijerph18063005
_version_ 1783670523577761792
author Yang, Heyoung
Sohn, Eunsoo
author_facet Yang, Heyoung
Sohn, Eunsoo
author_sort Yang, Heyoung
collection PubMed
description A better understanding of the clinical characteristics of coronavirus disease 2019 (COVID-19) is urgently required to address this health crisis. Numerous researchers and pharmaceutical companies are working on developing vaccines and treatments; however, a clear solution has yet to be found. The current study proposes the use of artificial intelligence methods to comprehend biomedical knowledge and infer the characteristics of COVID-19. A biomedical knowledge base was established via FastText, a word embedding technique, using PubMed literature from the past decade. Subsequently, a new knowledge base was created using recently published COVID-19 articles. Using this newly constructed knowledge base from the word embedding model, a list of anti-infective drugs and proteins of either human or coronavirus origin were inferred to be related, because they are located close to COVID-19 on the knowledge base. This study attempted to form a method to quickly infer related information about COVID-19 using the existing knowledge base, before sufficient knowledge about COVID-19 is accumulated. With COVID-19 not completely overcome, machine learning-based research in the PubMed literature will provide a broad guideline for researchers and pharmaceutical companies working on treatments for COVID-19.
format Online
Article
Text
id pubmed-7998313
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-79983132021-03-28 Expanding Our Understanding of COVID-19 from Biomedical Literature Using Word Embedding Yang, Heyoung Sohn, Eunsoo Int J Environ Res Public Health Article A better understanding of the clinical characteristics of coronavirus disease 2019 (COVID-19) is urgently required to address this health crisis. Numerous researchers and pharmaceutical companies are working on developing vaccines and treatments; however, a clear solution has yet to be found. The current study proposes the use of artificial intelligence methods to comprehend biomedical knowledge and infer the characteristics of COVID-19. A biomedical knowledge base was established via FastText, a word embedding technique, using PubMed literature from the past decade. Subsequently, a new knowledge base was created using recently published COVID-19 articles. Using this newly constructed knowledge base from the word embedding model, a list of anti-infective drugs and proteins of either human or coronavirus origin were inferred to be related, because they are located close to COVID-19 on the knowledge base. This study attempted to form a method to quickly infer related information about COVID-19 using the existing knowledge base, before sufficient knowledge about COVID-19 is accumulated. With COVID-19 not completely overcome, machine learning-based research in the PubMed literature will provide a broad guideline for researchers and pharmaceutical companies working on treatments for COVID-19. MDPI 2021-03-15 /pmc/articles/PMC7998313/ /pubmed/33804131 http://dx.doi.org/10.3390/ijerph18063005 Text en © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Yang, Heyoung
Sohn, Eunsoo
Expanding Our Understanding of COVID-19 from Biomedical Literature Using Word Embedding
title Expanding Our Understanding of COVID-19 from Biomedical Literature Using Word Embedding
title_full Expanding Our Understanding of COVID-19 from Biomedical Literature Using Word Embedding
title_fullStr Expanding Our Understanding of COVID-19 from Biomedical Literature Using Word Embedding
title_full_unstemmed Expanding Our Understanding of COVID-19 from Biomedical Literature Using Word Embedding
title_short Expanding Our Understanding of COVID-19 from Biomedical Literature Using Word Embedding
title_sort expanding our understanding of covid-19 from biomedical literature using word embedding
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7998313/
https://www.ncbi.nlm.nih.gov/pubmed/33804131
http://dx.doi.org/10.3390/ijerph18063005
work_keys_str_mv AT yangheyoung expandingourunderstandingofcovid19frombiomedicalliteratureusingwordembedding
AT sohneunsoo expandingourunderstandingofcovid19frombiomedicalliteratureusingwordembedding