Cargando…
Expanding Our Understanding of COVID-19 from Biomedical Literature Using Word Embedding
A better understanding of the clinical characteristics of coronavirus disease 2019 (COVID-19) is urgently required to address this health crisis. Numerous researchers and pharmaceutical companies are working on developing vaccines and treatments; however, a clear solution has yet to be found. The cu...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7998313/ https://www.ncbi.nlm.nih.gov/pubmed/33804131 http://dx.doi.org/10.3390/ijerph18063005 |
_version_ | 1783670523577761792 |
---|---|
author | Yang, Heyoung Sohn, Eunsoo |
author_facet | Yang, Heyoung Sohn, Eunsoo |
author_sort | Yang, Heyoung |
collection | PubMed |
description | A better understanding of the clinical characteristics of coronavirus disease 2019 (COVID-19) is urgently required to address this health crisis. Numerous researchers and pharmaceutical companies are working on developing vaccines and treatments; however, a clear solution has yet to be found. The current study proposes the use of artificial intelligence methods to comprehend biomedical knowledge and infer the characteristics of COVID-19. A biomedical knowledge base was established via FastText, a word embedding technique, using PubMed literature from the past decade. Subsequently, a new knowledge base was created using recently published COVID-19 articles. Using this newly constructed knowledge base from the word embedding model, a list of anti-infective drugs and proteins of either human or coronavirus origin were inferred to be related, because they are located close to COVID-19 on the knowledge base. This study attempted to form a method to quickly infer related information about COVID-19 using the existing knowledge base, before sufficient knowledge about COVID-19 is accumulated. With COVID-19 not completely overcome, machine learning-based research in the PubMed literature will provide a broad guideline for researchers and pharmaceutical companies working on treatments for COVID-19. |
format | Online Article Text |
id | pubmed-7998313 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-79983132021-03-28 Expanding Our Understanding of COVID-19 from Biomedical Literature Using Word Embedding Yang, Heyoung Sohn, Eunsoo Int J Environ Res Public Health Article A better understanding of the clinical characteristics of coronavirus disease 2019 (COVID-19) is urgently required to address this health crisis. Numerous researchers and pharmaceutical companies are working on developing vaccines and treatments; however, a clear solution has yet to be found. The current study proposes the use of artificial intelligence methods to comprehend biomedical knowledge and infer the characteristics of COVID-19. A biomedical knowledge base was established via FastText, a word embedding technique, using PubMed literature from the past decade. Subsequently, a new knowledge base was created using recently published COVID-19 articles. Using this newly constructed knowledge base from the word embedding model, a list of anti-infective drugs and proteins of either human or coronavirus origin were inferred to be related, because they are located close to COVID-19 on the knowledge base. This study attempted to form a method to quickly infer related information about COVID-19 using the existing knowledge base, before sufficient knowledge about COVID-19 is accumulated. With COVID-19 not completely overcome, machine learning-based research in the PubMed literature will provide a broad guideline for researchers and pharmaceutical companies working on treatments for COVID-19. MDPI 2021-03-15 /pmc/articles/PMC7998313/ /pubmed/33804131 http://dx.doi.org/10.3390/ijerph18063005 Text en © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Yang, Heyoung Sohn, Eunsoo Expanding Our Understanding of COVID-19 from Biomedical Literature Using Word Embedding |
title | Expanding Our Understanding of COVID-19 from Biomedical Literature Using Word Embedding |
title_full | Expanding Our Understanding of COVID-19 from Biomedical Literature Using Word Embedding |
title_fullStr | Expanding Our Understanding of COVID-19 from Biomedical Literature Using Word Embedding |
title_full_unstemmed | Expanding Our Understanding of COVID-19 from Biomedical Literature Using Word Embedding |
title_short | Expanding Our Understanding of COVID-19 from Biomedical Literature Using Word Embedding |
title_sort | expanding our understanding of covid-19 from biomedical literature using word embedding |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7998313/ https://www.ncbi.nlm.nih.gov/pubmed/33804131 http://dx.doi.org/10.3390/ijerph18063005 |
work_keys_str_mv | AT yangheyoung expandingourunderstandingofcovid19frombiomedicalliteratureusingwordembedding AT sohneunsoo expandingourunderstandingofcovid19frombiomedicalliteratureusingwordembedding |