Cargando…
Extraction of knowledge graph of Covid-19 through mining of unstructured biomedical corpora
The number of biomedical articles published is increasing rapidly over the years. Currently there are about 30 million articles in PubMed and over 25 million mentions in Medline. Among these fundamentals, Biomedical Named Entity Recognition (BioNER) and Biomedical Relation Extraction (BioRE) are the...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier Ltd.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9807269/ https://www.ncbi.nlm.nih.gov/pubmed/36621289 http://dx.doi.org/10.1016/j.compbiolchem.2022.107808 |
_version_ | 1784862685021601792 |
---|---|
author | Gajendran, Sudhakaran Manjula, D. Sugumaran, Vijayan Hema, R. |
author_facet | Gajendran, Sudhakaran Manjula, D. Sugumaran, Vijayan Hema, R. |
author_sort | Gajendran, Sudhakaran |
collection | PubMed |
description | The number of biomedical articles published is increasing rapidly over the years. Currently there are about 30 million articles in PubMed and over 25 million mentions in Medline. Among these fundamentals, Biomedical Named Entity Recognition (BioNER) and Biomedical Relation Extraction (BioRE) are the most essential in analysing the literature. In the biomedical domain, Knowledge Graph is used to visualize the relationships between various entities such as proteins, chemicals and diseases. Scientific publications have increased dramatically as a result of the search for treatments and potential cures for the new Coronavirus, but efficiently analysing, integrating, and utilising related sources of information remains a difficulty. In order to effectively combat the disease during pandemics like COVID-19, literature must be used quickly and effectively. In this paper, we introduced a fully automated framework consists of BERT-BiLSTM, Knowledge graph, and Representation Learning model to extract the top diseases, chemicals, and proteins related to COVID-19 from the literature. The proposed framework uses Named Entity Recognition models for disease recognition, chemical recognition, and protein recognition. Then the system uses the Chemical - Disease Relation Extraction and Chemical - Protein Relation Extraction models. And the system extracts the entities and relations from the CORD-19 dataset using the models. The system then creates a Knowledge Graph for the extracted relations and entities. The system performs Representation Learning on this KG to get the embeddings of all entities and get the top related diseases, chemicals, and proteins with respect to COVID-19. |
format | Online Article Text |
id | pubmed-9807269 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Elsevier Ltd. |
record_format | MEDLINE/PubMed |
spelling | pubmed-98072692023-01-04 Extraction of knowledge graph of Covid-19 through mining of unstructured biomedical corpora Gajendran, Sudhakaran Manjula, D. Sugumaran, Vijayan Hema, R. Comput Biol Chem Article The number of biomedical articles published is increasing rapidly over the years. Currently there are about 30 million articles in PubMed and over 25 million mentions in Medline. Among these fundamentals, Biomedical Named Entity Recognition (BioNER) and Biomedical Relation Extraction (BioRE) are the most essential in analysing the literature. In the biomedical domain, Knowledge Graph is used to visualize the relationships between various entities such as proteins, chemicals and diseases. Scientific publications have increased dramatically as a result of the search for treatments and potential cures for the new Coronavirus, but efficiently analysing, integrating, and utilising related sources of information remains a difficulty. In order to effectively combat the disease during pandemics like COVID-19, literature must be used quickly and effectively. In this paper, we introduced a fully automated framework consists of BERT-BiLSTM, Knowledge graph, and Representation Learning model to extract the top diseases, chemicals, and proteins related to COVID-19 from the literature. The proposed framework uses Named Entity Recognition models for disease recognition, chemical recognition, and protein recognition. Then the system uses the Chemical - Disease Relation Extraction and Chemical - Protein Relation Extraction models. And the system extracts the entities and relations from the CORD-19 dataset using the models. The system then creates a Knowledge Graph for the extracted relations and entities. The system performs Representation Learning on this KG to get the embeddings of all entities and get the top related diseases, chemicals, and proteins with respect to COVID-19. Elsevier Ltd. 2023-02 2023-01-02 /pmc/articles/PMC9807269/ /pubmed/36621289 http://dx.doi.org/10.1016/j.compbiolchem.2022.107808 Text en © 2022 Elsevier Ltd. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Article Gajendran, Sudhakaran Manjula, D. Sugumaran, Vijayan Hema, R. Extraction of knowledge graph of Covid-19 through mining of unstructured biomedical corpora |
title | Extraction of knowledge graph of Covid-19 through mining of unstructured biomedical corpora |
title_full | Extraction of knowledge graph of Covid-19 through mining of unstructured biomedical corpora |
title_fullStr | Extraction of knowledge graph of Covid-19 through mining of unstructured biomedical corpora |
title_full_unstemmed | Extraction of knowledge graph of Covid-19 through mining of unstructured biomedical corpora |
title_short | Extraction of knowledge graph of Covid-19 through mining of unstructured biomedical corpora |
title_sort | extraction of knowledge graph of covid-19 through mining of unstructured biomedical corpora |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9807269/ https://www.ncbi.nlm.nih.gov/pubmed/36621289 http://dx.doi.org/10.1016/j.compbiolchem.2022.107808 |
work_keys_str_mv | AT gajendransudhakaran extractionofknowledgegraphofcovid19throughminingofunstructuredbiomedicalcorpora AT manjulad extractionofknowledgegraphofcovid19throughminingofunstructuredbiomedicalcorpora AT sugumaranvijayan extractionofknowledgegraphofcovid19throughminingofunstructuredbiomedicalcorpora AT hemar extractionofknowledgegraphofcovid19throughminingofunstructuredbiomedicalcorpora |